Android-java target batching is very slow #1051

RblSb · 2019-05-21T19:50:49Z

Because of vertex array pushing every frame. I try to optimize Float32Array implementation in #1041 replacing FloatBuffer to NativeArray<Single>, but it is still bad, so there is two ways:

Use sun.misc.Unsafe with reflection to put floats in FloatBuffer (LWJGL main way)
Use JNI for same reason (Libgdx way)

Also, maybe haxe jvm supports jvm-pasting and we can do something on jvm for that.
Some articles:
https://github.com/LWJGL/lwjgl3-wiki/wiki/1.3.-Memory-FAQ
https://www.badlogicgames.com/wordpress/?p=904

Not sure if Unsafe class available on all androids (with reflection), but it's still requres some code extraction from LWJGL/MemoryStack.java and MemoryUtil.java.
JNI should be easier to implement (if it has way to push floats in FloatBuffer), but it's requres some c/cpp compiler integration. Also not sure about Java native method call overhead.

The text was updated successfully, but these errors were encountered:

RobDangerous · 2019-05-21T19:53:36Z

Sorry but putting a large array on the stack makes no sense, at best it does not result in a difference, at worst - stack overflow. Allocating things from the stack is faster than from the heap but there's no speed difference in using things from the stack or the heap.

RblSb · 2019-05-21T19:58:26Z

Okay, so the problem is with putting floats in FloatBuffer, FloatBuffer.put methods is just too bad. It is still requres Unsafe or JNI to make it fast. Will update issue.

RblSb · 2019-09-12T23:41:47Z

I checked JNI approach with direct bytebuffer and memcpy, but its obviously bad because of jni overhead every get/set call. Also i prepare vertices to have backed colors/texcords and every drawImage only changes rect cords, but this only helps to get 1.5x speedup, so there is other hotspots in rendering and i'm too lazy to install emulator for profiling, doesn't make sense anyway. Still interested how libgdx java backend works with bunnymark, if they have same immediate mode rendering (seems almost impossible in java for me now).
Fun fact: with ndk setup gradle build slowdowns from 5 to 30s with one C file.

Spoiler with silly things

#include "test.h"
#include <stdlib.h>
#include <string.h>

JNIEXPORT
void JNICALL Java_jni_Test_copy(JNIEnv *env, jclass clazz, jobject dst, jfloatArray src, jint offset, jint len) {
	unsigned char* pDst = (unsigned char*)(*env)->GetDirectBufferAddress(env, dst);
	float* pSrc = (float*)(*env)->GetPrimitiveArrayCritical(env, src, 0);
	memcpy(pDst, pSrc + offset, len * 4);
	(*env)->ReleasePrimitiveArrayCritical(env, src, pSrc, 0);
}

JNIEXPORT
void JNICALL Java_jni_Test_set(JNIEnv *env, jclass clazz, jobject dst, jint index, jfloat value) {
	unsigned char* pDst = (unsigned char*)(*env)->GetDirectBufferAddress(env, dst);
	memcpy(pDst + index, &value, 4);
}

JNIEXPORT
jfloat JNICALL Java_jni_Test_get(JNIEnv *env, jclass clazz, jobject dst, jint index) {
	int *iBuf = (*env)->GetDirectBufferAddress(env, dst);
	float value;
	memcpy(&value, iBuf + index, 4);
	return value;
}

package jni;

@:classCode('
	static {
		System.loadLibrary("kore");
	}
')
class Test {
	@:native public static function copy(
		to: java.nio.ByteBuffer, from: java.NativeArray<Single>,
		offset: Int, size: Int
	): Void;
	@:native public static function get(to: java.nio.FloatBuffer, i: Int): Single;
	@:native public static function set(to: java.nio.FloatBuffer, i: Int, value: Single): Void;
}

build.gradle requres only two externalNativeBuild blocks and optional ndk {abiFilters 'armeabi-v7a'}, CMakeLists need only add_library(kore SHARED "path/files.c" "path/files.h")

RobDangerous · 2019-11-11T19:07:28Z

Looks like this is a won't-fix.

RobDangerous closed this as completed Nov 11, 2019

RblSb mentioned this issue Dec 27, 2019

Java Backend Outdated? #1162

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Android-java target batching is very slow #1051

Android-java target batching is very slow #1051

RblSb commented May 21, 2019 •

edited

Loading

RobDangerous commented May 21, 2019

RblSb commented May 21, 2019

RblSb commented Sep 12, 2019

RobDangerous commented Nov 11, 2019

Android-java target batching is very slow #1051

Android-java target batching is very slow #1051

Comments

RblSb commented May 21, 2019 • edited Loading

RobDangerous commented May 21, 2019

RblSb commented May 21, 2019

RblSb commented Sep 12, 2019

RobDangerous commented Nov 11, 2019

RblSb commented May 21, 2019 •

edited

Loading