Optimization for SpriteBatch when running non VertexArray VertexDataModes. (GL30 default) #7346

Tom-Ski · 2024-02-21T20:59:42Z

In SpriteBatch's flush, the index buffer's limit is set to match the current size of the batch. Doing so fetches the indices buffer reference, which marks the index buffer as dirty, so the indices are uploaded again when the mesh is bound. Since SpriteBatch indices are static and never change, this change pre-uploads all the indices to the IndexBuffer at creation time and skips any further updates at render.

We don't have a nice way to do this without making a bunch of new methods specifically for this case, so I'm just binding and unbinding the mesh to trigger the upload once before a render occurs.
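To illustrate the idea, here is a minimal sketch in plain Java. It does not use libGDX classes: `ToyIndexBuffer`, its `uploads` counter, and `StaticIndexSketch` are hypothetical stand-ins for an index buffer with a dirty flag. The quad index pattern (0, 1, 2, 2, 3, 0) is the usual two-triangles-per-sprite layout; the rest is an assumption made for illustration, not the code in this PR.

```java
/** Toy model of an index buffer with a dirty flag (hypothetical, not the libGDX API). */
final class ToyIndexBuffer {
	private final short[] data;
	private boolean dirty = true; // freshly created data has not been uploaded yet
	int uploads = 0; // counts simulated uploads to the GPU

	ToyIndexBuffer (short[] indices) {
		this.data = indices.clone();
	}

	/** Returns the data; when forWriting is true the buffer is marked dirty. */
	short[] getBuffer (boolean forWriting) {
		dirty |= forWriting;
		return data;
	}

	/** Simulates binding: a dirty buffer is uploaded again, a clean one is not. */
	void bind () {
		if (dirty) {
			uploads++;
			dirty = false;
		}
	}
}

final class StaticIndexSketch {
	/** Builds the static index pattern: two triangles (6 indices) per 4-vertex sprite. */
	static short[] quadIndices (int sprites) {
		short[] indices = new short[sprites * 6];
		short v = 0;
		for (int i = 0; i < indices.length; i += 6, v += 4) {
			indices[i] = v;
			indices[i + 1] = (short)(v + 1);
			indices[i + 2] = (short)(v + 2);
			indices[i + 3] = (short)(v + 2);
			indices[i + 4] = (short)(v + 3);
			indices[i + 5] = v;
		}
		return indices;
	}

	/** Counts uploads when each flush does or does not mark the buffer dirty. */
	static int uploadsAfterFrames (int frames, boolean markDirtyEachFlush) {
		ToyIndexBuffer buffer = new ToyIndexBuffer(quadIndices(1000));
		buffer.bind(); // one-off upload before the first render
		for (int f = 0; f < frames; f++) {
			buffer.getBuffer(markDirtyEachFlush); // what a flush would do
			buffer.bind();
		}
		return buffer.uploads;
	}

	public static void main (String[] args) {
		System.out.println("dirty every flush: " + uploadsAfterFrames(60, true));
		System.out.println("pre-uploaded once: " + uploadsAfterFrames(60, false));
	}
}
```

In this toy model, marking the buffer dirty on every flush costs one upload per frame on top of the initial one, while the pre-uploaded variant uploads exactly once.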

Testers welcome! There seem to be no regressions for me in the test suite, but it would be good if others could take a look on their systems and devices.

SpriteBatch default timings from SpriteBatchPerformanceTest:

before change
gl20 0.225ms
gl30 0.280ms


after change
gl20 0.225ms
gl30 0.201ms

Related issue
#7345


obigu · 2024-04-17T10:36:19Z

tests/gdx-tests/src/com/badlogic/gdx/tests/SpriteBatchPerformanceTest.java

+		spriteBatch.begin();
+		stringBuilder.setLength(0);
+		stringBuilder.append("Mean Time ms: ");
+		stringBuilder.append(counter.getMean() / 1e6);


I'd add a conditional that shows "Please Wait..." in the text until counter.hasEnoughData() returns true. Depending on the device you run the test on, it may take a while to populate (over 5s on a Samsung S7), and I thought the test was not working.
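A minimal sketch of the suggested conditional, written as a standalone helper so it can run without libGDX. `MeanTimeLabel` and its parameters are hypothetical names; the `meanNanos` unit is an assumption inferred from the division by 1e6 in the diff above.

```java
/** Hypothetical helper that builds the on-screen text for the performance test. */
final class MeanTimeLabel {
	/**
	 * @param hasEnoughData what counter.hasEnoughData() would return
	 * @param meanNanos what counter.getMean() would return (assumed to be nanoseconds)
	 */
	static String text (boolean hasEnoughData, double meanNanos) {
		if (!hasEnoughData) {
			// Not enough samples yet: tell the user the test is still running.
			return "Please Wait...";
		}
		return "Mean Time ms: " + (meanNanos / 1e6);
	}

	public static void main (String[] args) {
		System.out.println(text(false, 0));
		System.out.println(text(true, 2_500_000.0));
	}
}
```

In the test itself the same branch could wrap the existing stringBuilder.append calls instead of returning a new String.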

obigu · 2024-04-17T10:37:18Z

tests/gdx-tests/src/com/badlogic/gdx/tests/SpriteBatchPerformanceTest.java

+
+			// fill the batch
+			for (int i = 0; i < 8190; i++) {
+				spriteBatch.draw(texture, 0, 0, 1, 1);


Minor thing, but maybe instead of drawing a single point we could change it to something like 20px * 20px, just so the user can see something is happening.

mgsx-dev · 2024-04-17

@obigu I think it would bias the test, since it would overload the fragment shader stage. With about 819,000 draws at 20x20 pixels each, it would have to render ~327 megapixels, which is a lot compared to the current amount (less than 1 megapixel). In the end it would defeat the isolation this test is aiming for.

If we want to isolate a bit more, we could disable blending and use a depth test that never passes in order to fully bypass the fragment stage.
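The fill-rate estimate above can be checked with quick arithmetic. The sketch below takes the 819,000 draw count from the comment as given and assumes every covered pixel is shaded with no overdraw rejection; `FillRateEstimate` is a hypothetical name.

```java
/** Back-of-the-envelope count of pixels the fragment stage would have to shade. */
final class FillRateEstimate {
	static long pixelsShaded (long sprites, int widthPx, int heightPx) {
		return sprites * widthPx * heightPx;
	}

	public static void main (String[] args) {
		long sprites = 819_000L; // figure quoted in the review comment
		System.out.println("1x1:   " + pixelsShaded(sprites, 1, 1) + " pixels");
		System.out.println("20x20: " + pixelsShaded(sprites, 20, 20) + " pixels");
	}
}
```

Under these assumptions a 20x20 sprite shades 400 times as many pixels as a 1x1 sprite, which is why the larger size would shift the measurement toward fragment work.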

obigu

I've tested this by running the libGDX test suite on Android (both GL20 and GL30) and on iOS. I found no issues or differences between the previous behaviour and this change, so I approve (the suggested changes would be nice to have but are not required). I can't review from a functionality/correctness point of view, even though the changes make sense to me.

EDIT: Sharing the results of the performance tests just for reference. On GL20 no changes observed.
On Android Samsung 7 device GL20:

Mean Without changes: 1.7s
Mean After changes: 1.7s

On Android Samsung 7 device GL30:

Mean Without changes: 1.2s
Mean After changes: 0.6s

NathanSweet · 2024-04-18T19:17:05Z

I ran Spine with GL30 with and without these changes, all seems fine. I didn't measure performance differences.

I'm good with merging but I didn't press the button in case obigu's proposed changes will be added.

Tom-Ski and others added 2 commits February 21, 2024 20:53

SpriteBatch optimizations to preupload full indices data for the size…

57ee279

… of the batch to prevent doing this each frame. Increase the performance of default SpriteBatch in gl30 where this performs worse than gl2 vertex array

Apply formatter

3973c78

obigu added this to the 1.12.2 milestone Feb 22, 2024

obigu mentioned this pull request Apr 15, 2024

Added TextureArraySpriteBatch #5914

Open

obigu reviewed Apr 17, 2024

obigu approved these changes Apr 17, 2024
