Cache images across glTFs to avoid duplication #1521

azrogers · 2024-09-11T19:05:11Z

This implements the changes from CesiumGS/cesium-native#926 in Unreal. Now, when the same image is used by multiple glTFs, the texture data is only uploaded to the GPU once. Subsequent glTFs using the same image will create a new reference to the existing texture resource, reducing memory consumption.

kring

@azrogers I have some specific comments below, but I haven't looked at the code exhaustively.

More generally, I think the chains of futures here have gotten really complicated, and I'm not sure it's justified. From what I can tell, the only reason that any of the CesiumGltfComponent code has been futurified at all is because of the need to generate mipmaps, and the desire to avoid blocking worker threads (via waiting on a mutex) while some other thread generates mipmaps for a shared image.

Do I have that right? If so, I think a little restructuring will go a long way.

Currently, the mipmap generation starts right at the beginning of the process in loadModelAnyThreadPart. We get a SharedFuture for its completion. Unfortunately, that runInWorkerThread continuation in preprocessImage is going to be run synchronously, because we're already in a worker thread (and AsyncSystem knows it), and so the mutex is going to be held for the entire duration of the mipmap generation. This will not only block all threads that need this same image, it will also block all glTF loading, period (even across tilesets), due to the use of the global mutex. OK, so that has to be fixed, or else all this complexity is pointless.

But let's say we fix that, without getting into the details of how we would do it....

Next we start the main part of the glTF loading process. Way down in the depths of creating Unreal components from glTF node -> mesh -> primitive objects, we need to create some textures. And creating those textures requires first waiting on that previously-started mipmap generation, which requires that process to be async (i.e., return a Future), and that requirement bubbles its way through everything.

I think we can drastically simplify this, with very little tradeoff, by simply moving the mipmap continuation earlier. loadModelAnyThreadPart would look something like this:

static CesiumAsync::Future<TUniquePtr<UCesiumGltfComponent::HalfConstructed>>
loadModelAnyThreadPart(
    const CesiumAsync::AsyncSystem& asyncSystem,
    const glm::dmat4x4& transform,
    const CreateModelOptions& options,
    const CesiumGeospatial::Ellipsoid& ellipsoid) {
  return createMipMapsForAllTextures(...).thenInWorkerThread([...]() {
    // Return the result so the continuation resolves the outer Future.
    return doTheRestOfTheNormalProcessSynchronously(...);
  });
}

In other words, we wait (via continuation, not actually waiting) for mipmaps to be available for all of our images before we do the rest of the process, and we don't need Futures in any of the rest of the process at all.

Now, you might legitimately point out that this is a little less efficient. If we have multiple tiles sharing an image, we conceptually could have one thread generating mipmaps for that image while the other threads do the rest of the model loading process in parallel, and the design I proposed would preclude this. But in this scenario, we're no less efficient than we would be if each tile had its own image. And also, if we're worried about this inefficiency, shouldn't we be more worried about the same inefficiency while actually downloading the image? We could conceptually start creating Unreal meshes while downloading the shared images, too, not just while generating mipmaps for them.

Considering how much complexity it eliminates to ignore this small inefficiency, I think it's a really good tradeoff. Especially remembering that all those extra Futures flying around have a runtime cost, too.

Now, circling back, the trick to implement createMipMapsForAllTextures is to use a Promise. Roughly:

1. Lock the mutex.
2. Get/add the extension. If it already exists, return its Future and we're done. Otherwise:
3. Create a Promise.
4. Get the Promise's Future and call share to make a SharedFuture from it.
5. Assign the SharedFuture to the field on the extension.
6. Unlock the mutex.
7. Generate the mipmaps in the current thread.
8. Call resolve on the Promise.

This is the basic process needed to fix the problems in the current design as well.
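
The steps above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual implementation: it swaps in std::promise and std::shared_future from the C++ standard library for the CesiumAsync Promise and SharedFuture types, and MipmapCache, generateMipmaps, createMipmapsOnce, and demoSharedImage are hypothetical names invented for the example. Note that std::shared_future can only be waited on (blocking), whereas the real code would attach a continuation instead.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <future>
#include <mutex>
#include <string>
#include <thread>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for the per-image extension that holds the shared future.
struct MipmapCache {
  std::mutex mutex;
  std::unordered_map<std::string, std::shared_future<void>> pending;
  std::atomic<int> generations{0}; // how many times mipmaps were really built
};

// Hypothetical placeholder for the expensive mipmap generation.
void generateMipmaps(MipmapCache& cache) { ++cache.generations; }

std::shared_future<void>
createMipmapsOnce(MipmapCache& cache, const std::string& imageId) {
  std::unique_lock<std::mutex> lock(cache.mutex);       // 1. lock the mutex
  auto it = cache.pending.find(imageId);                // 2. get/add the entry
  if (it != cache.pending.end()) {
    return it->second; // already started elsewhere; lock released on return
  }
  std::promise<void> promise;                           // 3. create a promise
  std::shared_future<void> future =
      promise.get_future().share();                     // 4. make it shareable
  cache.pending.emplace(imageId, future);               // 5. publish it
  lock.unlock();                                        // 6. unlock the mutex
  generateMipmaps(cache);                               // 7. do the heavy work
  promise.set_value();                                  // 8. resolve
  return future;
}

// Four threads request the same image; returns the number of generations run.
int demoSharedImage() {
  MipmapCache cache;
  std::vector<std::shared_future<void>> futures(4);
  std::vector<std::thread> workers;
  for (std::size_t i = 0; i < futures.size(); ++i) {
    workers.emplace_back([&cache, &futures, i] {
      futures[i] = createMipmapsOnce(cache, "shared.png");
    });
  }
  for (std::thread& worker : workers) {
    worker.join();
  }
  for (const auto& future : futures) {
    future.wait(); // every caller observes the same completion
  }
  return cache.generations.load();
}
```

The point of the ordering is that the mutex is held only long enough to publish the shared future, so other threads asking for the same image get the future immediately rather than queueing on the lock for the duration of the mipmap generation.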


kring · 2024-10-30T09:43:38Z

A few things before we can merge this, in order from rather important to mostly trivial. 😁

1. I have a level with CWT + Bing + a WMS, and there seems to be quite a large memory leak. Process memory increases quickly even just spinning in place. I spent some time trying to track this down but haven't had much success yet. Presumably it's the Unreal textures leaking, because we made changes here and because the entire thing is crazy complicated.
2. Compilation errors in UE 5.3 / macOS, and test failures on Windows.
3. What's going on with Cesium3DTileset.spec.cpp? It has both a unit test and a performance test (we usually separate these into different files), and both look more like a quick experiment than something that should live in the code base long term. Should we polish these up? Remove them?
4. We need to update CHANGES.md.

kring · 2024-10-31T10:39:49Z

The memory leak and macOS build problems are fixed. @azrogers, can you please give this a self-review today and tick off the other two issues (plus test failures, if there still are any)? Then we should be in good shape to merge it for the release tomorrow.

azrogers added 15 commits August 16, 2024 14:25

Tests passing.

5c4d926

Merge with main

4a0eb74

Update cesium-native

0748638

Update cesium-native

e08581e

Start of test for shared images.

1d85d7b

Almost fully asynced glTF loading

fcb53ea

Fixed UniquePtr issues

21f034f

Attempting to resolve invalid pointer issue with material

164379e

Fixed crash

dd14eb1

Tile debug overlay

d9bf27e

Working! All tests passing

5c486cf

Fix test, update cesium-native

30eeee7

Merge branch 'main' of github.com:CesiumGS/cesium-unreal into shared-assets

03065e9

Re-add workaround from 396f78f

cc1bd44

clang-format

1b7b940

kring self-requested a review September 12, 2024 02:17

Fix clang-formatting by letting npm dependencies update.

38042e4

kring reviewed Sep 12, 2024

azrogers added 10 commits September 13, 2024 11:00

Update cesium-native

4d68a89

Merge branch 'shared-assets' of github.com:CesiumGS/cesium-unreal into shared-assets

3784f5d

De-asyncify most of the gltf loading. Still seeing mysterious compilation errors.

1ff1fd3

Fix lifetime issues

b0f37ec

Log asset stats

6b43702

Fix mipmap generation

6fcce71

Hopefully fix CI errors

ac6dbbb

Add Map header

1ef546f

Add automation test headers

4da4e83

Add WITH_EDITOR block

0637307

kring mentioned this pull request Sep 20, 2024

Cache images across glTFs to avoid duplication CesiumGS/cesium-native#926

Merged

kring reviewed Sep 20, 2024


azrogers and others added 5 commits October 10, 2024 15:54

Format, update cesium-native

9a675e2

Clean up CesiumTextureUtility

85d6b39

Use shared-assets-wip branch of cesium-native.

d8f27c6

Update cesium-native.

4a96eea

Update cesium-native.

2d1ccd8

kring mentioned this pull request Oct 29, 2024

Update cesium-unreal for shared asset changes #1536

Merged

kring and others added 12 commits October 29, 2024 19:05

Merge remote-tracking branch 'origin/main' into shared-assets

d7cda44

Merge remote-tracking branch 'origin/shared-assets' into shared-assets-wip

71b885b

Update cesium-native.

463e027

Merge pull request #1536 from CesiumGS/shared-assets-wip

9bc94d3

Update cesium-unreal for shared asset changes

Return to shared-assets branch of cesium-native

6b7f4af

pCesium -> pAsset.

adec72e

Fix test failure.

f16dffb

Formatting.

c45f82f

Don't use deprecated methods.

04bac36

Fix compiler error on macOS, hopefully.

71d0297

Remove unused code.

d61b912

LogAssetStats -> LogSharedAssetStats

2ec8b60

azrogers and others added 3 commits October 30, 2024 16:50

Remove CesiumGltf usings to try to fix OSX build

da6f64c

Fix memory leak and non-async texture creation.

935e0db

More namespacing.

26eff5e

azrogers and others added 4 commits October 31, 2024 13:10

Fix tests

783bb4e

Update CHANGES, remove Snowdon test

358d779

Remove Class.h include

a6d7b3b

Eliminate another round of using namespace CesiumGltf.

28333a1

kring merged commit 203c697 into main Nov 1, 2024
23 checks passed

kring deleted the shared-assets branch November 1, 2024 02:44

kring mentioned this pull request Nov 15, 2024

Fix CesiumGltf namespace problems. #1548

Merged
