-
Notifications
You must be signed in to change notification settings - Fork 345
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
stereoDNN resnet18 test fails #142
Comments
One idea that might be worth checking is to see which call/plugin causes memory spike. I would specifically check cuDNN autotuner in Conv3DPlugin plugin (also read my comments around that line for more details). This function, Ideally, Ex version of the tuner should be used as it does not allocate any memory unlike its non-Ex counterpart but I did not have time to implement that properly. Finally, with TensorRT 6.0 release, these plugins are not needed anymore - TRT now supports 3D convolutions. However, the model generation code needs to be updated to use TRT layers rather than our plugins. |
Thanks for getting back @Alexey-Kamenev
|
@Alexey-Kamenev my TX2 is running out of memory when I run the stereoDNN model tests for resnet18 fp16 and fp32. When I run:
./bin/nvstereo_sample_app_debug resnet18 1025 321 ./models/ResNet-18/TensorRT/trt_weights.bin ./sample_app/data/img_left.png ./sample_app/data/img_right.png ./bin/disp.bin
Further down in the calculations memory consumption spikes up from approx 4GB used over the course of the calculations to the max limit of the TX2 memory and the process gets killed.
All other models, NVsmall, NVtiny, and Resnet18-2D run without problems within the processing times as indicated in the wiki.
I am running Redtail on a TX2 with Jetpack 4.2.2 / Ubuntu 18.04 and ROS Melodic - I know not supported but I am not sure if that's the root cause behind this issue since the other stuff seems to work.Also, I dont want to go back to the old 3.2 release due to other dependencies.
Any ideas are appreciated!
The text was updated successfully, but these errors were encountered: