
Cuda out of memory #24

Open
Vicvickyue opened this issue Apr 26, 2024 · 4 comments

Comments

@Vicvickyue

Hello! Thank you so much for your amazing work. I'm posting to ask about the CUDA out of memory error that I encounter when running the InstanceDiffusion inference demo. I'm using a single RTX 3050, and no other process is using the GPU while the program runs.
[Screenshot of the CUDA out-of-memory error]

@frank-xwang (Owner)

Hi, you may want to use a smaller '--num_images'. Also, please confirm that flash attention (which we use by default) is actually being used, as it reduces memory usage.
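
A quick way to confirm that the flash-attn package is importable in your environment (a minimal sketch; the exact fallback behavior inside InstanceDiffusion is an assumption here):

# Minimal availability check for the flash-attn package.
# Assumption: when this import fails, the code falls back to a
# standard attention implementation that uses more memory.
try:
    import flash_attn
    print("flash-attn", flash_attn.__version__, "is available")
except ImportError:
    print("flash-attn is NOT installed; expect higher memory usage")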

@raindrop313 commented Jun 6, 2024

I encountered the same issue, and reducing "--num_images" did not resolve it. Since the error message indicates "out of memory" during the model-weight loading phase, could you please provide an estimate of how much GPU memory is required to run this project?
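
For reference, a general PyTorch workaround for OOM at the loading step (just a sketch, not from this repo; the model object below is hypothetical) is to deserialize the checkpoint on the CPU and move things to the GPU afterwards:

import torch

# Load the checkpoint into host RAM first so deserialization does not
# allocate GPU memory; move the model to the GPU in a separate step.
state_dict = torch.load("pretrained/instancediffusion_sd15.pth",
                        map_location="cpu")
# model.load_state_dict(state_dict)  # 'model' is hypothetical here
# model.to("cuda")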

@milky245 commented Jun 8, 2024

Hello, I have run into the same problem. I tried reducing --num_images to 2 or 1, and I have confirmed that flash_attn runs normally. I ran the demo on an RTX 4060 with 8 GB of memory, and I would like to know how much GPU memory is needed for training and deployment. @frank-xwang Thanks, and looking forward to your reply.
[Screenshot of the error, 2024-06-08]

@frank-xwang (Owner) commented Jun 10, 2024

Apologies for the delayed response.

Thank you for your interest in InstanceDiffusion. I have made further optimizations to reduce the memory usage of the code. Please update to the latest version by pulling the new InstanceDiffusion code. To run this updated code, you will likely need a GPU with at least 13 GB of memory. I recently tested it locally on RTX 6000 GPUs, which have 24 GB of memory, and inference consumed about 12.8 GB. For training the model, we use A100 GPUs with 80 GB of memory.
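
To check whether your GPU meets that ~13 GB guideline before launching (a plain PyTorch sketch, not part of the repo):

import torch

# Report the total memory of GPU 0 and compare it against the
# ~13 GB figure quoted above.
props = torch.cuda.get_device_properties(0)
total_gb = props.total_memory / 1024**3
print(f"{props.name}: {total_gb:.1f} GB total")
if total_gb < 13:
    print("Warning: below the ~13 GB this code reportedly needs")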

The command I used for model inference:

CUDA_VISIBLE_DEVICES=6 python inference.py \
  --num_images 8 \
  --output OUTPUT/demo/ \
  --input_json demos/demo_cat_dog_robin.json \
  --ckpt pretrained/instancediffusion_sd15.pth \
  --test_config configs/test_box.yaml \
  --guidance_scale 7.5 \
  --alpha 0.75 \
  --seed 4 \
  --mis 0.3 \
  --cascade_strength 0.3

And the memory usage is attached below:
[Screenshot showing about 12.8 GB of GPU memory in use]
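
To reproduce a comparable number programmatically (plain PyTorch, not from the repo), you can query the peak allocation around the inference call:

import torch

# Reset the peak-memory counter, run inference, then read the peak.
# Note: nvidia-smi reports a higher figure, since it also counts the
# CUDA context and the caching allocator's reserved-but-free memory.
torch.cuda.reset_peak_memory_stats()
# ... run the inference call here ...
peak_gb = torch.cuda.max_memory_allocated() / 1024**3
print(f"peak GPU memory allocated: {peak_gb:.2f} GB")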

Hope it helps!
