Usage report of the efficient_conv_bn_eval feature #1252
Replies: 6 comments 21 replies
-
I used the maskrcnn model as a baseline to test the difference of turning on this function. When the model first started training, it could indeed save 13% of the memory. But as the training continued, I used nvidia-smi to check the memory. I found that the memory continued to increase from 7GB. It soared until it occupied 24GB, but the memory usage in the training log record remained as low as it was at the beginning |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
Hello, I have set "norm_val", but there is an error:torch.fx.proxy.TraceError: symbolically traced variables cannot be used as inputs to control flow,how should to do? |
Beta Was this translation helpful? Give feedback.
-
Can this method still work if the batch_size is set to 1 during training? |
Beta Was this translation helpful? Give feedback.
-
Tested on the SlowOnly model ( |
Beta Was this translation helpful? Give feedback.
-
I have encoutered a question, please help. ./tools/dist_train.sh bisai.py 1 --cfg-options efficient_conv_bn_eval="[backbone]" my dist_train.sh #!/usr/bin/env bash
CONFIG=$1
GPUS=$2
NNODES=${NNODES:-1}
NODE_RANK=${NODE_RANK:-0}
PORT=${PORT:-29568}
MASTER_ADDR=${MASTER_ADDR:-"127.0.0.1"}
PYTHONPATH="$(dirname $0)/..":$PYTHONPATH \
CUDA_VISIBLE_DEVICES=0,1 python -m torch.distributed.launch \
--nnodes=$NNODES \
--node_rank=$NODE_RANK \
--master_addr=$MASTER_ADDR \
--nproc_per_node=$GPUS \
--master_port=$PORT \
$(dirname "$0")/train.py \
$CONFIG \
--launcher pytorch ${@:3}
Then I have the following exception:
I checked a lot of material and ask GPT, but got no answer. Please do help and thx in advance. |
Beta Was this translation helpful? Give feedback.
-
Please report here how enabling
efficient_conv_bn_eval
feature helps your training. We expect the feature to seamlessly reduce memory footprint without affecting the performance. Currently it is an optional feature, but if the feedback is good enough, we can turn this feature into default :)Beta Was this translation helpful? Give feedback.
All reactions