
Version of densenet #391

Closed
haofanwang opened this issue Jan 17, 2018 · 8 comments
haofanwang commented Jan 17, 2018
May I ask which version of densenet is in torchvision.models — the original, or efficient_densenet_pytorch? The original is memory hungry. If it's the original version, would the pytorch team consider adding the efficient version to the model zoo?

soumith (Member) commented Jan 17, 2018

We plan to add a memory-efficient version soon via pytorch/pytorch#4594.
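For context, the checkpointing feature that the memory-efficient version builds on is `torch.utils.checkpoint.checkpoint`: it trades compute for memory by not storing a block's intermediate activations during forward and recomputing them during backward. A minimal standalone sketch (not the DenseNet code; `expensive_block` is an invented stand-in for a bottleneck stack), assuming a reasonably recent PyTorch:

```python
import torch
from torch.utils.checkpoint import checkpoint

def expensive_block(x):
    # Stand-in for an expensive layer stack; its intermediate
    # activations are recomputed during backward instead of stored.
    return torch.relu(x * 2).sum()

# checkpoint needs at least one input with requires_grad=True,
# otherwise the recomputed graph has nothing to backpropagate into
# (the caveat fmassa raises below).
x = torch.ones(4, requires_grad=True)
out = checkpoint(expensive_block, x)
out.backward()
print(x.grad)  # tensor([2., 2., 2., 2.])
```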

gpleiss (Contributor) commented Apr 26, 2018

Now that PyTorch 0.4 is officially out, I'm making the efficient_densenet_pytorch code use the checkpointing feature. I can make a PR to this repo once I get it working!

fmassa (Member) commented Apr 26, 2018

@gpleiss that would be a nice addition! Maybe by specifying a constructor argument that dispatches to checkpoint? The only thing we need to keep in mind is that checkpoint currently requires the input to have requires_grad=True, which is suboptimal in cases where we don't checkpoint.

gpleiss (Contributor) commented Apr 26, 2018

@fmassa I'm thinking something like this?

# prev_features = [feat_1, feat_2, ...]
# ...
if self.efficient and any(f.requires_grad for f in prev_features):
    bottleneck_output = checkpoint(bn_function, *prev_features)
else:
    bottleneck_output = bn_function(*prev_features)
# ...
# ...

And the self.efficient flag is something that can be passed in by the user.
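A minimal, torch-free sketch of the dispatch logic above, runnable on its own. `DummyTensor`, `bn_function`, and `checkpoint` here are invented stand-ins for illustration; in the real model they would be `torch.Tensor`, the DenseNet bottleneck function, and `torch.utils.checkpoint.checkpoint`:

```python
class DummyTensor:
    """Stand-in for torch.Tensor: just a value plus a requires_grad flag."""
    def __init__(self, value, requires_grad=False):
        self.value = value
        self.requires_grad = requires_grad

def bn_function(*features):
    # Stand-in for the bottleneck (BN-ReLU-Conv) function.
    return sum(f.value for f in features)

def checkpoint(fn, *args):
    # Stand-in for torch.utils.checkpoint.checkpoint.
    return fn(*args)

def forward(prev_features, efficient):
    # prev_features is a list, so the requires_grad check must iterate
    # over the features rather than touch .requires_grad on the list.
    if efficient and any(f.requires_grad for f in prev_features):
        return checkpoint(bn_function, *prev_features)
    return bn_function(*prev_features)

feats = [DummyTensor(1.0, requires_grad=True), DummyTensor(2.0)]
print(forward(feats, efficient=True))  # 3.0
```

Either branch produces the same output; the `efficient` flag only decides whether the bottleneck's activations are stored or recomputed.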

fmassa (Member) commented Apr 26, 2018

That sounds good!

gpleiss (Contributor) commented Apr 26, 2018

I'm profiling it now for my repo, and it seems to be good! Should have a PR ready tomorrow.

gpleiss (Contributor) commented May 23, 2018

So sorry for the late response on this...

The efficient densenet code seems to work great on a single GPU. However, on multiple GPUs with nn.DataParallel, @wandering007 points out that the checkpointing feature is quite slow (see gpleiss/efficient_densenet_pytorch#36). I think this is because checkpointing requires some sort of inter-GPU synchronization.

I'm opening up an issue in PyTorch about this. I'm holding off on a PR for now.

fmassa (Member) commented May 24, 2018

Sounds good, thanks @gpleiss !

@fmassa fmassa closed this as completed Feb 21, 2021
rajveerb pushed a commit to rajveerb/vision that referenced this issue Nov 30, 2023