Request more experiment results to compare to other architecture. #8

Luciennnnnnn · 2021-10-16T01:45:54Z

Hi!
This work is pretty interesting, but I think there should are more results like in "Demystifying Local Vision Transformer: Sparse Connectivity, Weight Sharing, and Dynamic Weight" as they replace local self-attention with depth-wise convolution in Swin Transformer. Since you conduct an advanced one with a more simple architecture compared to SwinTransformer, so I wonder if ConvMixer can get similar performance on object detection and semantic segmentation.

BradKML · 2021-10-21T04:11:53Z

This sounds like a good idea, but it requires standard benchmarks and model zoos.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Request more experiment results to compare to other architecture. #8

Request more experiment results to compare to other architecture. #8

Luciennnnnnn commented Oct 16, 2021

BradKML commented Oct 21, 2021

Request more experiment results to compare to other architecture. #8

Request more experiment results to compare to other architecture. #8

Comments

Luciennnnnnn commented Oct 16, 2021

BradKML commented Oct 21, 2021