ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Herve Jegou

PMID: 36094972
DOI: 10.1109/TPAMI.2022.3206148

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Hugo Touvron et al. IEEE Trans Pattern Anal Mach Intell. 2023 Apr.

. 2023 Apr;45(4):5314-5321.

doi: 10.1109/TPAMI.2022.3206148. Epub 2023 Mar 7.

Authors

Hugo Touvron, Piotr Bojanowski, Mathilde Caron, Matthieu Cord, Alaaeldin El-Nouby, Edouard Grave, Gautier Izacard, Armand Joulin, Gabriel Synnaeve, Jakob Verbeek, Herve Jegou

PMID: 36094972
DOI: 10.1109/TPAMI.2022.3206148

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly good results. We share pre-trained models and our code based on the Timm library.

PubMed Disclaimer

LinkOut - more resources

Full Text Sources
- IEEE Computer Society
- IEEE Engineering in Medicine and Biology Society
Other Literature Sources
- The Lens - Patent Citations Database

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Authors

Abstract

LinkOut - more resources

Full Text Sources

Other Literature Sources