11-03 Vision TransformersPyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
11-03 Vision TransformersSegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation
11-01 Vision TransformersSegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers