@@ -114,18 +114,18 @@ Prepare datasets according to the [guidelines](https://github.com/open-mmlab/mms
114
114
<br >
115
115
<div >
116
116
117
- | method | backbone | resolution | mIoU (ss/ms) | #params | FLOPs | Config | Download |
118
- | :---------: | :------------: | :--------: | :-----------: | :-----: | :---: | :--- --------------------------------------------------------------------------------------------: | :----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
119
- | UperNet | InternImage-T | 512x1024 | 82.58 / 83.40 | 59M | 1889G | [ config] ( ./configs/cityscapes/upernet_internimage_t_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_t_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_t_512x1024_160k_cityscapes.log.json ) |
120
- | UperNet | InternImage-S | 512x1024 | 82.74 / 83.45 | 80M | 2035G | [ config] ( ./configs/cityscapes/upernet_internimage_s_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_s_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_s_512x1024_160k_cityscapes.log.json ) |
121
- | UperNet | InternImage-B | 512x1024 | 83.18 / 83.97 | 128M | 2369G | [ config] ( ./configs/cityscapes/upernet_internimage_b_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_b_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_b_512x1024_160k_cityscapes.log.json ) |
122
- | UperNet | InternImage-L | 512x1024 | 83.68 / 84.41 | 256M | 3234G | [ config] ( ./configs/cityscapes/upernet_internimage_l_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_l_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_l_512x1024_160k_cityscapes.log.json ) |
123
- | UperNet\* | InternImage-L | 512x1024 | 85.94 / 86.22 | 256M | 3234G | [ config] ( ./configs/cityscapes/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.log.json ) |
124
- | UperNet | InternImage-XL | 512x1024 | 83.62 / 84.28 | 368M | 4022G | [ config] ( ./configs/cityscapes/upernet_internimage_xl_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_xl_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_xl_512x1024_160k_cityscapes.log.json ) |
125
- | UperNet\* | InternImage-XL | 512x1024 | 86.20 / 86.42 | 368M | 4022G | [ config] ( ./configs/cityscapes/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.log.json ) |
126
- | SegFormer\* | InternImage-L | 512x1024 | 85.16 / 85.67 | 220M | 1580G | [ config] ( ./configs/cityscapes/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.log.json ) |
127
- | SegFormer\* | InternImage-XL | 512x1024 | 85.41 / 85.93 | 330M | 2364G | [ config] ( ./configs/cityscapes/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.log.json ) |
128
- | Mask2Former | InternImage-H | 1024x1024 | 86.37 / 86.96 | 1094M | 7878G | [ config] ( ./configs/cityscapes/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes_ss .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes.log.json ) |
117
+ | method | backbone | resolution | mIoU (ss/ms) | #params | FLOPs | Config | Download |
118
+ | :----------- : | :------------: | :--------: | :-----------: | :-----: | :---: | :--------------------------------------------------------------------------------------------: | :----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
119
+ | UperNet | InternImage-T | 512x1024 | 82.58 / 83.40 | 59M | 1889G | [ config] ( ./configs/cityscapes/upernet_internimage_t_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_t_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_t_512x1024_160k_cityscapes.log.json ) |
120
+ | UperNet | InternImage-S | 512x1024 | 82.74 / 83.45 | 80M | 2035G | [ config] ( ./configs/cityscapes/upernet_internimage_s_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_s_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_s_512x1024_160k_cityscapes.log.json ) |
121
+ | UperNet | InternImage-B | 512x1024 | 83.18 / 83.97 | 128M | 2369G | [ config] ( ./configs/cityscapes/upernet_internimage_b_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_b_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_b_512x1024_160k_cityscapes.log.json ) |
122
+ | UperNet | InternImage-L | 512x1024 | 83.68 / 84.41 | 256M | 3234G | [ config] ( ./configs/cityscapes/upernet_internimage_l_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_l_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_l_512x1024_160k_cityscapes.log.json ) |
123
+ | UperNet\* | InternImage-L | 512x1024 | 85.94 / 86.22 | 256M | 3234G | [ config] ( ./configs/cityscapes/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_l_512x1024_160k_mapillary2cityscapes.log.json ) |
124
+ | UperNet | InternImage-XL | 512x1024 | 83.62 / 84.28 | 368M | 4022G | [ config] ( ./configs/cityscapes/upernet_internimage_xl_512x1024_160k_cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_xl_512x1024_160k_cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_xl_512x1024_160k_cityscapes.log.json ) |
125
+ | UperNet\* | InternImage-XL | 512x1024 | 86.20 / 86.42 | 368M | 4022G | [ config] ( ./configs/cityscapes/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/upernet_internimage_xl_512x1024_160k_mapillary2cityscapes.log.json ) |
126
+ | SegFormer\* | InternImage-L | 512x1024 | 85.16 / 85.67 | 220M | 1580G | [ config] ( ./configs/cityscapes/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/segformer_internimage_l_512x1024_160k_mapillary2cityscapes.log.json ) |
127
+ | SegFormer\* | InternImage-XL | 512x1024 | 85.41 / 85.93 | 330M | 2364G | [ config] ( ./configs/cityscapes/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/segformer_internimage_xl_512x1024_160k_mapillary2cityscapes.log.json ) |
128
+ | Mask2Former\* | InternImage-H | 1024x1024 | 86.37 / 86.96 | 1094M | 7878G | [ config] ( ./configs/cityscapes/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_1024x1024_80k_mapillary2cityscapes.log.json ) |
129
129
130
130
\* denotes the model is trained using extra Mapillary dataset.
131
131
@@ -138,9 +138,9 @@ Prepare datasets according to the [guidelines](https://github.com/open-mmlab/mms
138
138
<br >
139
139
<div >
140
140
141
- | method | backbone | resolution | mIoU (ss) | #params | FLOPs | Config | Download |
142
- | :---------: | :-----------: | :--------: | :-------: | :-----: | :---: | :-------------------------------------------------------------------------------------- : | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
143
- | Mask2Former | InternImage-H | 896x896 | 52.6 | 1.31B | 4635G | [ config] ( ./configs/coco_stuff164k/mask2former_internimage_h_896_80k_cocostuff164k_ss .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_896_80k_cocostuff164k.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_896_80k_cocostuff164k.log.json ) |
141
+ | method | backbone | resolution | mIoU (ss) | #params | FLOPs | Config | Download |
142
+ | :---------: | :-----------: | :--------: | :-------: | :-----: | :---: | :-----------------------------------------------------------------------------------: | :--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
143
+ | Mask2Former | InternImage-H | 896x896 | 52.6 | 1.31B | 4635G | [ config] ( ./configs/coco_stuff164k/mask2former_internimage_h_896_80k_cocostuff164k .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_896_80k_cocostuff164k.pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_896_80k_cocostuff164k.log.json ) |
144
144
145
145
</div >
146
146
@@ -151,9 +151,9 @@ Prepare datasets according to the [guidelines](https://github.com/open-mmlab/mms
151
151
<br >
152
152
<div >
153
153
154
- | method | backbone | resolution | mIoU (ss) | #params | FLOPs | Config | Download |
155
- | :---------: | :-----------: | :--------: | :-------: | :-----: | :---: | :------------------------------------------------------------------------------------: | :------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
156
- | Mask2Former | InternImage-H | 896x896 | 52 .6 | 1.31B | 4635G | [ config] ( ./configs/coco_stuff10k/mask2former_internimage_h_896_80k_cocostuff10k_ss .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_896_80k_cocostuff10k .pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_896_80k_cocostuff10k .log.json ) |
154
+ | method | backbone | resolution | mIoU (ss) | #params | FLOPs | Config | Download |
155
+ | :---------: | :-----------: | :--------: | :--------- : | :-----: | :---: | :------------------------------------------------------------------------------------------ : | :---------------- ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------: |
156
+ | Mask2Former | InternImage-H | 512x512 | 59.2 / 59 .6 | 1.28B | 1528G | [ config] ( ./configs/coco_stuff10k/mask2former_internimage_h_512_40k_cocostuff164k_to_10k .py ) | [ ckpt] ( https://huggingface.co/OpenGVLab/InternImage/resolve/main/mask2former_internimage_h_512_40k_cocostuff164k_to_10k .pth ) \| [ log] ( https://huggingface.co/OpenGVLab/InternImage/raw/main/mask2former_internimage_h_512_40k_cocostuff164k_to_10k .log.json ) |
157
157
158
158
</div >
159
159
0 commit comments