Commit 2c4a60e

Merge pull request #2355 from open-mmlab/dev-1.x
Dev 1.x
2 parents dab2fdf + 93119ae commit 2c4a60e

48 files changed: 6231 additions and 137 deletions

configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-256x192.py

Lines changed: 553 additions & 0 deletions
Large diffs are not rendered by default.

configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-384x288.py

Lines changed: 553 additions & 0 deletions
Large diffs are not rendered by default.

configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-256x192.py

Lines changed: 553 additions & 0 deletions
Large diffs are not rendered by default.

configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-384x288.py

Lines changed: 553 additions & 0 deletions
Large diffs are not rendered by default.

configs/body_2d_keypoint/rtmpose/body8/rtmpose-s_8xb256-210e_body8-256x192.py

Lines changed: 553 additions & 0 deletions
Large diffs are not rendered by default.

configs/body_2d_keypoint/rtmpose/body8/rtmpose-t_8xb256-210e_body8-256x192.py

Lines changed: 554 additions & 0 deletions
Large diffs are not rendered by default.
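
The config file names listed above pack several training settings into one string. The sketch below is not part of the commit; it parses such a name under the assumption that it follows the MMPose pattern `{model}_{gpus}xb{batch per GPU}-{epochs}e_{dataset}-{input size}`. The helper `parse_config_name` and the `ConfigName` tuple are hypothetical names, and reading `256x192` as height x width is an assumption based on how MMPose configs are usually named.

```python
import re
from typing import NamedTuple


class ConfigName(NamedTuple):
    """Fields recovered from a config file name (hypothetical helper type)."""
    model: str
    num_gpus: int
    batch_per_gpu: int
    epochs: int
    dataset: str
    height: int
    width: int


# Assumed pattern: {model}_{gpus}xb{batch}-{epochs}e_{dataset}-{H}x{W}
_PATTERN = re.compile(
    r"^(?P<model>.+?)_(?P<gpus>\d+)xb(?P<batch>\d+)-(?P<epochs>\d+)e_"
    r"(?P<dataset>[^-]+)-(?P<h>\d+)x(?P<w>\d+)$"
)


def parse_config_name(path: str) -> ConfigName:
    """Split a config path such as the ones listed in this commit into fields."""
    stem = path.rsplit("/", 1)[-1]
    if stem.endswith(".py"):
        stem = stem[:-3]
    match = _PATTERN.match(stem)
    if match is None:
        raise ValueError(f"unrecognised config name: {stem}")
    return ConfigName(
        model=match["model"],
        num_gpus=int(match["gpus"]),
        batch_per_gpu=int(match["batch"]),
        epochs=int(match["epochs"]),
        dataset=match["dataset"],
        height=int(match["h"]),
        width=int(match["w"]),
    )


info = parse_config_name(
    "configs/body_2d_keypoint/rtmpose/body8/rtmpose-t_8xb256-210e_body8-256x192.py"
)
# Total batch size across all GPUs, if the "8xb256" reading is right.
total_batch = info.num_gpus * info.batch_per_gpu
```

Under this reading, every body8 config in the list trains for 210 epochs with a total batch size of 8 x 256 = 2048.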
Lines changed: 76 additions & 0 deletions
@@ -0,0 +1,76 @@
+<!-- [ALGORITHM] -->
+
+<details>
+<summary align="right"><a href="https://arxiv.org/abs/2303.07399">RTMPose (arXiv'2023)</a></summary>
+
+```bibtex
+@misc{https://doi.org/10.48550/arxiv.2303.07399,
+  doi = {10.48550/ARXIV.2303.07399},
+  url = {https://arxiv.org/abs/2303.07399},
+  author = {Jiang, Tao and Lu, Peng and Zhang, Li and Ma, Ningsheng and Han, Rui and Lyu, Chengqi and Li, Yining and Chen, Kai},
+  keywords = {Computer Vision and Pattern Recognition (cs.CV), FOS: Computer and information sciences, FOS: Computer and information sciences},
+  title = {RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose},
+  publisher = {arXiv},
+  year = {2023},
+  copyright = {Creative Commons Attribution 4.0 International}
+}
+
+```
+
+</details>
+
+<!-- [BACKBONE] -->
+
+<details>
+<summary align="right"><a href="https://arxiv.org/abs/2212.07784">RTMDet (arXiv'2022)</a></summary>
+
+```bibtex
+@misc{lyu2022rtmdet,
+  title={RTMDet: An Empirical Study of Designing Real-Time Object Detectors},
+  author={Chengqi Lyu and Wenwei Zhang and Haian Huang and Yue Zhou and Yudong Wang and Yanyi Liu and Shilong Zhang and Kai Chen},
+  year={2022},
+  eprint={2212.07784},
+  archivePrefix={arXiv},
+  primaryClass={cs.CV}
+}
+```
+
+</details>
+
+<!-- [DATASET] -->
+
+<details>
+<summary align="right"><a href="https://link.springer.com/chapter/10.1007/978-3-319-10602-1_48">COCO (ECCV'2014)</a></summary>
+
+```bibtex
+@inproceedings{lin2014microsoft,
+  title={Microsoft coco: Common objects in context},
+  author={Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Hays, James and Perona, Pietro and Ramanan, Deva and Doll{\'a}r, Piotr and Zitnick, C Lawrence},
+  booktitle={European conference on computer vision},
+  pages={740--755},
+  year={2014},
+  organization={Springer}
+}
+```
+
+</details>
+
+- Results on COCO val2017, using a detector with a human AP of 56.4 on COCO val2017.
+- `*` denotes a model trained on 7 public datasets:
+  - [AI Challenger](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#aic)
+  - [MS COCO](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#coco)
+  - [CrowdPose](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#crowdpose)
+  - [MPII](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#mpii)
+  - [sub-JHMDB](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#sub-jhmdb-dataset)
+  - [Halpe](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_wholebody_keypoint.html#halpe)
+  - [PoseTrack18](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#posetrack18)
+- `Body8` denotes the 7 datasets above plus the [OCHuman](https://mmpose.readthedocs.io/en/latest/dataset_zoo/2d_body_keypoint.html#ochuman) dataset, used together for evaluation.
+
+| Config | Input Size | AP<sup><br>(COCO) | PCK@0.2<sup><br>(Body8) | AUC<sup><br>(Body8) | EPE<sup><br>(Body8) | Params(M) | FLOPS(G) | Download |
+| :--------------------------------------------: | :--------: | :---------------: | :---------------------: | :-----------------: | :-----------------: | :-------: | :------: | :-----------------------------------------------: |
+| [RTMPose-t\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-t_8xb256-210e_body8-256x192.py) | 256x192 | 65.9 | 91.44 | 63.18 | 19.45 | 3.34 | 0.36 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-t_simcc-body7_pt-body7_420e-256x192-026a1439_20230504.pth) |
+| [RTMPose-s\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-s_8xb256-210e_body8-256x192.py) | 256x192 | 69.7 | 92.45 | 65.15 | 17.85 | 5.47 | 0.68 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-s_simcc-body7_pt-body7_420e-256x192-acd4a1ef_20230504.pth) |
+| [RTMPose-m\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-256x192.py) | 256x192 | 74.9 | 94.25 | 68.59 | 15.12 | 13.59 | 1.93 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-m_simcc-body7_pt-body7_420e-256x192-e48f03d0_20230504.pth) |
+| [RTMPose-l\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-256x192.py) | 256x192 | 76.7 | 95.08 | 70.14 | 13.79 | 27.66 | 4.16 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-l_simcc-body7_pt-body7_420e-256x192-4dba18fc_20230504.pth) |
+| [RTMPose-m\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-384x288.py) | 384x288 | 76.6 | 94.64 | 70.38 | 13.98 | 13.72 | 4.33 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-m_simcc-body7_pt-body7_420e-384x288-65e718c4_20230504.pth) |
+| [RTMPose-l\*](/configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-384x288.py) | 384x288 | 78.3 | 95.36 | 71.58 | 13.08 | 27.79 | 9.35 | [Model](https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-l_simcc-body7_pt-body7_420e-384x288-3f5a1437_20230504.pth) |
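
The table trades accuracy against compute: larger variants and larger inputs raise COCO AP but cost more FLOPs. As a rough illustration (not part of the commit), the sketch below copies the AP, parameter, and FLOPs columns from the table and picks the most accurate variant that fits a compute budget. The `Row` class and `best_under_budget` function are hypothetical helpers.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Row:
    """One row of the results table (values copied from the README above)."""
    name: str
    input_size: str
    coco_ap: float
    params_m: float
    gflops: float


ROWS = [
    Row("RTMPose-t", "256x192", 65.9, 3.34, 0.36),
    Row("RTMPose-s", "256x192", 69.7, 5.47, 0.68),
    Row("RTMPose-m", "256x192", 74.9, 13.59, 1.93),
    Row("RTMPose-l", "256x192", 76.7, 27.66, 4.16),
    Row("RTMPose-m", "384x288", 76.6, 13.72, 4.33),
    Row("RTMPose-l", "384x288", 78.3, 27.79, 9.35),
]


def best_under_budget(rows: List[Row], max_gflops: float) -> Optional[Row]:
    """Return the highest-AP row whose FLOPs fit the budget, or None."""
    candidates = [r for r in rows if r.gflops <= max_gflops]
    if not candidates:
        return None
    return max(candidates, key=lambda r: r.coco_ap)


choice = best_under_budget(ROWS, max_gflops=2.0)
```

With a 2 GFLOPs budget this selects RTMPose-m at 256x192. Note that RTMPose-l at 256x192 and RTMPose-m at 384x288 land within 0.1 AP of each other at similar FLOPs, so parameter count may be the deciding factor between them.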
Lines changed: 93 additions & 0 deletions
@@ -0,0 +1,93 @@
+Collections:
+- Name: RTMPose
+  Paper:
+    Title: "RTMPose: Real-Time Multi-Person Pose Estimation based on MMPose"
+    URL: https://arxiv.org/abs/2303.07399
+  README: https://github.com/open-mmlab/mmpose/blob/main/projects/rtmpose/README.md
+Models:
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-t_8xb256-210e_body8-256x192.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: &id001
+    - RTMPose
+    Training Data: &id002
+    - AI Challenger
+    - COCO
+    - CrowdPose
+    - MPII
+    - sub-JHMDB
+    - Halpe
+    - PoseTrack18
+  Name: rtmpose-t_8xb256-210e_body8-256x192
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.659
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-t_simcc-body7_pt-body7_420e-256x192-026a1439_20230504.pth
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-s_8xb256-210e_body8-256x192.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: *id001
+    Training Data: *id002
+  Name: rtmpose-s_8xb256-210e_body8-256x192
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.697
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-s_simcc-body7_pt-body7_420e-256x192-acd4a1ef_20230504.pth
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-256x192.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: *id001
+    Training Data: *id002
+  Name: rtmpose-m_8xb256-210e_body8-256x192
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.749
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-m_simcc-body7_pt-body7_420e-256x192-e48f03d0_20230504.pth
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-256x192.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: *id001
+    Training Data: *id002
+  Name: rtmpose-l_8xb256-210e_body8-256x192
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.767
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-l_simcc-body7_pt-body7_420e-256x192-4dba18fc_20230504.pth
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-m_8xb256-210e_body8-384x288.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: *id001
+    Training Data: *id002
+  Name: rtmpose-m_8xb256-210e_body8-384x288
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.766
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-m_simcc-body7_pt-body7_420e-384x288-65e718c4_20230504.pth
+- Config: configs/body_2d_keypoint/rtmpose/body8/rtmpose-l_8xb256-210e_body8-384x288.py
+  In Collection: RTMPose
+  Metadata:
+    Architecture: *id001
+    Training Data: *id002
+  Name: rtmpose-l_8xb256-210e_body8-384x288
+  Results:
+  - Dataset: Body8
+    Metrics:
+      AP: 0.783
+
+    Task: Body 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/v1/projects/rtmposev1/rtmpose-l_simcc-body7_pt-body7_420e-384x288-3f5a1437_20230504.pth
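
The metafile stores AP as a fraction (for example `0.659`), while the README table in the same commit reports percentages (`65.9`). The values match the README's `AP (COCO)` column, although the metafile labels the dataset `Body8`. A minimal sketch of a consistency check between the two files follows; it is not part of the commit, the values are copied by hand from the two hunks, and `find_mismatches` is a hypothetical helper.

```python
from typing import Dict, List

# AP (COCO) column of the README table, in percent.
README_AP = {
    "rtmpose-t_8xb256-210e_body8-256x192": 65.9,
    "rtmpose-s_8xb256-210e_body8-256x192": 69.7,
    "rtmpose-m_8xb256-210e_body8-256x192": 74.9,
    "rtmpose-l_8xb256-210e_body8-256x192": 76.7,
    "rtmpose-m_8xb256-210e_body8-384x288": 76.6,
    "rtmpose-l_8xb256-210e_body8-384x288": 78.3,
}

# Metrics.AP entries of the metafile, as fractions.
METAFILE_AP = {
    "rtmpose-t_8xb256-210e_body8-256x192": 0.659,
    "rtmpose-s_8xb256-210e_body8-256x192": 0.697,
    "rtmpose-m_8xb256-210e_body8-256x192": 0.749,
    "rtmpose-l_8xb256-210e_body8-256x192": 0.767,
    "rtmpose-m_8xb256-210e_body8-384x288": 0.766,
    "rtmpose-l_8xb256-210e_body8-384x288": 0.783,
}


def find_mismatches(
    readme: Dict[str, float], metafile: Dict[str, float], tol: float = 0.05
) -> List[str]:
    """List model names whose metafile AP is missing or disagrees with the README."""
    mismatches = []
    for name, ap_percent in readme.items():
        ap_fraction = metafile.get(name)
        if ap_fraction is None or abs(ap_fraction * 100.0 - ap_percent) > tol:
            mismatches.append(name)
    return mismatches


problems = find_mismatches(README_AP, METAFILE_AP)
```

For the values in this commit the list comes back empty, so the two files agree.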

configs/face_2d_keypoint/rtmpose/README.md

Lines changed: 1 addition & 1 deletion
@@ -29,4 +29,4 @@ Results on LaPa dataset

 | Model | Input Size | NME | Details and Download |
 | :-------: | :--------: | :--: | :---------------------------------------: |
-| RTMPose-m | 256x256 | 1.29 | [rtmpose_lapa.md](./wflw/rtmpose_lapa.md) |
+| RTMPose-m | 256x256 | 1.29 | [rtmpose_lapa.md](./lapa/rtmpose_lapa.md) |

configs/face_2d_keypoint/rtmpose/lapa/rtmpose-m_8xb64-120e_lapa-256x256.py

Lines changed: 3 additions & 3 deletions
@@ -187,7 +187,7 @@
         data_root=data_root,
         data_mode=data_mode,
         ann_file='annotations/lapa_train.json',
-        data_prefix=dict(img='train/images/'),
+        data_prefix=dict(img=''),
         pipeline=train_pipeline,
     ))
 val_dataloader = dict(
@@ -201,7 +201,7 @@
         data_root=data_root,
         data_mode=data_mode,
         ann_file='annotations/lapa_val.json',
-        data_prefix=dict(img='val/images/'),
+        data_prefix=dict(img=''),
         test_mode=True,
         pipeline=val_pipeline,
     ))
@@ -216,7 +216,7 @@
         data_root=data_root,
         data_mode=data_mode,
         ann_file='annotations/lapa_test.json',
-        data_prefix=dict(img='test/images/'),
+        data_prefix=dict(img=''),
         test_mode=True,
         pipeline=val_pipeline,
     ))
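
The three hunks above clear the image prefix for every LaPa split. The commit does not state the reason; one plausible reading is that the annotation files already store image paths relative to `data_root`, so a non-empty prefix would repeat the split directory. The sketch below illustrates that reading with plain path joins. It does not call MMPose, and `resolve_image_path`, the root `data/LaPa/`, and the file name are hypothetical.

```python
import posixpath


def resolve_image_path(data_root: str, img_prefix: str, file_name: str) -> str:
    """Join root, image prefix, and annotation file name into one path."""
    return posixpath.join(data_root, img_prefix, file_name)


# Hypothetical values: assumes the annotation JSON stores split-relative paths.
DATA_ROOT = "data/LaPa/"
FILE_NAME = "train/images/0001.jpg"

# Before the change: the prefix repeats the directory already in the file name.
old_path = resolve_image_path(DATA_ROOT, "train/images/", FILE_NAME)

# After the change: an empty prefix leaves the annotation path untouched.
new_path = resolve_image_path(DATA_ROOT, "", FILE_NAME)
```

Under these assumptions `old_path` contains `train/images/` twice, while `new_path` points at the file directly under the data root.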

configs/face_2d_keypoint/rtmpose/lapa/rtmpose_lapa.md

Lines changed: 1 addition & 1 deletion
@@ -33,7 +33,7 @@

 </details>

-Results on COCO-WholeBody-Face val set
+Results on LaPa val set

 | Arch | Input Size | NME | ckpt | log |
 | :------------------------------------------------------------- | :--------: | :--: | :------------------------------------------------------------: | :------------------------------------------------------------: |

configs/face_2d_keypoint/topdown_regression/README.md

Lines changed: 2 additions & 1 deletion
@@ -15,4 +15,5 @@ Result on WFLW test set
 | Model | Input Size | NME | ckpt | log |
 | :-------------------------------------------------------------- | :--------: | :--: | :------------------------------------------------------------: | :-----------------------------------------------------------: |
 | [ResNet-50](/configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_8xb64-210e_wflw-256x256.py) | 256x256 | 4.88 | [ckpt](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256-92d0ba7f_20210303.pth) | [log](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_20210303.log.json) |
-| [ResNet-50+SoftWingLoss](/configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_softwingloss_8xb64-210e_wflw-256x256.py) | 256x256 | 4.67 | [ckpt](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_softwingloss-4d34f22a_20211212.pth) | [log](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_softwingloss_20211212.log.json) |
+| [ResNet-50+WingLoss](/configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_wingloss_8xb64-210e_wflw-256x256.py) | 256x256 | 4.67 | [ckpt](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_wingloss-f82a5e53_20210303.pth) | [log](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_wingloss_20210303.log.json) |
+| [ResNet-50+SoftWingLoss](/configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_softwingloss_8xb64-210e_wflw-256x256.py) | 256x256 | 4.44 | [ckpt](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_softwingloss-4d34f22a_20211212.pth) | [log](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_softwingloss_20211212.log.json) |
Lines changed: 76 additions & 0 deletions
@@ -0,0 +1,76 @@
+<!-- [ALGORITHM] -->
+
+<details>
+<summary align="right"><a href="http://openaccess.thecvf.com/content_cvpr_2014/html/Toshev_DeepPose_Human_Pose_2014_CVPR_paper.html">DeepPose (CVPR'2014)</a></summary>
+
+```bibtex
+@inproceedings{toshev2014deeppose,
+  title={Deeppose: Human pose estimation via deep neural networks},
+  author={Toshev, Alexander and Szegedy, Christian},
+  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
+  pages={1653--1660},
+  year={2014}
+}
+```
+
+</details>
+
+<!-- [BACKBONE] -->
+
+<details>
+<summary align="right"><a href="http://openaccess.thecvf.com/content_cvpr_2016/html/He_Deep_Residual_Learning_CVPR_2016_paper.html">ResNet (CVPR'2016)</a></summary>
+
+```bibtex
+@inproceedings{he2016deep,
+  title={Deep residual learning for image recognition},
+  author={He, Kaiming and Zhang, Xiangyu and Ren, Shaoqing and Sun, Jian},
+  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
+  pages={770--778},
+  year={2016}
+}
+```
+
+</details>
+
+<!-- [ALGORITHM] -->
+
+<details>
+<summary align="right"><a href="http://openaccess.thecvf.com/content_cvpr_2018/html/Feng_Wing_Loss_for_CVPR_2018_paper.html">Wingloss (CVPR'2018)</a></summary>
+
+```bibtex
+@inproceedings{feng2018wing,
+  title={Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks},
+  author={Feng, Zhen-Hua and Kittler, Josef and Awais, Muhammad and Huber, Patrik and Wu, Xiao-Jun},
+  booktitle={Computer Vision and Pattern Recognition (CVPR), 2018 IEEE Conference on},
+  year={2018},
+  pages ={2235-2245},
+  organization={IEEE}
+}
+```
+
+</details>
+
+<!-- [DATASET] -->
+
+<details>
+<summary align="right"><a href="http://openaccess.thecvf.com/content_cvpr_2018/html/Wu_Look_at_Boundary_CVPR_2018_paper.html">WFLW (CVPR'2018)</a></summary>
+
+```bibtex
+@inproceedings{wu2018look,
+  title={Look at boundary: A boundary-aware face alignment algorithm},
+  author={Wu, Wayne and Qian, Chen and Yang, Shuo and Wang, Quan and Cai, Yici and Zhou, Qiang},
+  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
+  pages={2129--2138},
+  year={2018}
+}
+```
+
+</details>
+
+Results on WFLW dataset
+
+The model is trained on WFLW train set.
+
+| Model | Input Size | NME | ckpt | log |
+| :-------------------------------------------------------------- | :--------: | :--: | :------------------------------------------------------------: | :-----------------------------------------------------------: |
+| [ResNet-50+WingLoss](/configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_wingloss_8xb64-210e_wflw-256x256.py) | 256x256 | 4.67 | [ckpt](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_wingloss-f82a5e53_20210303.pth) | [log](https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_wingloss_20210303.log.json) |
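
For readers unfamiliar with the loss named in this results file, the sketch below implements the scalar Wing loss as defined in the Feng et al. (CVPR 2018) paper cited above: logarithmic for small errors, linear for large ones, with a constant that makes the two pieces meet. It is a standalone illustration, not MMPose's implementation, and the defaults `w=10.0` and `eps=2.0` are assumptions chosen for the example rather than values read from this commit's config.

```python
import math


def wing_loss(x: float, w: float = 10.0, eps: float = 2.0) -> float:
    """Wing loss for a single residual x (prediction minus target).

    |x| <  w: w * ln(1 + |x| / eps)   (amplifies small and medium errors)
    |x| >= w: |x| - C                 (behaves like L1 for large errors)
    where C = w - w * ln(1 + w / eps) keeps the function continuous at |x| = w.
    """
    abs_x = abs(x)
    c = w - w * math.log(1.0 + w / eps)
    if abs_x < w:
        return w * math.log(1.0 + abs_x / eps)
    return abs_x - c


small = wing_loss(0.5)   # logarithmic branch
large = wing_loss(25.0)  # linear branch
```

Compared with plain L1, the logarithmic branch has a steeper slope near zero, which is the paper's motivation for using it on landmark regression where most errors are small.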
Lines changed: 16 additions & 0 deletions
@@ -0,0 +1,16 @@
+Models:
+- Config: configs/face_2d_keypoint/topdown_regression/wflw/td-reg_res50_wingloss_8xb64-210e_wflw-256x256.py
+  In Collection: ResNet
+  Metadata:
+    Architecture:
+    - DeepPose
+    - ResNet
+    - WingLoss
+    Training Data: WFLW
+  Name: td-reg_res50_wingloss_8xb64-210e_wflw-256x256
+  Results:
+  - Dataset: WFLW
+    Metrics:
+      NME: 4.67
+    Task: Face 2D Keypoint
+  Weights: https://download.openmmlab.com/mmpose/face/deeppose/deeppose_res50_wflw_256x256_wingloss-f82a5e53_20210303.pth
