|
|
|
Detectron2 model zoo's experimental settings and a few implementation details are different from Detectron. |
|
|
|
The differences in implementation details are shared in |
|
[Compatibility with Other Libraries](../../docs/notes/compatibility.md). |
|
|
|
The differences in model zoo's experimental settings include: |
|
* Use scale augmentation during training. This improves AP with lower training cost. |
|
* Use L1 loss instead of smooth L1 loss for simplicity. This sometimes improves box AP but may |
|
affect other AP. |
|
* Use `POOLER_SAMPLING_RATIO=0` instead of 2. This does not significantly affect AP. |
|
* Use `ROIAlignV2`. This does not significantly affect AP. |
|
|
|
In this directory, we provide a few configs that __do not__ have the above changes. |
|
They mimic Detectron's behavior as close as possible, |
|
and provide a fair comparison of accuracy and speed against Detectron. |
|
|
|
<!-- |
|
./gen_html_table.py --config 'Detectron1-Comparisons/*.yaml' --name "Faster R-CNN" "Keypoint R-CNN" "Mask R-CNN" --fields lr_sched train_speed inference_speed mem box_AP mask_AP keypoint_AP --base-dir ../../../configs/Detectron1-Comparisons |
|
--> |
|
|
|
|
|
<table><tbody> |
|
<!-- START TABLE --> |
|
<!-- TABLE HEADER --> |
|
<th valign="bottom">Name</th> |
|
<th valign="bottom">lr<br/>sched</th> |
|
<th valign="bottom">train<br/>time<br/>(s/iter)</th> |
|
<th valign="bottom">inference<br/>time<br/>(s/im)</th> |
|
<th valign="bottom">train<br/>mem<br/>(GB)</th> |
|
<th valign="bottom">box<br/>AP</th> |
|
<th valign="bottom">mask<br/>AP</th> |
|
<th valign="bottom">kp.<br/>AP</th> |
|
<th valign="bottom">model id</th> |
|
<th valign="bottom">download</th> |
|
<!-- TABLE BODY --> |
|
<!-- ROW: faster_rcnn_R_50_FPN_noaug_1x --> |
|
<tr><td align="left"><a href="faster_rcnn_R_50_FPN_noaug_1x.yaml">Faster R-CNN</a></td> |
|
<td align="center">1x</td> |
|
<td align="center">0.219</td> |
|
<td align="center">0.038</td> |
|
<td align="center">3.1</td> |
|
<td align="center">36.9</td> |
|
<td align="center"></td> |
|
<td align="center"></td> |
|
<td align="center">137781054</td> |
|
<td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/faster_rcnn_R_50_FPN_noaug_1x/137781054/model_final_7ab50c.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/faster_rcnn_R_50_FPN_noaug_1x/137781054/metrics.json">metrics</a></td> |
|
</tr> |
|
<!-- ROW: keypoint_rcnn_R_50_FPN_1x --> |
|
<tr><td align="left"><a href="keypoint_rcnn_R_50_FPN_1x.yaml">Keypoint R-CNN</a></td> |
|
<td align="center">1x</td> |
|
<td align="center">0.313</td> |
|
<td align="center">0.071</td> |
|
<td align="center">5.0</td> |
|
<td align="center">53.1</td> |
|
<td align="center"></td> |
|
<td align="center">64.2</td> |
|
<td align="center">137781195</td> |
|
<td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/keypoint_rcnn_R_50_FPN_1x/137781195/model_final_cce136.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/keypoint_rcnn_R_50_FPN_1x/137781195/metrics.json">metrics</a></td> |
|
</tr> |
|
<!-- ROW: mask_rcnn_R_50_FPN_noaug_1x --> |
|
<tr><td align="left"><a href="mask_rcnn_R_50_FPN_noaug_1x.yaml">Mask R-CNN</a></td> |
|
<td align="center">1x</td> |
|
<td align="center">0.273</td> |
|
<td align="center">0.043</td> |
|
<td align="center">3.4</td> |
|
<td align="center">37.8</td> |
|
<td align="center">34.9</td> |
|
<td align="center"></td> |
|
<td align="center">137781281</td> |
|
<td align="center"><a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/mask_rcnn_R_50_FPN_noaug_1x/137781281/model_final_62ca52.pkl">model</a> | <a href="https://dl.fbaipublicfiles.com/detectron2/Detectron1-Comparisons/mask_rcnn_R_50_FPN_noaug_1x/137781281/metrics.json">metrics</a></td> |
|
</tr> |
|
</tbody></table> |
|
|
|
## Comparisons: |
|
|
|
* Faster R-CNN: Detectron's AP is 36.7, similar to ours. |
|
* Keypoint R-CNN: Detectron's AP is box 53.6, keypoint 64.2. Fixing a Detectron's |
|
[bug](https://github.com/facebookresearch/Detectron/issues/459) lead to a drop in box AP, and can be |
|
compensated back by some parameter tuning. |
|
* Mask R-CNN: Detectron's AP is box 37.7, mask 33.9. We're 1 AP better in mask AP, due to more correct implementation. |
|
See [this article](https://ppwwyyxx.com/blog/2021/Where-are-Pixels/) for details. |
|
|
|
For speed comparison, see [benchmarks](https://detectron2.readthedocs.io/notes/benchmarks.html). |
|
|