davidhajdu
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -146,12 +146,12 @@ Images are resized/rescaled such that the shortest side is at least 480 and at m
|
|
146 |
|
147 |
## Evaluation
|
148 |
|
149 |
-
![image/png](https://
|
150 |
|
151 |
|
152 |
### Model Architecture and Objective
|
153 |
|
154 |
-
![image/png](https://
|
155 |
|
156 |
Overview of DAB-DETR. We extract image spatial features using a CNN backbone followed with Transformer encoders to refine the CNN features.
|
157 |
Then dual queries, including positional queries (anchor boxes) and content queries (decoder embeddings), are fed into the decoder to probe the objects which correspond to the anchors and have similar patterns with the content queries. The dual queries are updated layer-by-layer to get close to the target ground-truth objects gradually.
|
|
|
146 |
|
147 |
## Evaluation
|
148 |
|
149 |
+
![image/png](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/dab_detr_results.png)
|
150 |
|
151 |
|
152 |
### Model Architecture and Objective
|
153 |
|
154 |
+
![image/png](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/dab_detr_model_arch.png)
|
155 |
|
156 |
Overview of DAB-DETR. We extract image spatial features using a CNN backbone followed with Transformer encoders to refine the CNN features.
|
157 |
Then dual queries, including positional queries (anchor boxes) and content queries (decoder embeddings), are fed into the decoder to probe the objects which correspond to the anchors and have similar patterns with the content queries. The dual queries are updated layer-by-layer to get close to the target ground-truth objects gradually.
|