davidhajdu
/

dab-detr-resnet-50

Object Detection

Inference Endpoints

Model card Files Files and versions Community

davidhajdu commited on about 24 hours ago

Commit

4cd20e9

·

verified ·

1 Parent(s): 47e4be4

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -146,12 +146,12 @@ Images are resized/rescaled such that the shortest side is at least 480 and at m
 ## Evaluation
-![image/png](https://github.com/conditionedstimulus/hf_media/blob/main/https://github.com/conditionedstimulus/hf_media/blob/main/results_dab_detr.png)
 ### Model Architecture and Objective
-![image/png](https://github.com/conditionedstimulus/hf_media/blob/main/model_arch_dab_detr.png)
 Overview of DAB-DETR. We extract image spatial features using a CNN backbone followed with Transformer encoders to refine the CNN features.
 Then dual queries, including positional queries (anchor boxes) and content queries (decoder embeddings), are fed into the decoder to probe the objects which correspond to the anchors and have similar patterns with the content queries. The dual queries are updated layer-by-layer to get close to the target ground-truth objects gradually.

 ## Evaluation
+![image/png](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/dab_detr_results.png)
 ### Model Architecture and Objective
+![image/png](https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/transformers/model_doc/dab_detr_model_arch.png)
 Overview of DAB-DETR. We extract image spatial features using a CNN backbone followed with Transformer encoders to refine the CNN features.
 Then dual queries, including positional queries (anchor boxes) and content queries (decoder embeddings), are fed into the decoder to probe the objects which correspond to the anchors and have similar patterns with the content queries. The dual queries are updated layer-by-layer to get close to the target ground-truth objects gradually.