Deformable Object Detection

1

rtdetr_v2_r18vdModel38/100

via “real-time object detection with deformable transformer attention”

object-detection model by undefined. 1,06,918 downloads.

Unique: Uses deformable transformer attention (sampling only task-relevant spatial regions) combined with ResNet-18 backbone for real-time inference, whereas standard DETR processes full feature maps with quadratic attention complexity. This architectural choice reduces FLOPs by ~40% compared to vanilla transformer detectors while maintaining anchor-free detection paradigm.

vs others: Faster than YOLOv8 on edge devices due to deformable attention efficiency, and more accurate than lightweight anchor-based detectors (MobileNet-SSD) because transformer attention captures long-range spatial relationships without hand-crafted anchor priors.

2

rtdetr_r50vdModel36/100

via “real-time object detection with deformable transformer architecture”

object-detection model by undefined. 32,868 downloads.

Unique: Uses deformable cross-attention instead of standard multi-head attention, allowing the model to dynamically sample only task-relevant spatial regions; combined with ResNet-50-VD backbone (a more efficient variant than standard ResNet-50), this achieves <100ms inference while maintaining COCO AP of 53.0+ without NMS post-processing

vs others: Faster inference than YOLOv8 on equivalent hardware (deformable attention vs dense convolution) and more accurate than EfficientDet-D0 on COCO while using fewer parameters than Faster R-CNN variants

3

deformable-detrModel33/100

object-detection model by undefined. 27,497 downloads.

Unique: Incorporates deformable attention that adjusts to the spatial distribution of objects, enhancing detection in diverse scenarios compared to static attention mechanisms.

vs others: More adaptable to varying object shapes and sizes than traditional object detection models like Faster R-CNN due to its deformable attention mechanism.

4

mmdetBenchmark30/100

via “transformer-based detector implementation (detr, deformable detr, dino variants)”

OpenMMLab Detection Toolbox and Benchmark

Unique: Implements transformer-based detection as a set prediction problem with learnable query embeddings refined through multi-layer transformer decoders, and supports deformable attention that learns spatial offsets to focus on relevant regions, enabling efficient processing of multi-scale features without hand-crafted anchors

vs others: More efficient than vanilla DETR because deformable attention reduces computational complexity from O(n²) to O(n) by attending only to relevant spatial regions; more integrated than standalone DETR implementations because it shares backbone/neck infrastructure with CNN-based detectors, enabling easy comparison

Top Matches

Also Known As

Company