oneformer_ade20k_swin_tinyModel45/100 via “lightweight-swin-tiny-backbone-inference”
image-segmentation model by undefined. 2,48,429 downloads.
Unique: Swin Tiny backbone uses hierarchical window-based self-attention (shifted windows across 4 stages) to achieve O(n log n) complexity instead of O(n²), reducing FLOPs by 60% vs ViT-Base while maintaining competitive accuracy. Parameter count of 28M is 3× smaller than Swin Base (87M), enabling deployment to edge devices.
vs others: Faster inference than ResNet-based backbones (e.g., ResNet50) on modern hardware due to better GPU utilization of attention operations; smaller than Swin Base/Large while maintaining hierarchical feature extraction that CNNs lack, making it ideal for edge deployment.