Datasets Models Results
Models Generic SigLIP2 SO400M
Google

SigLIP2 SO400M

Vision encoder with Shape-Optimized 400M architecture and sigmoid contrastive loss.

Google Open Source Vision Only
Generic
Model Type
#3
Overall Rank
40.54%
Avg. R@1
32.98%
Avg.
mAP@20
1152
Embedding
Size
384
Input
Size
5
Datasets

Performance Across Datasets

Dataset Category R@1 R@5 mAP
Stanford Online Products E-commerce 80.28% 90.01% 60.79%
Products-10K E-commerce 65.98% 84.25% 44.15%
DIY v1 Hardware/DIY 23.86% 46.61% 32.46%
Automotive v1 Automotive 22.02% 40.06% 25.32%
Clips-and-Connectors v1 Industrial 10.57% 23.12% 2.16%
Average 40.54% 56.81% 32.98%