NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval Paper • 2603.12824 • Published 5 days ago • 5
Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction Paper • 2603.09930 • Published 7 days ago
Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction Paper • 2603.09930 • Published 7 days ago
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval Paper • 2603.12824 • Published 5 days ago • 5
Look in the Middle: Structural Anchor Pruning for Scalable Visual RAG Indexing Paper • 2601.20107 • Published Jan 27