TimeViper Collection A hybrid vision-language model for long video understanding • 2 items • Updated Nov 23 • 1
Time-R1 Collection Time-R1: Post-Training Large Vision-Language Model for Temporal Video Grounding • 4 items • Updated Nov 23 • 3
TiME Collection The TiME collection gathers monolingual BERT-style encoders for 16 languages (xs, s, m). Each model outputs embeddings distilled from XLM-R large. • 49 items • Updated Aug 26 • 1
Teacher Logits Collection Logits captured from large models to act as the teacher for distillation • 3 items • Updated 9 days ago • 7
InternVideo-Next Collection InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision • 5 items • Updated about 15 hours ago • 4
Hummingbird Collection Hummingbird is a series of video generation models built on AMD Instinct™ GPUs, including text-to-video and image-to-video models. • 2 items • Updated 20 days ago • 1
Instella ✨ Collection Announcing Instella, a series of 3 billion parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 13 items • Updated 20 days ago • 10
Holo2 Collection Holo2 - Cost-Efficient Models for Cross-Platform Computer-Use Agents • 3 items • Updated Nov 13 • 21
Cambrian-S-Data Collection Data used during Cambrian-S's 4-stage training • 4 items • Updated Nov 10 • 3
MDGA Collection Make Diffusion Great Again. The resource list for Super Data Learners, Quokka, and OpenMoE 2. • 16 items • Updated Nov 4 • 8