A Critical Look at Targeted Instruction Selection Datasets used in the paper "A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn’t)" Harvard-DCML/tis-random-unbalanced Viewer • Updated about 2 hours ago • 30k Harvard-DCML/tis-subset-datasets-Olmo-3-1025-7B Viewer • Updated about 2 hours ago • 300k Harvard-DCML/tis-subset-datasets-SmolLM3-3B-Base Viewer • Updated about 2 hours ago • 300k Harvard-DCML/tis-subset-datasets-Qwen3-4B-Base Viewer • Updated about 2 hours ago • 300k
Boomerang Distillation Distilled models from the boomerang distillation paper (https://arxiv.org/abs/2510.05064). Harvard-DCML/boomerang-llama-3.2-1.9B 2B • Updated Oct 14, 2025 • 1 Harvard-DCML/boomerang-pythia-1.6B 2B • Updated Oct 14, 2025 Harvard-DCML/boomerang-qwen3-4.9B 5B • Updated Oct 14, 2025 • 1 Harvard-DCML/boomerang-qwen3-2.3B Text Generation • 3B • Updated Nov 18, 2025 • 354 • 1
A Critical Look at Targeted Instruction Selection Datasets used in the paper "A Critical Look at Targeted Instruction Selection: Disentangling What Matters (and What Doesn’t)" Harvard-DCML/tis-random-unbalanced Viewer • Updated about 2 hours ago • 30k Harvard-DCML/tis-subset-datasets-Olmo-3-1025-7B Viewer • Updated about 2 hours ago • 300k Harvard-DCML/tis-subset-datasets-SmolLM3-3B-Base Viewer • Updated about 2 hours ago • 300k Harvard-DCML/tis-subset-datasets-Qwen3-4B-Base Viewer • Updated about 2 hours ago • 300k
Boomerang Distillation Distilled models from the boomerang distillation paper (https://arxiv.org/abs/2510.05064). Harvard-DCML/boomerang-llama-3.2-1.9B 2B • Updated Oct 14, 2025 • 1 Harvard-DCML/boomerang-pythia-1.6B 2B • Updated Oct 14, 2025 Harvard-DCML/boomerang-qwen3-4.9B 5B • Updated Oct 14, 2025 • 1 Harvard-DCML/boomerang-qwen3-2.3B Text Generation • 3B • Updated Nov 18, 2025 • 354 • 1