WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 68 items • Updated 1 day ago • 8
H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs Paper • 2512.01797 • Published Dec 1, 2025 • 6
TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents Paper • 2602.19633 • Published 10 days ago • 7
PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency Paper • 2602.16745 • Published 16 days ago • 8
Benchmark Test-Time Scaling of General LLM Agents Paper • 2602.18998 • Published 12 days ago • 8
QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models Paper • 2602.20309 • Published 10 days ago • 16
Test-Time Training with KV Binding Is Secretly Linear Attention Paper • 2602.21204 • Published 9 days ago • 30
Query-focused and Memory-aware Reranker for Long Context Processing Paper • 2602.12192 • Published 21 days ago • 55
TINY MODELS WITH BIG INTELLIGENCE Collection Tiny (<30B) models that tend to outperform their same-parameter counterparts. • 15 items • Updated 3 days ago • 3
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language Paper • 2602.18964 • Published 12 days ago • 1 • 4
Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language Paper • 2602.18964 • Published 12 days ago • 1
GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL Paper • 2602.22190 • Published 8 days ago • 15
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning Paper • 2602.21534 • Published 9 days ago • 23
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 9 days ago • 37