In a Training Loop 🔄

Urro

urroxyz

https://urro.xyz/

urroxyz

AI & ML interests

i like research on empowering small LMs to do better 😮 i DISLIKE video & image generation (esp. ai "art") 🤢

Recent Activity

updated a collection 1 day ago

WTF GENIUS PAPERS

upvoted a paper 1 day ago

H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

upvoted a paper 3 days ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

View all activity

Organizations

updated a collection 1 day ago

WTF GENIUS PAPERS

Collection

Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. • 68 items • Updated 1 day ago • 8

upvoted a paper 1 day ago

H-Neurons: On the Existence, Impact, and Origin of Hallucination-Associated Neurons in LLMs

Paper • 2512.01797 • Published Dec 1, 2025 • 6

upvoted 8 papers 3 days ago

TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents

Paper • 2602.19633 • Published 10 days ago • 7

PETS: A Principled Framework Towards Optimal Trajectory Allocation for Efficient Test-Time Self-Consistency

Paper • 2602.16745 • Published 16 days ago • 8

Benchmark Test-Time Scaling of General LLM Agents

Paper • 2602.18998 • Published 12 days ago • 8

QuantVLA: Scale-Calibrated Post-Training Quantization for Vision-Language-Action Models

Paper • 2602.20309 • Published 10 days ago • 16

upvoted a collection 3 days ago

Qwen3.5

Collection

21 items • Updated 2 days ago • 952

liked a model 3 days ago

Qwen/Qwen3.5-9B

Image-Text-to-Text • 10B • Updated 4 days ago • 341k • 449

updated a collection 3 days ago

TINY MODELS WITH BIG INTELLIGENCE

Collection

Tiny (<30B) models that tend to outperform their same-parameter counterparts. • 15 items • Updated 3 days ago • 3

New activity in Qwen/Qwen3.5-27B 3 days ago

Qwen are is the 8b coming out?

#14 opened 8 days ago by

crownelius

commented a paper 4 days ago

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

Paper • 2602.18964 • Published 12 days ago • 1 •

upvoted 4 papers 4 days ago

Yor-Sarc: A gold-standard dataset for sarcasm detection in a low-resource African language

Paper • 2602.18964 • Published 12 days ago • 1

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Paper • 2602.22190 • Published 8 days ago • 15

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

Paper • 2602.21534 • Published 9 days ago • 23

DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference

Paper • 2602.21548 • Published 9 days ago • 37

Urro

AI & ML interests

Recent Activity

Organizations

urroxyz's activity

Qwen are is the 8b coming out?