Reasoning models trained on synthetic data using reinforcement learning.
Yichao 'Peak' Ji
peakji
AI & ML interests
Agents, Small Language Models, Retrieval-Augmented Generation, Information Extraction
Recent Activity
liked
a model
2 days ago
ibm-granite/granite-4.0-h-small
liked
a model
7 days ago
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-BF16
upvoted
an
article
14 days ago
Transformers v5: Simple model definitions powering the AI ecosystem