Aayush

Aayushfaced

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

openai-community/gpt2

upvoted an article about 1 month ago

We Got Claude to Fine-Tune an Open Source LLM

upvoted an article about 1 month ago

New in llama.cpp: Model Management

View all activity

Organizations

None yet

liked a model 7 days ago

openai-community/gpt2

Text Generation • 0.1B • Updated Feb 19, 2024 • 6.94M • 3.09k

upvoted 2 articles about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025

•

579

Article

New in llama.cpp: Model Management

Dec 11, 2025

•

112

upvoted 2 collections about 2 months ago

NVIDIA Nemotron V2

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated about 9 hours ago • 101

Inference Optimized Checkpoints (with Model Optimizer)

Collection

A collection of generative models quantized and optimized for inference with Model Optimizer. • 47 items • Updated about 9 hours ago • 73

upvoted an article about 2 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

Jun 21, 2025

•

liked 3 Spaces 3 months ago

The Smol Training Playbook

📚

2.9k

The secrets to building world-class LLMs

FineWeb: decanting the web for the finest text data at scale

🍷

1.27k

Generate high-quality text data for LLMs using FineWeb

Robot Learning: A Tutorial

📝

312

Learn about modern robot learning techniques and applications

liked a Space 4 months ago

The Ultra-Scale Playbook

🌌

3.66k

The ultimate guide to training LLM on large GPU Clusters

upvoted 2 papers 4 months ago

UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning

Paper • 2509.11543 • Published Sep 15, 2025 • 48

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16, 2025 • 117

liked a dataset 4 months ago

InternRobotics/OmniWorld

Viewer • Updated 13 days ago • 6.35B • 20.8k • 79

upvoted 7 papers 4 months ago

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10, 2025 • 56

A Survey of Reinforcement Learning for Large Reasoning Models

Paper • 2509.08827 • Published Sep 10, 2025 • 190

LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20, 2024 • 176

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10, 2025 • 98

Aayush

AI & ML interests

Recent Activity

Organizations

Aayushfaced's activity

We Got Claude to Fine-Tune an Open Source LLM

New in llama.cpp: Model Management

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

The Smol Training Playbook

FineWeb: decanting the web for the finest text data at scale

Robot Learning: A Tutorial

The Ultra-Scale Playbook