HolyFox

Holy-fox

AI & ML interests

A high school student who likes LLMs. My specialty is synthetic data.

Recent Activity

liked a model 9 days ago

nvidia/Cosmos3-Nano

published a dataset 11 days ago

TeamDelta/Nemotron-Cascade-2-RL-reproduction

updated a dataset 11 days ago

TeamDelta/Nemotron-Cascade-2-RL-reproduction

upvoted a collection 23 days ago

Nemotron-Post-Training-v3

Collection of datasets used in the post-training phase of Nemotron Nano, Super, and Ultra v3. • 50 items • Updated about 5 hours ago • 155

upvoted a collection about 2 months ago

DeepSeek-V4

4 items • Updated Apr 24 • 675

upvoted a collection 2 months ago

LLM-jp-4 Models

5 items • Updated Mar 31 • 16

upvoted a collection 3 months ago

Qwen3.5

21 items • Updated Mar 9 • 1.67k

upvoted a collection 6 months ago

Nemotron-Cascade

Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 14 items • Updated 3 days ago • 55

upvoted a paper 8 months ago

DeepMMSearch-R1: Empowering Multimodal LLMs in Multimodal Web Search

Paper • 2510.12801 • Published Oct 14, 2025 • 14

upvoted 2 papers 10 months ago

SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities

Paper • 2502.12025 • Published Feb 17, 2025 • 3

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 190

upvoted a collection 10 months ago

gpt-oss

Open-weight models designed for powerful reasoning, agentic tasks, and versatile developer use cases. • 2 items • Updated Aug 7, 2025 • 442