1 8 65

Vadim Smolyakov

vsmolyakov

https://vsmolyakov.github.io/

AI & ML interests

Machine Learning Engineer @ Microsoft

Recent Activity

upvoted a paper 4 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

liked a model 2 months ago

moonshotai/Kimi-K2-Thinking

liked a model 3 months ago

MiniMaxAI/MiniMax-M2

View all activity

Organizations

None yet

liked a model 2 months ago

moonshotai/Kimi-K2-Thinking

Text Generation • Updated Nov 8, 2025 • 224k • • 1.63k

liked 2 models 3 months ago

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 28 days ago • 123k • • 1.45k

zeroentropy/zerank-1

Text Ranking • 4B • Updated Nov 19, 2025 • 2.53k • 73

liked a model 4 months ago

zai-org/GLM-4.5

Text Generation • 358B • Updated Aug 11, 2025 • 20.1k • • 1.39k

liked a dataset 4 months ago

openai/gdpval

Viewer • Updated Sep 25, 2025 • 220 • 22.9k • 448

liked 2 models 5 months ago

openai/gpt-oss-120b

Text Generation • 120B • Updated Aug 26, 2025 • 3.08M • • 4.36k

gghfez/Mistral-Small-3.2-24B-Instruct-hf-AWQ

Text Generation • 24B • Updated Jun 25, 2025 • 252 • 4

liked 2 datasets 6 months ago

Salesforce/CRMArenaPro

Viewer • Updated Jul 9, 2025 • 8.61k • 397 • 15

Salesforce/CRMArena

Viewer • Updated Jun 18, 2025 • 1.19k • 227 • 8

liked 3 models 9 months ago

liked a dataset 9 months ago

allenai/reward-bench

Viewer • Updated Sep 9, 2024 • 8.11k • 5.26k • 104

liked 4 models 10 months ago

weqweasdas/RM-Mistral-7B

Text Classification • 7B • Updated Mar 31, 2024 • 2.49k • 24

RLHFlow/ArmoRM-Llama3-8B-v0.1

Text Classification • 8B • Updated Sep 23, 2024 • 9.69k • 184

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated 29 days ago • 74k • 1.34k

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 86.3k • • 2.88k

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.65k

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 12 months ago

mistralai/Mistral-Small-24B-Instruct-2501

24B • Updated Jul 28, 2025 • 760k • 949

deepseek-ai/DeepSeek-R1-Distill-Llama-70B

Text Generation • 71B • Updated Feb 24, 2025 • 226k • • 738

Vadim Smolyakov

AI & ML interests

Recent Activity

Organizations

vsmolyakov's activity

The Ultra-Scale Playbook