In a Training Loop 🔄

1 4 39

Michał Wiliński

MWilinski

https://michal-wilinski.com

AI & ML interests

Machine Learning, Reinforcement Learning

Recent Activity

updated a model 10 days ago

MWilinski/dro-v-qwen3-1.7b-paperlike

published a model 10 days ago

MWilinski/dro-v-qwen3-1.7b-paperlike

updated a dataset 11 days ago

MWilinski/hh-rlhf-irl

View all activity

Organizations

Collections 2

Papers 3

arxiv:2505.13291

arxiv:2502.06037

arxiv:2409.13530

spaces 3

models 3

MWilinski/dro-v-qwen3-1.7b-paperlike

Updated 9 days ago

MWilinski/dro-qwen3-1.7b-full-fixed-tau

Updated 24 days ago

MWilinski/dro-qwen3-1.7b-full

Updated 24 days ago

datasets 17

MWilinski/hh-rlhf-irl

Viewer • Updated 11 days ago • 10k • 89

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-policy

Viewer • Updated 12 days ago • 2k • 29

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-policy

Viewer • Updated 12 days ago • 2k • 27

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child

Viewer • Updated 12 days ago • 1.5k • 28

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult

Viewer • Updated 12 days ago • 1.5k • 30

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult

Viewer • Updated 12 days ago • 1.5k • 26

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child

Viewer • Updated 12 days ago • 1.5k • 31

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b

Viewer • Updated 12 days ago • 1.2k • 46

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-oss-20b

Viewer • Updated Feb 12 • 1.2k • 23

MWilinski/hh-rlhf-harmless-base

Viewer • Updated Nov 5, 2025 • 44.8k • 18

View 17 datasets

Michał Wiliński

AI & ML interests

Recent Activity

Organizations

Collections 2

Papers 3

spaces 3 Sort: Recently updated

Urban Autonomy Instance Segmentation

HF-Docs-QA

bit

models 3 Sort: Recently updated

datasets 17 Sort: Recently updated

spaces 3

models 3

datasets 17