- MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
  Viewer • Updated • 1.5k • 26
- MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
  Viewer • Updated • 1.5k • 30
- MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
  Viewer • Updated • 1.5k • 28
- MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
  Viewer • Updated • 1.5k • 30
Michał Wiliński
MWilinski
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
- updated a model 8 days ago: MWilinski/dro-v-qwen3-1.7b-paperlike
- published a model 8 days ago: MWilinski/dro-v-qwen3-1.7b-paperlike
- updated a dataset 9 days ago: MWilinski/hh-rlhf-irl