[ICLR 2026] Official repository of "Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs". Repo: https://github.com/pspdada/Uni-DPO
- Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs
  Paper • 2506.10054
- psp-dada/Uni-DPO
- psp-dada/Qwen2.5-7B-Uni-DPO
  Text Generation • 8B
- psp-dada/Llama-3-8B-Instruct-Uni-DPO-v2-GPT-4o
  Text Generation • 8B