Sung-Feng Huang
sungfengh
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 hour ago
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
upvoted
a
paper
10 days ago
Masking Teacher and Reinforcing Student for Distilling Vision-Language Models
updated
a dataset
17 days ago
PeacefulData/SINE_v2