Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
sinnis's picture
2 7

sinnis

sinnis
·
  • sinnis1991

AI & ML interests

None yet

Organizations

None yet

sinnis 's collections 3

papers
  • Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

    Paper • 2508.08221 • Published Aug 11 • 49
knowledge
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 64
generative work
  • Idempotent Generative Network

    Paper • 2311.01462 • Published Nov 2, 2023 • 26
papers
  • Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

    Paper • 2508.08221 • Published Aug 11 • 49
generative work
  • Idempotent Generative Network

    Paper • 2311.01462 • Published Nov 2, 2023 • 26
knowledge
  • Direct Preference Optimization: Your Language Model is Secretly a Reward Model

    Paper • 2305.18290 • Published May 29, 2023 • 64
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs