Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
mindchain 's Collections
Google Gemma
Nemo-Gym
Reward Models
Trained
Encoder/Decoder Architecture
Gemma’s Soul-Vault: Evolutionary JumpReLU Steering Hub
Mamba/Transformers Combo
EDGE - Funktion Calling

Reward Models

updated 4 days ago
Upvote
1

  • nvidia/Nemotron-4-340B-Reward

    Updated Jun 19, 2024 • 50 • 125

  • nvidia/Qwen3-Nemotron-8B-BRRM

    Text Generation • Updated 8 days ago • 735 • 8

  • nvidia/Llama-3.3-Nemotron-70B-Reward-Principle

    Text Generation • 71B • Updated Oct 30 • 106 • 5

  • nvidia/Qwen3-Nemotron-32B-GenRM-Principle

    Text Generation • 33B • Updated Oct 30 • 831 • 11

  • nvidia/Qwen3-Nemotron-32B-RLBFF

    Text Generation • 33B • Updated Oct 31 • 141 • 27

  • nvidia/Qwen3-Nemotron-14B-BRRM

    Text Generation • Updated 8 days ago • 216 • 11

  • nvidia/HelpSteer3

    Viewer • Updated Nov 16 • 133k • 2.7k • 93
Upvote
1
  • Collection guide
  • Browse collections
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs