XueZhang-bjtu's Collections

M-Thinker-Data

updated Oct 14

Data for the paper "Think Natively: Unlocking Multilingual Reasoning with Consistency-Enhanced Reinforcement Learning" (https://arxiv.org/pdf/2510.07300)

  • XueZhang-bjtu/M-Thinker-SFT-data

    Viewer • Updated Oct 13 • 20.1k • 37

  • XueZhang-bjtu/Light-R1-SFTData-question-translated-76K

    Viewer • Updated Oct 14 • 151k • 23

  • XueZhang-bjtu/M-Thinker-1.5B-RL-Iter1-data

    Viewer • Updated Oct 14 • 15.1k • 22

  • XueZhang-bjtu/M-Thinker-1.5B-RL-Iter2-data

    Viewer • Updated Oct 14 • 15.1k • 15

  • XueZhang-bjtu/M-Thinker-7B-RL-Iter1-data

    Viewer • Updated Oct 14 • 15.1k • 13

  • XueZhang-bjtu/M-Thinker-7B-RL-Iter2-data

    Viewer • Updated Oct 14 • 15.1k • 12
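
For readers who want to work with the datasets listed above programmatically, the following is a minimal sketch, not an official loader. The repository IDs are copied from the listing. The helper names (`hub_url`, `rl_datasets`, `load`) are hypothetical. The listing does not state configuration or split names, so `load` passes none and may need adjusting for a given repository.

```python
# Repository IDs copied verbatim from the collection listing.
COLLECTION_DATASETS = [
    "XueZhang-bjtu/M-Thinker-SFT-data",
    "XueZhang-bjtu/Light-R1-SFTData-question-translated-76K",
    "XueZhang-bjtu/M-Thinker-1.5B-RL-Iter1-data",
    "XueZhang-bjtu/M-Thinker-1.5B-RL-Iter2-data",
    "XueZhang-bjtu/M-Thinker-7B-RL-Iter1-data",
    "XueZhang-bjtu/M-Thinker-7B-RL-Iter2-data",
]


def hub_url(repo_id: str) -> str:
    """Return the Hub page URL for a dataset repository."""
    return f"https://huggingface.co/datasets/{repo_id}"


def rl_datasets(model_size: str) -> list[str]:
    """Hypothetical helper: pick the RL datasets for a model size ("1.5B" or "7B")."""
    return [r for r in COLLECTION_DATASETS if f"-{model_size}-RL-" in r]


def load(repo_id: str):
    """Load one dataset with the `datasets` library.

    Requires network access and `pip install datasets`. No config or split
    name is passed because the listing does not state any (assumption: the
    default configuration is sufficient).
    """
    from datasets import load_dataset  # imported lazily so the helpers above work offline

    return load_dataset(repo_id)
```

Calling `load(COLLECTION_DATASETS[0])` would attempt to download the SFT data. `rl_datasets("7B")` selects the two 7B RL iterations without touching the network.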