GREAM
Collection
The collection for paper "Generative Reasoning Recommendation via LLMs"
•
4 items
•
Updated
Paper: Generative Reasoning Recommendation via LLMs, 2025.
Authors: Minjie Hong*, Zetong Zhou*, Zirun Guo, Ziang Zhang, Ruofan Hu, Weinan Gan, Jieming Zhu, Zhou Zhao†
Repository: https://github.com/Indolent-Kawhi/GRRM
HF Papers Link: https://huggingface.co/papers/2510.20815
GREAM (Generative Reasoning Recommendation Model) is a large language model (LLM)-based generative reasoning recommender designed to unify understanding, reasoning, and prediction for recommendation tasks.
It introduces a reasoning-enhanced, verifiable reinforcement learning framework that allows both high-throughput direct recommendations and interpretable reasoning-based outputs.
| Component | Description |
|---|---|
| Backbone | Qwen3-4B-Instruct |
| Indexing | Residual Quantization (RQ-KMeans, 5 levels, 256 values per level) |
| Training Phases | ① Collaborative–Semantic Alignment → ② Reasoning Curriculum Activation → ③ SRPO Reinforcement Learning |
| Inference Modes | - Direct Sequence Recommendation: low-latency item generation - Sequential Reasoning Recommendation: interpretable CoT reasoning chains |
| RL Framework | Verl + SGLang backend |
| Data Type | Source | Description |
|---|---|---|
| Dalign | Amazon Review Datasets (Beauty, Sports, Instruments) | Sequential, semantic reconstruction, and preference understanding tasks |
| Dreason | Synthetic CoT data generated via GPT-5 / Qwen3-30B / Llama-3.1 | Multi-step reasoning sequences with <think>...</think> and <answer>...</answer> supervision |
| Text Sources | Item titles, descriptions, and high-quality reviews | Combined and rewritten to form dense item semantics |
@misc{hong2025generativereasoningrecommendationllms,
title={Generative Reasoning Recommendation via LLMs},
author={Minjie Hong and Zetong Zhou and Zirun Guo and Ziang Zhang and Ruofan Hu and Weinan Gan and Jieming Zhu and Zhou Zhao},
year={2025},
eprint={2510.20815},
archivePrefix={arXiv},
primaryClass={cs.IR},
url={https://arxiv.org/abs/2510.20815},
}