prithivMLmods/Ophiuchi-Qwen3-14B-Instruct Text Generation β’ 15B β’ Updated May 12, 2025 β’ 198 β’ 9
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Paper β’ 2503.09516 β’ Published Mar 12, 2025 β’ 38