| | --- |
| | license: mit |
| | base_model: |
| | - Qwen/Qwen3-4B-Instruct-2507 |
| | --- |
| | |
| | ## LiteCoder-4b-Terminal-preview |
| |
|
| | **LiteCoder-4b-Terminal-preview** is part of our series of models specialized in terminal-based interactions and stems from our recent efforts to develop capable small and medium-sized code agent models. The model is fine-tuned from ` |
| | Qwen3-4B-Instruct-2507` on the [LiteCoder-SFT-Terminal-preview](https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview) dataset. |
| |
|
| | **Notably, this model achieves competitive results using fewer than 1,000 training samples.** By relying entirely on a fully synthetic pipeline—without converting any existing datasets—we were able to secure significant gains on the challenging Terminal Bench, matching the performance of leading open-source models with extreme data efficiency. |
| |
|
| | ## Released Artifacts |
| |
|
| | | 2025/12/17 | | | |
| | | --- | --- | --- | |
| | | LiteCoder-4b-Terminal-preview | Model | https://huggingface.co/Lite-Coder/LiteCoder-4b-Terminal-preview | |
| | | LiteCoder-SFT-Terminal-preview | Dataset | https://huggingface.co/datasets/Lite-Coder/LiteCoder-SFT-Terminal-preview | |
| |
|
| | ## Results |
| |
|
| | Our models achieve competitive results on **Terminal Bench**, significantly outperforming general-purpose models of similar (and even larger) sizes. |
| |
|
| | **Terminal Bench 1.0 Performance** |
| |
|
| | | **Model** | **Agent** | **Results** | |
| | | --- | --- | --- | |
| | | **LiteCoder-30a3b-Terminal-preview** | Terminus 2 | **18.75%** | |
| | | Qwen3-30B-A3B-Nex-N1 | Terminus 2 | 18.75% | |
| | | **LiteCoder-4b-Terminal-preview** | Terminus 2 | **13.75%** | |
| | | Qwen3-30B-A3B-Instruct | Terminus 2 | 12.5% | |
| | | Qwen3-4B-Instruct | Terminus 2 | 5.0% | |
| |
|
| | **Terminal Bench 2.0 Performance** |
| |
|
| | | **Model** | **Agent** | **Results** | |
| | | --- | --- | --- | |
| | | **LiteCoder-30a3b-Terminal-preview** | Terminus 2 | **5.6%** | |
| | | **LiteCoder-4b-Terminal-preview** | Terminus 2 | **3.3%** | |
| | | Qwen3-32B | Terminus 2 | 1.9% | |
| | | InternLM3-8B-Nex-N1 | Terminus 2 | 0% | |
| | | Qwen3-8B | Terminus 2 | 0% | |
| |
|
| | ## Citation |
| |
|
| | ```latex |
| | @misc{LiteCoder Team, |
| | title={LiteCoder: Advancing Small and Medium-sized Code Agents}, |
| | author={Xiaoxuan Peng and Xinyu Lu and Kaiqi Zhang and Taosong Fang and Boxi Cao and Yaojie Lu}, |
| | year={2025}, |
| | } |
| | ``` |
| |
|
| | ## Future Directions |
| |
|
| | - **Scaling Environments:** Expanding the diversity of Docker environments and teacher models to improve generalization. |
| | - **Agentic RL:** Implementing Reinforcement Learning specifically for multi-turn agentic workflows. |
| |
|
| | ## Team & Contributions |
| |
|
| | - **Xiaoxuan Peng:** Main Contributor |
| | - [Xinyu Lu](https://scholar.google.com/citations?user=_OsLG8EAAAAJ&hl=zh-CN)**:** Project Lead |
| | - **Kaiqi Zhang:** Contributor |
| | - **Taosong Fang**: Contributor |
| | - **Boxi Cao:** Contributor |
| | - **Yaojie Lu:** Contributor |
| |
|
| | ## Acknowledgements |
| |
|
| | LiteCoder builds upon multiple open-source projects, including [Harbor](https://github.com/laude-institute/harbor). The models are trained using [AutoAlign](https://github.com/icip-cas/AutoAlign). |
| |
|
| | ## Join Us |
| |
|
| | Join the discussion on our [Discord](https://discord.gg/EX9qZe8B). |
| |
|