z-lab/gpt-oss-20b-DFlash
Text Generation
•
0.8B
•
Updated
•
103
•
1
Efficient AI
DFlash: Block Diffusion for Flash Speculative Decoding
ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference