70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper โข 2504.11651 โข Published Apr 15, 2025 โข 31