Large Language Models are Locally Linear Mappings Paper • 2505.24293 • Published May 30, 2025 • 14 • 4
Lowering PyTorch's Memory Consumption for Selective Differentiation Paper • 2404.12406 • Published Apr 15, 2024 • 1 • 1