T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published 18 days ago • 112
iiiorg/piiranha-v1-detect-personal-information Token Classification • 0.3B • Updated Nov 1 • 14.8k • 228
Few-Bit Backward: Quantized Gradients of Activation Functions for Memory Footprint Reduction Paper • 2202.00441 • Published Feb 1, 2022 • 1
Memory-Efficient Backpropagation through Large Linear Layers Paper • 2201.13195 • Published Jan 31, 2022 • 1
Kandinsky: an Improved Text-to-Image Synthesis with Image Prior and Latent Diffusion Paper • 2310.03502 • Published Oct 5, 2023 • 78