LCM-Lab's Collections
Elastic-Attention
Updated Jan 28
Elastic Attention: Test-time Adaptive Sparsity Ratios for Efficient Transformers
LCM-Lab/full_xattn_64k_qwen3-4b_wfrozen • Text Generation • 4B • Updated Jan 28 • 1
LCM-Lab/full_streaming_64k_qwen3-4b_end0.7_wfrozen • Text Generation • 4B • Updated Jan 28 • 4
LCM-Lab/full_xattn_64k_qwen3-8b_end0.7_wfrozen • Text Generation • 8B • Updated Jan 28 • 2
LCM-Lab/full_xattn_64k_llama3.1-8b_wfrozen • Text Generation • 8B • Updated Jan 28
LCM-Lab/full_streaming_64k_qwen3-4b_MLP2.0_wfrozen • Text Generation • 4B • Updated Jan 28 • 4
LCM-Lab/1.2steps300_full_streaming_64k_qwen3-8b_wfrozen • Text Generation • 8B • Updated Jan 28 • 5
LCM-Lab/nsa_llama • Text Generation • 8B • Updated Jan 28 • 2
LCM-Lab/nsa_qwen3-4b • Text Generation • Updated Jan 28
LCM-Lab/nsa_qwen3-8b • Text Generation • 9B • Updated Jan 28 • 1
LCM-Lab/moba_qwen3-4b • Text Generation • 4B • Updated Jan 28 • 1
LCM-Lab/infllm_qwen3-4b • Text Generation • 4B • Updated Jan 28 • 1
LCM-Lab/infllm_llama • Text Generation • 8B • Updated Jan 28 • 2
LCM-Lab/moba_qwen3-8b • Text Generation • 8B • Updated Jan 28 • 2
LCM-Lab/infllm_qwen3-8b • Text Generation • 8B • Updated Jan 28 • 1
LCM-Lab/moba_llama • Text Generation • Updated Jan 28 • 1
LCM-Lab/full_streaming_64k_qwen3-4b_MLP8.0_wfrozen • Text Generation • 4B • Updated Jan 28 • 48
LCM-Lab/full_streaming_64k_qwen3-4b_MLP3.0_wfrozen • Text Generation • 4B • Updated Jan 28 • 1