ut-enyac/mamba2-8b-converted-uniql-1.0-masked-lora-rft-w4a16
0.1B
•
Updated
•
19
Energy-aware Computing, Low Power Design, EDA, Dark Silicon, Efficient Deep Learning
UniQL: Unified Quantization and Low-rank Compression for Adaptive Edge LLMs
Quamba2: A Robust and Scalable Post-training Quantization Framework for Selective State Space Models
Totally Free + Zero Barriers + No Login Required