- Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment
  Paper • 2405.03594 • Published • 7
- Sparse Finetuning for Inference Acceleration of Large Language Models
  Paper • 2310.06927 • Published • 14
- SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
  Paper • 2301.00774 • Published • 3
- The Optimal BERT Surgeon: Scalable and Accurate Second-Order Pruning for Large Language Models
  Paper • 2203.07259 • Published • 4
Angelos Vlachos
aavlachos
AI & ML interests
Computer Vision, NLP
Recent Activity
liked a model about 1 month ago
deepseek-ai/DeepSeek-R1
liked a model 2 months ago
neuralmagic/Sparse-Llama-3.1-8B-2of4
liked a model 2 months ago
meta-llama/Llama-3.1-8B
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet