Zeynel

zeynel
·

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago
Qwen/Qwen2.5-VL-3B-Instruct-AWQ
liked a model 5 days ago
perplexity-ai/r1-1776
liked a model 11 days ago
microsoft/mattergen
View all activity

Organizations

Die Linke's profile picture

zeynel's activity

reacted to schuler's post with 🔥 13 days ago
view post
Post
7217
📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.

🔑 Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm
  • 2 replies
·
New activity in deepseek-ai/DeepSeek-R1 29 days ago

DesspSeek Censorship

13
#42 opened about 1 month ago by
rzgar