
Tien Dung

tiendung

AI & ML interests

None yet

Recent Activity

liked a model about 1 month ago
SparseLLM/BlockFFN-3B-SFT
liked a model about 1 month ago
turboderp/ERNIE-4.5-300B-A47B-PT-exl3
reacted to Jaward's post with 😎 about 2 months ago
I played around with the new RXTX paper (XX^T) and was able to train nanogpt with 4x4 RXTX matmuls in both attention layer and optimizer🤕 It just works (well I had to add some guardrails) but still saves 5% of memory usage:

The Patch:
  • Computes attention scores with a 4x4 blockwise RXTX matmuls (no pytorch dot prod)
  • Handles arbitrary sequence lengths by padding to the nearest multiple of 4.
  • An RXTX variant of shampoo with params reshaped into 4x4 blocks during each optimizer step.
  • Uses 5% less ops

Code: https://github.com/Jaykef/ai-algorithms/blob/main/nanogpt-rxtx.ipynb
Paper: https://arxiv.org/pdf/2505.09814

Organizations

  • Symato Team
  • Tiny Monsters
  • Vietnamese Mistral

published an article 11 months ago

Interpretable Preferences via Multi-Objective Reward Modeling and Mixture of Experts

By tiendung •
Sep 29, 2024
• 1
published an article 11 months ago

The Bitter Lesson in AI

By tiendung •
Sep 29, 2024
• 1