PenutChen

penut85420

AI & ML interests

LLM, Quantization

Recent Activity

updated a Space 6 days ago

DaOppaiLoli/JpVocab

new activity 7 days ago

MediaTek-Research/Llama-Breeze2-8B-Instruct:`.save_pretrained()` failed

upvoted a paper 7 days ago

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

penut85420's activity

updated a Space 6 days ago

JpVocab

✏

Take a Japanese vocabulary quiz

New activity in MediaTek-Research/Llama-Breeze2-8B-Instruct 7 days ago

`.save_pretrained()` failed

#4 opened 7 days ago by

penut85420

upvoted a paper 7 days ago

The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities

Paper • 2501.13921 • Published Jan 23 • 3

liked a Space 10 days ago

JpVocab

✏

Take a Japanese vocabulary quiz

published a Space 12 days ago

JpVocab

✏

Take a Japanese vocabulary quiz

liked 2 models about 1 month ago

jinaai/ReaderLM-v2

Text Generation • Updated 18 days ago • 26.2k • 514

sentence-transformers/static-similarity-mrl-multilingual-v1

commented a paper about 2 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 13

upvoted a paper about 2 months ago

Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens

Paper • 2411.17691 • Published Nov 26, 2024 • 13

liked a model 2 months ago

IamCreateAI/Ruyi-Mini-7B

Image-to-Video • Updated Dec 25, 2024 • 2.81k • 595

upvoted a collection 2 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 138

updated a Space 2 months ago

KanaQuiz

📝

Take a kana quiz and learn romaji

upvoted a paper 4 months ago

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

Paper • 2410.10739 • Published Oct 14, 2024 • 2

commented a paper 4 months ago

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

Paper • 2410.10739 • Published Oct 14, 2024 • 2

New activity in yentinglin/Llama-3-Taiwan-8B-Instruct 4 months ago

Was the tokenizer retrained?

#9 opened 8 months ago by

tedslin

commented a paper 5 months ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29

upvoted a paper 5 months ago

Instruction Following without Instruction Tuning

Paper • 2409.14254 • Published Sep 21, 2024 • 29

liked a model 5 months ago

MediaTek-Research/Breeze-7B-FC-v1_0

Updated Jan 15 • 352 • 20

commented a paper 5 months ago

A Comprehensive Evaluation of Quantized Instruction-Tuned Large Language Models: An Experimental Analysis up to 405B

Paper • 2409.11055 • Published Sep 17, 2024 • 17

liked a model 5 months ago

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 1.05k • 588