Vinh Nguyen's picture

Vinh Nguyen

vinhnx90

AI & ML interests

Learn by doing

Recent Activity

Organizations

None yet

vinhnx90's activity

upvoted 6 articles 1 day ago
view article
Article

SmolVLM2: Bringing Video Understanding to Every Device

β€’ 135
view article
Article

LLM Dataset Formats 101: A No‐BS Guide for Hugging Face Devs

By tegridydev β€’
β€’ 5
view article
Article

Smol but Mighty: Can Small Models Reason well? πŸ€”

By evijit β€’
β€’ 8
view article
Article

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

By NormalUhr β€’
β€’ 11
view article
Article

Faster fine-tuning using TRL & Unsloth

β€’ 52
upvoted an article 1 day ago
view article
Article

Tool Use, Unified

β€’ 84
reacted to dreamerdeo's post with πŸš€ 2 days ago
view post
Post
2698
πŸš€ Excited to share our technical report on the Southeast Asian multilingual model Sailor2 and its latest updates!

Our 49-page report details Sailor2's development journey, including multilingual data cleaning, small model data mixture simulations, multi-stage continual pre-training, multi-stage post-training, and multi-cultural multi-lingual evaluations. Sailor2 aims to streamline the multilingual model pre-training process efficiently for the community.

🧭 We highlight Sailor2's impressive performance in low-resource language translation scenarios and its cultural understanding advantages in Southeast Asia, promoting practical applications for regional languages.

Model updates include:Β 
πŸ’‘ More precise outputs: Reduced redundancy in model outputs through refined post-training data and optimization techniques.Β 
🌈 Handling longer texts: Expanded to handle up to 128K context length in Southeast Asian languages through long-text training. 
⚑️ Faster inference: Achieved 2.5x faster inference speed with speculative decoding. 
πŸŒͺ️ More model sizes: Introduced new sizes of 3B and 14B through model pruning.

🌟 All models are Apache-licensed for commercial use; development tools (code, resources) are open-source.

πŸ“š Technical report: Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs (2502.12982)Β 
πŸ€–οΈ Models: sail/sailor2-language-models-674d7c9e6b4dbbd9a869906bΒ 
πŸ’¬ Demo: sail/Sailor2-20B-ChatΒ 
πŸ“£ Sailor2 community: https://huggingface.co/sailor2
upvoted an article 12 days ago
view article
Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

By dvgodoy β€’
β€’ 7
upvoted an article 19 days ago
view article
Article

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

By Kseniase β€’
β€’ 14