-
BlackMamba: Mixture of Experts for State-Space Models
Paper • 2402.01771 • Published • 24 -
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Paper • 2402.01739 • Published • 27 -
DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
Paper • 2401.06066 • Published • 49
David Samuel
Davidsamuel101
AI & ML interests
NLP, Computer Vision
Recent Activity
updated
a dataset
3 days ago
bookbot/en_snapshot_madison_vc
published
a dataset
6 days ago
bookbot/en_snapshot_madison_vc
updated
a collection
13 days ago
English Synthetic Bookbot Books TTS Datasets