James PRO

jtatman

AI & ML interests

improving domain specific models and re-sampling data, refining datasets for use in different modalities, small scale micro-llm clusters using quantized and smoothed models, and all emerging llm stack connecting technologies. Small models rock.

Recent Activity

liked a model about 10 hours ago
NousResearch/DeepHermes-3-Llama-3-8B-Preview
liked a model about 11 hours ago
HuggingFaceTB/SmolVLM-256M-Instruct
liked a model about 13 hours ago
BAAI/bge-base-en-v1.5
View all activity

Organizations

ZeroGPU Explorers's profile picture The Hydra Project's profile picture Tatman ML Technologies's profile picture M4-ai's profile picture

jtatman's activity

upvoted an article 7 months ago
view article
Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

227
upvoted an article 8 months ago
view article
Article

Welcome Gemma - Google's new open LLM

22
upvoted an article 10 months ago
view article
Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

34