James

jtatman

AI & ML interests

improving domain specific models and re-sampling data, refining datasets for use in different modalities, small scale micro-llm clusters using quantized and smoothed models, and all emerging llm stack connecting technologies. Small models rock.

Recent Activity

liked a model 12 days ago

DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF

liked a model 12 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

liked a model 12 days ago

meta-llama/Llama-3.2-3B-Instruct

View all activity

Organizations

upvoted a paper 16 days ago

InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation

Paper • 2507.17520 • Published 29 days ago • 14

upvoted an article 4 months ago

Article

Page-to-Video: Generate videos from webpages 🪄🎬

•

May 6

• 27

upvoted a paper 9 months ago

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46

upvoted a paper 11 months ago

Paper Copilot: A Self-Evolving and Efficient LLM System for Personalized Academic Assistance

Paper • 2409.04593 • Published Sep 6, 2024 • 27

upvoted a collection about 1 year ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 683

upvoted an article about 1 year ago

Article

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

and 7 others •

Jul 23, 2024

• 237

upvoted 2 papers about 1 year ago

Retrieval-Enhanced Machine Learning: Synthesis and Opportunities

Paper • 2407.12982 • Published Jul 17, 2024 • 6

Model Merging and Safety Alignment: One Bad Model Spoils the Bunch

Paper • 2406.14563 • Published Jun 20, 2024 • 31

upvoted an article about 1 year ago

Article

Welcome Gemma - Google's new open LLM

and 2 others •

Feb 21, 2024

• 25

upvoted a collection about 1 year ago

abliterated-v3

Collection

Latest gen of the abliterated models I've produced • 17 items • Updated Jun 3, 2024 • 127

upvoted an article over 1 year ago

Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

•

Jun 23, 2024

• 34

upvoted 5 papers over 1 year ago

Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers

Paper • 2311.10642 • Published Nov 17, 2023 • 26

James

AI & ML interests

Recent Activity

Organizations

jtatman's activity

Page-to-Video: Generate videos from webpages 🪄🎬

Llama 3.1 - 405B, 70B & 8B with multilinguality and long context

Welcome Gemma - Google's new open LLM

SeeMoE: Implementing a MoE Vision Language Model from Scratch