Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
31
3
Stas Bekman
stas
Follow
RafaelZequeira's profile picture
NameeO's profile picture
muhammadzeeshan007's profile picture
113 followers
·
4 following
https://stasosphere.com/machine-learning/
StasBekman
stas00
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
updated
a model
2 days ago
stas/ml-engineering-book
updated
a model
5 months ago
stas/ml-engineering-book
posted
an
update
6 months ago
Do you want ArcticTraining at @SnowflakeDB to add an ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes? Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58 ArcticTraining is an open-source, easy to use post-training framework for NVIDIA GPUs built on top of DeepSpeed.
View all activity
Organizations
stas
's models
9
Sort: Recently updated
stas/ml-engineering-book
Updated
2 days ago
•
19
stas/tiny-random-llama-2
Text Generation
•
0.0B
•
Updated
Nov 14, 2023
•
5.81k
•
41
stas/tiny-m2m_100
Updated
Apr 29, 2022
•
2.53k
stas/tr8b-104B-debug3
Updated
Nov 29, 2021
stas/pegasus-cnn_dailymail-tiny-random
Updated
Jul 1, 2021
•
384
stas/mt5-tiny-random
Updated
Jun 23, 2021
•
69.3k
•
2
stas/tiny-wmt19-en-de
Updated
May 3, 2021
•
387
•
1
stas/tiny-wmt19-en-ru
Updated
May 3, 2021
•
2.08k
stas/t5-very-small-random
Updated
Apr 21, 2021
•
1
•
1