Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
31
3
Stas Bekman
stas
Follow
admarcosai's profile picture
shubhamagarwal92's profile picture
Crystalpacking's profile picture
113 followers
·
4 following
https://stasosphere.com/machine-learning/
StasBekman
stas00
AI & ML interests
Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI Training LLM/RAG/Generative AI/Machine Learning/Scalability
Recent Activity
updated
a model
2 days ago
stas/ml-engineering-book
updated
a model
5 months ago
stas/ml-engineering-book
posted
an
update
6 months ago
Do you want ArcticTraining at @SnowflakeDB to add an ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes? Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58 ArcticTraining is an open-source, easy to use post-training framework for NVIDIA GPUs built on top of DeepSpeed.
View all activity
Organizations
stas
's datasets
8
Sort: Recently updated
stas/openwebtext-synthetic-testing
Updated
Nov 14, 2023
•
50
•
4
stas/oscar-en-10k
Viewer
•
Updated
Oct 19, 2022
•
10k
•
264
•
2
stas/c4-en-10k
Viewer
•
Updated
Oct 19, 2022
•
10k
•
257
•
4
stas/general-pmd-synthetic-testing
Updated
Oct 18, 2022
•
15
stas/cm4-synthetic-testing
Updated
Oct 18, 2022
•
34
stas/openwebtext-10k
Viewer
•
Updated
Sep 15, 2021
•
10k
•
1.87k
•
31
stas/wmt14-en-de-pre-processed
Viewer
•
Updated
Feb 16, 2021
•
4.55M
•
57
•
3
stas/wmt16-en-ro-pre-processed
Viewer
•
Updated
Feb 16, 2021
•
614k
•
41