Stas Bekman

stas
https://stasosphere.com/machine-learning/
  • StasBekman
  • stas00

AI & ML interests

Toolmaker. Software creator, optimizer and harmonizer. Makes things work and fly at Contextual.AI. Training LLM/RAG/Generative AI/Machine Learning/Scalability.

Recent Activity

updated a model 2 days ago
stas/ml-engineering-book
updated a model 5 months ago
stas/ml-engineering-book
posted an update 6 months ago
Do you want ArcticTraining at @SnowflakeDB to add an ability to post-train DeepSeek V3/R1 models with DPO using just a few GPU nodes? Please vote here and tell others about it: https://github.com/snowflakedb/ArcticTraining/discussions/58 ArcticTraining is an open-source, easy to use post-training framework for NVIDIA GPUs built on top of DeepSpeed.

Organizations

BigScience Workshop, Social Post Explorers

stas's models (9)

stas/ml-engineering-book

Updated 2 days ago • 19

stas/tiny-random-llama-2

Text Generation • 0.0B • Updated Nov 14, 2023 • 5.81k • 41

stas/tiny-m2m_100

Updated Apr 29, 2022 • 2.53k

stas/tr8b-104B-debug3

Updated Nov 29, 2021

stas/pegasus-cnn_dailymail-tiny-random

Updated Jul 1, 2021 • 384

stas/mt5-tiny-random

Updated Jun 23, 2021 • 69.3k • 2

stas/tiny-wmt19-en-de

Updated May 3, 2021 • 387 • 1

stas/tiny-wmt19-en-ru

Updated May 3, 2021 • 2.08k

stas/t5-very-small-random

Updated Apr 21, 2021 • 1 • 1