Hugging Face

Thomas Wang

TimeRobber
  • thomasw21

AI & ML interests

Large Language Models, Efficient NLP, NeRF

Organizations

Safetensors, BigScience Workshop, HF Internships, BigScience Catalogue Data, BigScience Data, BigScience Catalogue Data Dev, Team 7, BigCode, ShapeNet

authored 2 papers over 1 year ago

OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents

Paper • 2306.16527 • Published Jun 21, 2023 • 46

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 51
authored a paper almost 2 years ago

FinGPT: Large Generative Models for a Small Language

Paper • 2311.05640 • Published Nov 3, 2023 • 32
authored a paper about 2 years ago

What Language Model to Train if You Have One Million GPU Hours?

Paper • 2210.15424 • Published Oct 27, 2022 • 2
authored 5 papers over 2 years ago

StarCoder: may the source be with you!

Paper • 2305.06161 • Published May 9, 2023 • 31

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Paper • 2303.03915 • Published Mar 7, 2023 • 7

Crosslingual Generalization through Multitask Finetuning

Paper • 2211.01786 • Published Nov 3, 2022 • 2

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 33

Multitask Prompted Training Enables Zero-Shot Task Generalization

Paper • 2110.08207 • Published Oct 15, 2021 • 2