Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Catherine Arnett's picture
3 5 7

Catherine Arnett

catherinearnett
privru's profile picture stefan-it's profile picture lunarflu's profile picture
·
https://catherinearnett.github.io/
  • linguist_cat
  • catherinearnett
  • catherinearnett.bsky.social

AI & ML interests

multilingual NLP, tokenization

Recent Activity

updated a dataset 7 days ago
catherinearnett/eng_montok
published a dataset 7 days ago
catherinearnett/eng_montok
authored a paper about 1 month ago
BPE Stays on SCRIPT: Structured Encoding for Robust Multilingual Pretokenization
View all activity

Organizations

Blog-explorers's profile picture Language and Cognition Lab (UCSD)'s profile picture

catherinearnett 's datasets 2

catherinearnett/eng_montok

Updated 7 days ago • 70

catherinearnett/morphscore

Viewer • Updated Jul 10 • 5.09M • 365 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略