Catherine Arnett
catherinearnett
41 followers · 22 following
https://catherinearnett.github.io/
linguist_cat
catherinearnett
catherinearnett.bsky.social
AI & ML interests
multilingual NLP, tokenization
Recent Activity
Updated a dataset 6 days ago: catherinearnett/eng_montok
Published a dataset 6 days ago: catherinearnett/eng_montok
Authored a paper about 1 month ago: BPE Stays on SCRIPT: Structured Encoding for Robust Multilingual Pretokenization
catherinearnett's models (18), most recently updated first
catherinearnett/B-GPT_pl_en_sequential • Text Generation • 0.1B • Updated Jun 12 • 425
catherinearnett/B-GPT_en_pl_sequential • Text Generation • 0.1B • Updated Jun 12 • 344
catherinearnett/B-GPT_pl_en_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 553
catherinearnett/B-GPT_en_pl_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 691
catherinearnett/B-GPT_el_en_sequential • Text Generation • 0.1B • Updated Jun 12 • 439
catherinearnett/B-GPT_en_el_sequential • Text Generation • 0.1B • Updated Jun 12 • 394
catherinearnett/B-GPT_el_en_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 447
catherinearnett/B-GPT_en_el_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 479
catherinearnett/B-GPT_es_en_sequential • Text Generation • 0.1B • Updated Jun 12 • 468
catherinearnett/B-GPT_en_es_sequential • Text Generation • 0.1B • Updated Jun 12 • 370
catherinearnett/B-GPT_es_en_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 677
catherinearnett/B-GPT_en_es_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 539
catherinearnett/B-GPT_nl_en_sequential • Text Generation • 0.1B • Updated Jun 12 • 196
catherinearnett/B-GPT_en_nl_sequential • Text Generation • 0.1B • Updated Jun 12 • 132
catherinearnett/B-GPT_nl_en_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 325
catherinearnett/B-GPT_en_nl_simultaneous • Text Generation • 0.1B • Updated Jun 12 • 2.78k
catherinearnett/pythia-1b-bigram_masked • Updated May 1
catherinearnett/pythia-160m-bigram_masked • Updated May 1