Data that can be used for developing developmentally plausible language models in German.
Bastian Bunzeck
bbunzeck
AI & ML interests
Cognitive and usage-based approaches to language modeling, language acquisition in humans and machines, small and efficient language models
Recent Activity
updated
a collection
3 days ago
German BabyLM
updated
a collection
3 days ago
German BabyLM
updated
a collection
3 days ago
German BabyLM
Organizations
Collections
4
Papers
1
models
14
bbunzeck/gpt-wee-large-curriculum
Text Generation
•
Updated
•
173
bbunzeck/gpt-wee-large
Text Generation
•
Updated
•
183
bbunzeck/gpt-wee-small-curriculum
Text Generation
•
Updated
•
179
bbunzeck/gpt-wee-small
Text Generation
•
Updated
•
171
bbunzeck/gpt-wee-medium
Text Generation
•
Updated
•
176
bbunzeck/tweenie_llama
Text Generation
•
Updated
•
69
bbunzeck/weenie_llama
Text Generation
•
Updated
•
64
bbunzeck/teenie_llama
Text Generation
•
Updated
•
68
bbunzeck/baby_llama
Text Generation
•
Updated
•
67
bbunzeck/phoneme-llama-no-whitespace
Text Generation
•
Updated
•
3.95k
datasets
12
bbunzeck/wikibooks-wikijunior
Viewer
•
Updated
•
7.6k
•
30
bbunzeck/mini-klexikon
Viewer
•
Updated
•
32.1k
•
35
bbunzeck/klexikon
Viewer
•
Updated
•
47.4k
•
18
bbunzeck/fluter
Viewer
•
Updated
•
84
•
28
bbunzeck/rhyme-sentences
Viewer
•
Updated
•
400
•
6
bbunzeck/wug-words
Viewer
•
Updated
•
1k
•
15
bbunzeck/phoneme-babylm-100M
Viewer
•
Updated
•
15.8M
•
41
bbunzeck/phoneme-blimp
Viewer
•
Updated
•
59.9k
•
70
bbunzeck/phoneme-babylm-10M
Viewer
•
Updated
•
3.92M
•
34
bbunzeck/minisiqa
Viewer
•
Updated
•
1.39k
•
50