Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
LAION eV
non-profit
AI & ML interests
open multi-modal foundation models and datasets for their creation; scaling laws, model evaluation; fully local, sovereign model deployment, personalized assistants and open local agentic systems
Recent Activity
View all activity
Organization Card
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 11 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 10 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 15 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 13
Collection of models and dataset related to MixtureVitae, open and fully reproducible pretraining dataset built from permissive sources
models and datasets related to openthoughts 4 experiments
-
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen3-1.7B_32k
2B • Updated • 11 -
laion/openthoughts-4-code-qwen3-32b-annotated-32k_qwen2.5-1.5B_32k
Text Generation • 2B • Updated • 10 -
laion/openthoughts-3-QwQ-32b-annotated-16k_qwen2.5-1.5B_16k
Text Generation • 2B • Updated • 15 -
laion/openthoughts-4-code-qwen3-32b-annotated-7k_qwen3-1.7B_10k
Text Generation • 2B • Updated • 13
models
255
laion/GLM-4_7-swesmith-sandboxes-with_tests-oracle_verified_120s-maxeps-131k
Text Generation
•
308k
•
Updated
•
4
laion/exp-syh-r2egym-swesmith-mixed_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
•
4
laion/dev_set_part1_10k_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
•
22
laion/music-whisper
Automatic Speech Recognition
•
0.2B
•
Updated
•
10
laion/exp-uns-r2egym-2_1x_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
•
22
laion/exp-gfi-staqc-short-response-filtered-10K_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
•
21
laion/GLM-4.6-stackexchange-overflow-sandboxes-32eps-65k-reasoning_adam-beta1_0-97_Qwen3-32B
Text Generation
•
33B
•
Updated
•
4
laion/glm46-bash-textbook-traces
Text Generation
•
308k
•
Updated
•
14
laion/exp-gfi-staqc-random-filtered-10K_glm_4_7_traces_locetash
Text Generation
•
308k
•
Updated
•
30
laion/GLM-4_7-inferredbugs-sandboxes-maxeps-131k
Text Generation
•
308k
•
Updated
•
27
datasets
179
laion/CLIP-ViT-H-14-laion2B-s32B-b79K-all-checkpoints
Updated
•
88
•
2
laion/majestrino-data
Viewer
•
Updated
•
7.6M
•
6.17k
laion/majestrino-data-v2
Updated
•
9
laion/common-voice-subset-for-clap
Viewer
•
Updated
•
10
•
126
•
1
laion/speech-attributes-classification
Updated
•
32
•
1
laion/openthoughts-4-math-qwen3-32b-7k-annotated-sharegpt
Viewer
•
Updated
•
3.5M
•
6
laion/timbre-audio-caption-pairs
Viewer
•
Updated
•
830k
•
328
•
1
laion/voice_tag-audio-pairs
Updated
•
1
laion/openthoughts-4-code-qwen3-32b-7k-annotated-sharegpt
Viewer
•
Updated
•
959k
•
5
laion/Qwen3-32B_hero_run_4_code_32k-sharegpt
Updated
•
82