Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
5
Maggie Huan
Ibisbill
Follow
0 followers
·
3 following
Ibisbill
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
3 days ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
upvoted
a
paper
9 days ago
Beyond Transcription: Mechanistic Interpretability in ASR
upvoted
a
paper
11 days ago
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
View all activity
Organizations
models
13
Sort: Recently updated
Ibisbill/stage4_OpenThinker2_step50
Text Generation
•
7B
•
Updated
Jun 1
•
8
Ibisbill/stage3_SimpleRL_lr_1e5_epoch2
Text Generation
•
7B
•
Updated
May 28
•
6
Ibisbill/stage3_OpenR1_lr_1e5_epoch2
Text Generation
•
7B
•
Updated
May 28
•
5
Ibisbill/stage3_S1.1_lr_1e5
Text Generation
•
7B
•
Updated
May 28
•
11
Ibisbill/stage3_OpenThinker2-7B_lr_5e6
Text Generation
•
7B
•
Updated
May 24
•
6
Ibisbill/stage3_OpenThinker2-7B_lr_1e5
Text Generation
•
7B
•
Updated
May 24
•
6
Ibisbill/stage3_OpenThinker-7B_lr_1e5
Updated
May 24
Ibisbill/OpenThinker2-7B_lr_1e5
Updated
May 24
Ibisbill/ensemble-linear-regression
Updated
Dec 13, 2024
Ibisbill/reddit_superclean_raid
0.4B
•
Updated
Dec 5, 2024
•
4
View 13 models
datasets
13
Sort: Recently updated
Ibisbill/Clustering_deduplicated_reasoning
Viewer
•
Updated
Jun 6
•
56.4k
•
40
Ibisbill/Semantic_similarity_deduplicated_reasoning_data_english
Viewer
•
Updated
Jun 6
•
77.7k
•
22
Ibisbill/hash_deduplicated_reasoning_data_english
Viewer
•
Updated
Jun 6
•
72.7k
•
34
Ibisbill/General_English_only_SFT_Filtered_655k
Updated
Jun 6
•
54
•
1
Ibisbill/General_English_only_SFT_Filtered_25k
Updated
Jun 5
•
11
Ibisbill/General_SFT_Filtered_25k
Viewer
•
Updated
Jun 5
•
25k
•
7
Ibisbill/Tagging_Data_Simple_20k
Viewer
•
Updated
May 14
•
20k
•
8
Ibisbill/Tagging_Data_Full_63k
Viewer
•
Updated
May 14
•
63.6k
•
8
Ibisbill/Tagging_Data
Viewer
•
Updated
May 14
•
63.6k
•
6
Ibisbill/dnd-dataset-20pct-improved
Viewer
•
Updated
May 5
•
1.66M
•
24
View 13 datasets