Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
1
Jerry Huang
PRO
jerry128
Follow
smadala2's profile picture
1 follower
·
5 following
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training
updated
a dataset
2 months ago
jerry128/rag-rl-sft-linear
updated
a dataset
2 months ago
jerry128/rag-rl-sft-min-max
View all activity
Organizations
jerry128
's datasets
149
Sort: Recently updated
jerry128/rag-rl-sft-linear
Viewer
•
Updated
Jul 1
•
2.77k
•
4
jerry128/rag-rl-sft-min-max
Viewer
•
Updated
Jul 1
•
3.15k
•
4
jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal-Shuffled
Viewer
•
Updated
Jul 1
•
19.9k
•
4
jerry128/RAG-RL-MuSiQue-Min-Max-rebuttal
Viewer
•
Updated
Jul 1
•
19.9k
•
4
jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Sorted-by-Num-Hops
Viewer
•
Updated
Jul 1
•
19.9k
•
4
jerry128/RAG-RL-MuSiQue-Linear-rebuttal-Shuffled
Viewer
•
Updated
Jul 1
•
19.9k
•
4
jerry128/RAG-RL-MuSiQue-Linear-rebuttal
Viewer
•
Updated
Jul 1
•
19.9k
•
3
jerry128/rag-rl-sft2
Viewer
•
Updated
Jun 30
•
2.3k
•
3
jerry128/rag-rl-sft
Viewer
•
Updated
Jun 29
•
5k
•
3
jerry128/ToolACE_Alpaca_Sft
Viewer
•
Updated
Jun 10
•
9.57k
•
3
jerry128/ToolACE_Alpaca
Viewer
•
Updated
Jun 10
•
9.57k
•
2
jerry128/ToolACE_Transformed2
Viewer
•
Updated
Jun 10
•
9.57k
•
3
jerry128/ToolACE_Transformed
Viewer
•
Updated
Jun 6
•
9.57k
jerry128/ToolACE-axolotl-dpo
Viewer
•
Updated
Jun 5
•
3.85k
jerry128/ToolACE-axolotl
Viewer
•
Updated
Jun 5
•
11.3k
jerry128/RAG-RL-2Wiki-Eval-ideal-retriever-OOD
Viewer
•
Updated
Apr 2
•
500
•
2
jerry128/RAG-RL-2Wiki-Eval-ideal-retriever-ID
Viewer
•
Updated
Apr 2
•
500
•
3
jerry128/RAG-RL-HotpotQA-Eval-ideal-retriever-OOD
Viewer
•
Updated
Apr 2
•
512
•
2
jerry128/RAG-RL-HotpotQA-Eval-ideal-retriever-ID
Viewer
•
Updated
Apr 2
•
512
•
3
jerry128/RAG-RL-MuSiQue-Eval-ideal-retriever-OOD
Viewer
•
Updated
Apr 2
•
379
•
2
jerry128/RAG-RL-MuSiQue-Eval-ideal-retriever-ID
Viewer
•
Updated
Apr 2
•
379
•
3
jerry128/RAG-RL-MuSiQue-Eval-OOD
Viewer
•
Updated
Apr 2
•
681
•
2
jerry128/RAG-RL-MuSiQue-Eval-ID
Viewer
•
Updated
Apr 2
•
681
•
2
jerry128/RAG-RL-HotpotQA-Eval-OOD
Viewer
•
Updated
Apr 2
•
810
•
2
jerry128/RAG-RL-HotpotQA-Eval-ID
Viewer
•
Updated
Apr 2
•
810
•
2
jerry128/RAG-RL-2Wiki-Eval-OOD
Viewer
•
Updated
Apr 2
•
1k
•
1
jerry128/RAG-RL-2Wiki-Eval-ID
Viewer
•
Updated
Apr 2
•
1k
•
4
jerry128/RAG-RL-2Wiki-Eval-Gold-Only-50k
Viewer
•
Updated
Apr 2
•
12.6k
•
3
jerry128/RAG-RL-MuSiQue-Eval-Gold-Only-50k
Viewer
•
Updated
Apr 2
•
2.42k
•
1
jerry128/RAG-RL-HotpotQA-Eval-Gold-Only-50k
Viewer
•
Updated
Apr 2
•
7.41k
•
61
Previous
1
2
3
...
5
Next