- Direct Preference Optimization: Your Language Model is Secretly a Reward Model (arXiv:2305.18290, 63 upvotes)
- Towards Efficient and Exact Optimization of Language Model Alignment (arXiv:2402.00856)
- A General Theoretical Paradigm to Understand Learning from Human Preferences (arXiv:2310.12036, 16 upvotes)
- Statistical Rejection Sampling Improves Preference Optimization (arXiv:2309.06657, 14 upvotes)
Yiming Zheng (ZYM666)
Recent Activity
- Upvoted a paper (5 days ago): VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use
- Liked a Space (12 days ago): TTS-AGI/Voice-Clone-Arena
- Liked a dataset (18 days ago): AIDC-AI/CSEMOTIONS