Hugging Face
Hu Xiaoyan
Yann1021
https://yannxiaoyanhu.github.io
AI & ML interests
Reinforcement learning
Recent Activity
Authored a paper, 3 days ago: Provably Efficient CVaR RL in Low-rank MDPs
Authored a paper, 3 days ago: PAK-UCB Contextual Bandit: An Online Learning Approach to Prompt-Aware Selection of Generative Models and LLMs
Authored a paper, 3 days ago: A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models
Organizations
None yet
Papers (3)
arxiv:2410.13287
arxiv:2406.07451
arxiv:2311.11965
Models (0)
None public yet
Datasets (0)
None public yet