Wei Shen
Swtheking
AI & ML interests
None yet
Recent Activity
commented on
a paper
3 months ago
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
authored
a paper
4 months ago
AdaCoT: Pareto-Optimal Adaptive Chain-of-Thought Triggering via
Reinforcement Learning