Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
weiliu's picture
2 10 15

weiliu

thinkwee
Monta3Pt's profile picture Mi6paulino's profile picture XingweiT's profile picture
·
https://thinkwee.top/about/
  • thinkwee2767
  • thinkwee
  • thinkwee

AI & ML interests

LLM reasoning, agents

Recent Activity

upvoted a paper 11 days ago
Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?
upvoted a paper 17 days ago
Agentic Reinforced Policy Optimization
updated a collection 18 days ago
NOVER1
View all activity

Organizations

None yet

New activity in thinkwee/NOVEReason_5k about 1 month ago

[bot] Conversion to Parquet

#1 opened about 1 month ago by
parquet-converter
commented a paper 3 months ago

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21 • 3 •
5
commented 2 papers 4 months ago

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21 • 3 •
5

NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning

Paper • 2505.16022 • Published May 21 • 3 •
5
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略