Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Melih Özcan's picture
226

Melih Özcan

staycoolish
aakashbilly's profile picture 21world's profile picture
·

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 days ago
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey
upvoted a paper 2 days ago
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
upvoted a paper 2 days ago
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning
View all activity

Organizations

None yet

models 11

staycoolish/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Apr 27, 2023

staycoolish/a2c-PandaReachDense-v2

Reinforcement Learning • Updated Mar 6, 2023

staycoolish/a2c-AntBulletEnv-v0

Reinforcement Learning • Updated Mar 6, 2023

staycoolish/ppo-Pyramids

Reinforcement Learning • Updated Feb 28, 2023 • 2

staycoolish/ppo-SnowballTarget

Reinforcement Learning • Updated Feb 28, 2023 • 2

staycoolish/Reinforce-Pixelcopter-v1

Reinforcement Learning • Updated Jan 20, 2023

staycoolish/Reinforce-Cartpole-v1

Reinforcement Learning • Updated Jan 20, 2023

staycoolish/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Jan 19, 2023

staycoolish/q-Taxi-v3

Reinforcement Learning • Updated Jan 19, 2023

staycoolish/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jan 19, 2023
View 11 models

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略