Asif

HaseebAsif

·

Haseebasif7

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 30 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published Feb 5 • 49

updated a model about 1 month ago

HaseebAsif/GEOPAK

published a model about 1 month ago

HaseebAsif/GEOPAK

updated a dataset about 1 month ago

HaseebAsif/UrduReason-Eval

Viewer • Updated Jan 24 • 800 • 34 • 1

published a dataset about 2 months ago

HaseebAsif/UrduReason-Eval

Viewer • Updated Jan 24 • 800 • 34 • 1

upvoted a paper about 2 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7, 2025 • 184

liked a model 6 months ago

meituan-longcat/LongCat-Flash-Chat

Text Generation • 562B • Updated Sep 24, 2025 • 31.3k • 526

updated a model 7 months ago

HaseebAsif/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Aug 11, 2025

published a model 7 months ago

HaseebAsif/rl_course_vizdoom_health_gathering_supreme

Reinforcement Learning • Updated Aug 11, 2025

updated a model 7 months ago

HaseebAsif/Reinforce-Cartpole

Reinforcement Learning • Updated Aug 5, 2025

published a model 7 months ago

HaseebAsif/Reinforce-Cartpole

Reinforcement Learning • Updated Aug 5, 2025

updated a model 7 months ago

HaseebAsif/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Aug 1, 2025

published a model 7 months ago

HaseebAsif/dqn-SpaceInvadersNoFrameskip-v4

Reinforcement Learning • Updated Aug 1, 2025

updated a model 7 months ago

HaseebAsif/q-Taxi-v3

Reinforcement Learning • Updated Jul 31, 2025

published a model 7 months ago

HaseebAsif/q-Taxi-v3

Reinforcement Learning • Updated Jul 31, 2025

updated a model 7 months ago

HaseebAsif/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jul 31, 2025

published a model 7 months ago

HaseebAsif/q-FrozenLake-v1-4x4-noSlippery

Reinforcement Learning • Updated Jul 31, 2025

updated a model 7 months ago

HaseebAsif/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 29, 2025

published a model 7 months ago

HaseebAsif/ppo-LunarLander-v2

Reinforcement Learning • Updated Jul 29, 2025

upvoted an article about 1 year ago

Fine-Tuning LLMs: Supervised Fine-Tuning and Reward Modelling

Article • Dec 4, 2023 • 7