Zhaoning Yu's picture

4

Zhaoning Yu

ZhaoningYu

·

AI & ML interests

None yet

Recent Activity

authored a paper 20 days ago

RESTRAIN: From Spurious Votes to Signals -- Self-Driven RL with Self-Penalization

upvoted a paper about 2 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

upvoted a paper about 2 months ago

Hybrid Reinforcement: When Reward Is Sparse, It's Better to Be Dense

View all activity

Organizations

None yet

Papers 1

arxiv:2510.02172

models 1

ZhaoningYu/rl-course-ppo-LunarLander-v2

Reinforcement Learning • Updated Dec 27, 2024 • 4

datasets 0

None public yet