2 8 2

Zhepei Wei

weizhepei

https://weizhepei.com

AI & ML interests

None yet

Recent Activity

published a dataset 16 days ago

weizhepei/WebArena-Lite-SFT

upvoted a paper 19 days ago

VisPlay: Self-Evolving Vision-Language Models from Images

updated a dataset about 2 months ago

weizhepei/TruthRL-HotpotQA

View all activity

Organizations

published a dataset 16 days ago

weizhepei/WebArena-Lite-SFT

Viewer • Updated Mar 22 • 18.9k • 40

upvoted a paper 19 days ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published 20 days ago • 42

updated 3 datasets about 2 months ago

published 3 datasets about 2 months ago

weizhepei/TruthRL-NaturalQuestions

Viewer • Updated Oct 21 • 3.61k • 36

weizhepei/TruthRL-MuSiQue

Viewer • Updated Oct 20 • 22.4k • 48

weizhepei/TruthRL-HotpotQA

Viewer • Updated Oct 21 • 7.41k • 27

updated a dataset about 2 months ago

weizhepei/TruthRL-CRAG

Viewer • Updated Oct 20 • 1.3k • 101

published a dataset about 2 months ago

weizhepei/TruthRL-CRAG

Viewer • Updated Oct 20 • 1.3k • 101

upvoted a paper 2 months ago

TaTToo: Tool-Grounded Thinking PRM for Test-Time Scaling in Tabular Reasoning

Paper • 2510.06217 • Published Oct 7 • 63

updated a dataset 2 months ago

meng-lab/DeSA-RecallResult

Updated Oct 2 • 24

upvoted 2 papers 2 months ago

Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training

Paper • 2509.03403 • Published Sep 3 • 22

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55

commented a paper 2 months ago

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30 • 55 •

updated a model 2 months ago

meng-lab/DeSA-qwen2.5-3b-it-em-after50stepsacc-step150

3B • Updated Sep 28 • 3

published a model 2 months ago

meng-lab/DeSA-qwen2.5-3b-it-em-after50stepsacc-step150

3B • Updated Sep 28 • 3

updated a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-stage1-acc

Updated Sep 24

published a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-stage1-acc

Updated Sep 24

updated a model 3 months ago

meng-lab/DeSA-qwen2.5-3b-it-searchbehaviorandem

Updated Sep 24

Zhepei Wei

AI & ML interests

Recent Activity

Organizations

weizhepei's activity