Quickpanda

AI & ML interests

None yet

Recent Activity

upvoted a paper 11 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

upvoted a paper 9 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

liked a model 9 months ago

RabotniKuma/Fast-Math-R1-14B

Organizations

None yet

upvoted a paper 11 days ago

Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning

Paper • 2512.20605 • Published 15 days ago • 60

upvoted a paper 9 months ago

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce

Paper • 2504.11343 • Published Apr 15, 2025 • 19

upvoted an article over 1 year ago

Merge Large Language Models with mergekit

Article • Jan 9, 2024 • 147