Collections

Discover the best community collections!

Collections including paper arxiv:2502.09056
Typhoon R1 - ICLR 2025 SCI-FM Artifacts
Artifacts from our paper, Adapting Language-Specific LLMs to a Reasoning Mode https://arxiv.org/abs/2502.09056, accepted at ICLR 2025 SCI-FM workshop.
RL+reason model
Collection by 3 days ago
readings
Collection by 4 days ago
Typhoon R1 - ICLR 2025 SCI-FM Artifacts
Artifacts from our paper, Adapting Language-Specific LLMs to a Reasoning Mode https://arxiv.org/abs/2502.09056, accepted at ICLR 2025 SCI-FM workshop.
RL+reason model
Collection by 3 days ago
Reasoning, Thinking, RL and Test-Time Scaling
Collection by 16 days ago
readings
Collection by 4 days ago