Kai Ruan

6cf · x66ccff

AI & ML interests

AI for Science

Recent Activity

liked a model 9 days ago

deepseek-ai/DeepSeek-V3.2

liked a model 12 days ago

moonshotai/Kimi-K2-Thinking

upvoted an article about 1 month ago

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

upvoted an article about 1 month ago

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Article • Jun 11 • 119

upvoted a paper about 1 month ago

SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics

Paper • 2506.01844 • Published Jun 2 • 143

upvoted a collection 4 months ago

DeepSeek-V3.1

DeepSeek's new 3.1 update to their V3 models! • 6 items • Updated 9 days ago • 7

upvoted 2 papers 6 months ago

Scaling Diffusion Transformers Efficiently via μP

Paper • 2505.15270 • Published May 21 • 35

Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

Paper • 2506.09250 • Published Jun 10 • 27

upvoted a collection 6 months ago

Unsloth Dynamic 2.0 Quants

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 59 items • Updated 9 days ago • 260

upvoted 2 papers 7 months ago

Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities

Paper • 2505.02567 • Published May 5 • 80

Benchmarking LLMs' Swarm intelligence

Paper • 2505.04364 • Published May 7 • 20

upvoted a paper 9 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

upvoted a paper 10 months ago

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 47

upvoted a paper 11 months ago

Discovering symbolic expressions with parallelized tree search

Paper • 2407.04405 • Published Jul 5, 2024 • 1

upvoted 4 collections 11 months ago

DeepSeek-R1

10 items • Updated 14 days ago • 821

IdeaWhiz

3 items • Updated Jan 9 • 3

DeepSeek-V3

4 items • Updated 14 days ago • 278

Models for Open Hands (Open Devin)

Models for Open Hands (Open Devin) trained on the Devinator Dataset. • 3 items • Updated Oct 25, 2024 • 3

upvoted a collection 12 months ago

LiveIdeaBench

4 items • Updated May 8 • 5

upvoted a paper 12 months ago

LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context

Paper • 2412.17596 • Published Dec 23, 2024 • 6

upvoted 2 collections 12 months ago

YuLan-Mini

A highly capable 2.4B lightweight LLM using only 1T pre-training data with all details. • 6 items • Updated Apr 14 • 16

DeepSeek-VL2

5 items • Updated 14 days ago • 78

upvoted an article about 1 year ago

🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs

Article • Dec 4, 2024 • 80