·
AI & ML interests
LLM × RL
Recent Activity
Organizations
Viewer
•
Updated
•
8.79k
•
1
ryota39/llmjp-chatbot-arena-v2
Viewer
•
Updated
•
594
•
8
Viewer
•
Updated
•
29.1k
•
3
ryota39/llm-jp-chatbot-arena-conversations-reformatted
Viewer
•
Updated
•
990
•
4
•
1
ryota39/reviews_and_summaries2
Viewer
•
Updated
•
50
•
26
ryota39/reviews_and_summaries
Viewer
•
Updated
•
50
•
23
ryota39/movie_reviews_local
Viewer
•
Updated
•
50
•
25
Viewer
•
Updated
•
50
•
10
Viewer
•
Updated
•
3.49k
•
3
ryota39/aya-evol-instruct
Viewer
•
Updated
•
29.2k
•
11
ryota39/JCommonsenseMorality
Viewer
•
Updated
•
9.98k
•
29
Viewer
•
Updated
•
169k
•
23
ryota39/preference-en-ja-100k
Viewer
•
Updated
•
101k
•
19
•
1
Viewer
•
Updated
•
29.6k
•
69
ryota39/preference_test_annotated
Viewer
•
Updated
•
5
•
11
ryota39/open_preference_v0.4
Viewer
•
Updated
•
202k
•
64
•
1
ryota39/webgpt_comparisons-ja
Viewer
•
Updated
•
17.4k
•
20
•
1
ryota39/synthetic-instruct-gptj-pairwise-ja
Viewer
•
Updated
•
33.1k
•
9
•
1
ryota39/self-rewarding_instruct_AIFT_M3_scored
Viewer
•
Updated
•
7.11k
•
30
ryota39/self-rewarding_instruct_AIFT_M2_scored
Viewer
•
Updated
•
7k
•
17
ryota39/self-rewarding_instruct_AIFT_M1_scored
Viewer
•
Updated
•
4k
•
13
ryota39/Synthetic-JP-Conversations-Magpie-Nemotron-4-10k_scored
Viewer
•
Updated
•
10.1k
•
8
Viewer
•
Updated
•
31.3k
•
6
ryota39/hh-rlhf-12k-ja_orpo
Viewer
•
Updated
•
12k
•
11
•
1
ryota39/izumi-lab-dpo-45k
Viewer
•
Updated
•
45.7k
•
19
•
1
ryota39/open_preference_v0.1
Viewer
•
Updated
•
49.2k
•
9
•
1
Viewer
•
Updated
•
45.2k
•
159
Viewer
•
Updated
•
49.2k
•
36
ryota39/janli_synthetic_rationale
Viewer
•
Updated
•
14.4k
•
11
•
1
Viewer
•
Updated
•
108
•
10