Part of the SynGen collection: state-of-the-art models & datasets for synthetic reasoning trace generation. Credit for the original dataset goes to https://huggingface.co/Pinkstack
This is a slightly refined version of qingy2024/SynGen-14B, further trained with DPO on qingy2024/SynGen-Antiloop-DPO. This should reduce repetition and improve the quality of the generated reasoning traces. See the original model card for a description of what it can do.
Notes:
temp = 0.7, top_p = 0.95; pretty much the default settings work.
<reasoning_style>deepseek_r1</reasoning_style> # Can replace deepseek_r1 with gpt_oss
<system_prompt>Original System Prompt</system_prompt>
<user>User Message Here</user>
<assistant>Assistant Final Response Here (without reasoning)</assistant>
<think>Generated Reasoning</think>
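The template above can be assembled programmatically. The following is a minimal sketch, not the author's reference code: the helper names `build_prompt` and `extract_reasoning` are hypothetical, and joining the tags with newlines and ending the prompt with an open `<think>` tag are assumptions inferred from the template. Check the original model card for the exact expected whitespace.

```python
# Reasoning styles mentioned in the notes above.
ALLOWED_STYLES = ("deepseek_r1", "gpt_oss")

# Sampling settings from the notes above.
SAMPLING = {"temperature": 0.7, "top_p": 0.95}


def build_prompt(system_prompt, user, assistant, reasoning_style="deepseek_r1"):
    """Build a SynGen-style prompt (hypothetical helper).

    Assumption: tags are separated by newlines and the prompt ends with
    an open <think> tag so the model continues with the reasoning trace.
    """
    if reasoning_style not in ALLOWED_STYLES:
        raise ValueError(f"unknown reasoning style: {reasoning_style!r}")
    parts = [
        f"<reasoning_style>{reasoning_style}</reasoning_style>",
        f"<system_prompt>{system_prompt}</system_prompt>",
        f"<user>{user}</user>",
        # Final response only, without any reasoning.
        f"<assistant>{assistant}</assistant>",
        "<think>",
    ]
    return "\n".join(parts)


def extract_reasoning(completion):
    """Return the generated text up to the closing </think> tag (hypothetical helper)."""
    return completion.split("</think>", 1)[0].strip()
```

For example, `build_prompt("You are helpful.", "What is 2+2?", "4", "gpt_oss")` gives a prompt string that can be passed to whichever inference stack you use, together with the values in `SAMPLING`; `extract_reasoning` then trims the completion at `</think>`.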