Update README.md
Browse files
README.md
CHANGED
|
@@ -6,6 +6,17 @@ datasets:
|
|
| 6 |
- PrimeIntellect/StackV1-popular
|
| 7 |
- mlfoundations/dclm-baseline-1.0-parquet
|
| 8 |
- open-web-math/open-web-math
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 9 |
language:
|
| 10 |
- en
|
| 11 |
pipeline_tag: text-generation
|
|
@@ -96,13 +107,13 @@ First, we conducted an extensive series of 16 Supervised Fine-Tuning (SFT) train
|
|
| 96 |
- arcee-ai/The-Tomb
|
| 97 |
|
| 98 |
2. **Instruction Following**:
|
| 99 |
-
- [mlabonne/open-perfectblend-fixed](MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
|
| 100 |
-
- [microsoft/orca-agentinstruct-1M-v1-cleaned](mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
|
| 101 |
-
- [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](Post-training-Data-Flywheel/AutoIF-instruct-61k
|
| 102 |
|
| 103 |
3. **Domain-Specific**:
|
| 104 |
- [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
|
| 105 |
-
- [Synthia coder](MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
|
| 106 |
- [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
|
| 107 |
- [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
|
| 108 |
|
|
|
|
| 6 |
- PrimeIntellect/StackV1-popular
|
| 7 |
- mlfoundations/dclm-baseline-1.0-parquet
|
| 8 |
- open-web-math/open-web-math
|
| 9 |
+
- MaziyarPanahi/open-perfectblend-fixed
|
| 10 |
+
- mlabonne/orca-agentinstruct-1M-v1-cleaned
|
| 11 |
+
- Post-training-Data-Flywheel/AutoIF-instruct-61k
|
| 12 |
+
- Team-ACE/ToolACE
|
| 13 |
+
- MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt
|
| 14 |
+
- ServiceNow-AI/M2Lingual
|
| 15 |
+
- AI-MO/NuminaMath-TIR
|
| 16 |
+
- allenai/tulu-3-sft-personas-code
|
| 17 |
+
- tulu-3-sft-personas-math
|
| 18 |
+
- tulu-3-sft-personas-math-grade
|
| 19 |
+
- tulu-3-sft-personas-algebra
|
| 20 |
language:
|
| 21 |
- en
|
| 22 |
pipeline_tag: text-generation
|
|
|
|
| 107 |
- arcee-ai/The-Tomb
|
| 108 |
|
| 109 |
2. **Instruction Following**:
|
| 110 |
+
- [mlabonne/open-perfectblend-fixed](https://huggingface.co/datasets/MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
|
| 111 |
+
- [microsoft/orca-agentinstruct-1M-v1-cleaned](https://huggingface.co/datasets/mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
|
| 112 |
+
- [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](https://huggingface.co/datasets/Post-training-Data-Flywheel/AutoIF-instruct-61k)
|
| 113 |
|
| 114 |
3. **Domain-Specific**:
|
| 115 |
- [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
|
| 116 |
+
- [Synthia coder](https://huggingface.co/datasets/MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
|
| 117 |
- [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
|
| 118 |
- [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
|
| 119 |
|