Respair commited on
Commit
86ba417
·
verified ·
1 Parent(s): f6e9b50

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md ADDED
@@ -0,0 +1,18 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ tags:
4
+ - codec
5
+ - audio_tokenizer
6
+ - audio_codec
7
+ ---
8
+
9
+ This is an on-going project. it is a modified version of Higgs-Boson audio tokenizer, you can fully train it. all scripts have been tested.
10
+ a Few notes however:
11
+
12
+ 1- this is not backward compatible with the original checkpoint (I think you can tweak it to be, but you have to adhere to Boson community license if you do.)
13
+ 2- I highly recommend you to pretrain the model without the mel and adversarial setup first. it saves you a significant amount of compute and speed-up your convergence.
14
+ 3- for the semantic teacher, I am using ```utter-project/mHuBERT-147``` which has a good multilingual support. if you want the original setup you can change it in the config.
15
+
16
+ I will train a checkpoint on a larger enough dataset one of these days after figuring out a few things first. but the setup is solid.
17
+
18
+ Happy training (~~inshallah~~).