An implementation of the WaveNet vocoder using TensorFlow

!!! The audio code is copied from wavenet_vocoder. !!!

!!! The main TensorFlow model is adapted from tensorflow-wavenet. !!!

Known issues

Mixture support is in the dev branch, but wav generation there still has some bugs.

To Do

Requirements

python >= 3.3
tensorflow >= 1.3
tqdm
pyworld
pysptk
nnmnkwii >= 0.12
scipy
lws == 1.0
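
The version constraints above can be checked before training with a small script. This is a minimal sketch, not part of the repository: the helper names (`parse_version`, `satisfies`, `check`) are hypothetical, it assumes each package is installed under the distribution name shown in the list (TensorFlow, for example, may be installed under a different name such as a GPU build), and it relies on `importlib.metadata`, which needs Python 3.8 or newer.

```python
import re
from importlib import metadata

# Constraints copied from the list above; None means "any version".
REQUIREMENTS = [
    ("tensorflow", ">=", "1.3"),
    ("tqdm", None, None),
    ("pyworld", None, None),
    ("pysptk", None, None),
    ("nnmnkwii", ">=", "0.12"),
    ("scipy", None, None),
    ("lws", "==", "1.0"),
]


def parse_version(text):
    """Turn '1.15.0rc1' into (1, 15, 0); non-numeric suffixes are ignored."""
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def satisfies(installed, op, wanted):
    """Return True if the installed version meets the constraint."""
    if op is None:
        return True
    have, want = parse_version(installed), parse_version(wanted)
    # Pad to the same length so that 1.0 and 1.0.0 compare as equal.
    width = max(len(have), len(want))
    have += (0,) * (width - len(have))
    want += (0,) * (width - len(want))
    return have >= want if op == ">=" else have == want


def installed_version(name):
    """Look up the installed version, or None if the package is missing."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def check(requirements=REQUIREMENTS, lookup=installed_version):
    """Return a list of human-readable problems (empty if all is well)."""
    problems = []
    for name, op, wanted in requirements:
        found = lookup(name)
        if found is None:
            problems.append(f"{name}: not installed")
        elif not satisfies(found, op, wanted):
            problems.append(f"{name}: found {found}, need {op} {wanted}")
    return problems


if __name__ == "__main__":
    for line in check():
        print(line)
```

Running the script prints one line per missing or mismatched package and nothing when every constraint is met.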

Getting Started

Download dataset

Voice conversion dataset (multi-speaker, 16 kHz): cmu_arctic
Single-speaker dataset (22.05 kHz): LJSpeech-1.0

Preprocess data

To speed up training, first preprocess the data into npy files:

python preprocess.py --num_workers 4 --name ljspeech --in_dir /your_path/LJSpeech-1.0 --out_dir /your_outpath/ --hparams sample_rate=22050

Training

For a single speaker:

python train.py --num_gpus 4 --batch_size 2 --train_txt /your_train_txt/ --hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050 --logdir_root log_ljspeech
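
The `--hparams` argument above is a comma-separated list of `key=value` pairs, presumably applied as overrides to the defaults in hparams.py. The following is a minimal sketch of how such a string can be read, not the repository's actual parser: `parse_overrides` and `convert` are hypothetical names, and the type guessing (bool, then int, then float, else string) is an assumption made for illustration.

```python
def convert(raw):
    """Best-effort typing: bool, then int, then float, else keep the string."""
    if raw in ("True", "False"):
        return raw == "True"
    for cast in (int, float):
        try:
            return cast(raw)
        except ValueError:
            pass
    return raw


def parse_overrides(text):
    """Parse 'a=1,b=False,c=/path/' into a dict of typed values."""
    overrides = {}
    for item in text.split(","):
        if not item.strip():
            continue  # tolerate a trailing comma
        key, sep, raw = item.partition("=")
        if not sep:
            raise ValueError(f"expected key=value, got {item!r}")
        overrides[key.strip()] = convert(raw.strip())
    return overrides


if __name__ == "__main__":
    spec = ("gc_enable=False,global_channel=0,global_cardinality=0,"
            "NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050")
    for key, value in parse_overrides(spec).items():
        print(key, repr(value))
```

Because the pairs are split on commas, a value that itself contains a comma cannot be passed this way in this sketch.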

For multiple speakers:

python train.py --batch_size 2 --num_gpus 4 --train_txt /your_train_txt/ --logdir_root log_arctic

Synthesize

For a single speaker:

The eval_txt is extracted from the train_txt.
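
One way to extract an evaluation list is to pick a few entries from the training list. The sketch below assumes that the train_txt is a plain text file with one entry per line, which the README does not state; `pick_eval_lines` and `write_eval_txt` are hypothetical helpers, not scripts from this repository.

```python
import random


def pick_eval_lines(train_lines, num_eval, seed=0):
    """Choose num_eval non-empty entries, keeping their original order."""
    entries = [line for line in train_lines if line.strip()]
    if num_eval > len(entries):
        raise ValueError("num_eval exceeds the number of training entries")
    rng = random.Random(seed)  # fixed seed so the selection is repeatable
    chosen = set(rng.sample(range(len(entries)), num_eval))
    return [entry for i, entry in enumerate(entries) if i in chosen]


def write_eval_txt(train_path, eval_path, num_eval, seed=0):
    """Read the training list and write the selected entries to eval_path."""
    with open(train_path, encoding="utf-8") as f:
        train_lines = f.read().splitlines()
    selected = pick_eval_lines(train_lines, num_eval, seed)
    with open(eval_path, "w", encoding="utf-8") as f:
        f.write("\n".join(selected) + "\n")
    return selected
```

The training list itself is left unchanged here; whether the evaluation entries should also be removed from it is a choice this sketch does not make.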

python mul_generate.py --eval_txt /your_eval_txt/ --wav_out_path test_ljspeech.wav /your_checkpoint/ --hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050

For multiple speakers:

python mul_generate.py --eval_txt /your_eval_txt/ --wav_out_path test_arctic.wav /your_checkpoint/ --gc_id 6
