
Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT

This is the implementation of the Interspeech 2020 paper "Converting anyone's emotion: steps towards speaker-independent emotional voice conversion". Please cite our paper if you use this code.

Getting Started

Prerequisites

  • Ubuntu 16.04
  • Python 3.6
    • Tensorflow-gpu 1.5.0
    • PyWorld
    • librosa
    • soundfile
    • numpy 1.14.0
    • scikit-learn (imported as sklearn)
    • glob (part of the Python standard library, no installation needed)
    • sprocket-vc
    • pycwt
    • scipy
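
Before training, it can help to confirm that the packages above are importable in the active environment. The following is a minimal sketch, not part of the repository: the function name `find_missing_modules` is hypothetical, and the import names in `REQUIRED_MODULES` are assumptions about how each listed package is imported (for example, sprocket-vc as `sprocket` and PyWorld as `pyworld`). It does not check version numbers.

```python
import importlib.util

# Assumed import names for the prerequisites listed above.
REQUIRED_MODULES = [
    "tensorflow", "pyworld", "librosa", "soundfile", "numpy",
    "sklearn", "glob", "sprocket", "pycwt", "scipy",
]


def find_missing_modules(names):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All listed modules were found.")
```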

Usage

  1. Prepare your dataset.
Please follow this file structure:

training_dir: ./data/wav/training_set/*/*.wav

evaluation_dir: ./data/wav/evaluation_set/*/*.wav

For example: "./data/wav/training_set/Angry/0001.wav"
  2. Activate your virtual environment.
source activate [your env]
  3. Train VAW-GAN for prosody.
./train_f0.sh
# Before running, set the source and target directories in "architecture-vawgan-vcc2016.json"
  4. Train VAW-GAN for spectrum.
./train_sp.sh
# Before running, set the source and target directories in "architecture-vawgan-vcc2016.json"
  5. Generate the converted emotional speech.
./convert.sh
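
The dataset layout from the first step can be checked before training with a short script. This is a sketch under assumptions, not part of the repository: the helper name `group_wavs_by_emotion` is hypothetical, and it only verifies that each set follows the `<set>/<emotion>/<file>.wav` pattern shown above. It does not inspect the audio itself.

```python
import glob
import os
from collections import defaultdict


def group_wavs_by_emotion(set_dir):
    """Map each emotion sub-folder of set_dir to a sorted list of its .wav file names."""
    groups = defaultdict(list)
    for path in glob.glob(os.path.join(set_dir, "*", "*.wav")):
        emotion = os.path.basename(os.path.dirname(path))
        groups[emotion].append(os.path.basename(path))
    return {emotion: sorted(files) for emotion, files in groups.items()}


if __name__ == "__main__":
    # Paths taken from the file structure described in the usage steps.
    for set_dir in ("./data/wav/training_set", "./data/wav/evaluation_set"):
        groups = group_wavs_by_emotion(set_dir)
        if not groups:
            print(set_dir + ": no .wav files found")
        for emotion, files in sorted(groups.items()):
            print(set_dir + "/" + emotion + ": " + str(len(files)) + " files")
```

An empty result for either directory usually means the emotion sub-folders are missing or the files are not one level below the set directory.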

Note: The code is based on VAW-GAN Voice Conversion: https://github.com/JeremyCCHsu/vae-npvc/tree/vawgan
