xref: /aosp_15_r20/external/libopus/dnn/torch/neural-pitch/README.md (revision a58d3d2adb790c104798cd88c8a3aff4fa8b82cc)
1## Neural Pitch Estimation
2
3- Dataset Installation
4    1. Download and unzip PTDB Dataset:
5        wget https://www2.spsc.tugraz.at/databases/PTDB-TUG/SPEECH_DATA_ZIPPED.zip
6        unzip SPEECH_DATA_ZIPPED.zip
7
8    2. Inside "SPEECH DATA" above, run ptdb_process.sh to combine male/female
9
10    3. To Download and combine demand, simply run download_demand.sh
11
12- LPCNet preparation
13    1. To extract xcorr, add lpcnet_extractor.c and add relevant functions to lpcnet_enc.c, add source for headers/c files and Makefile.am, and compile to generate ./lpcnet_xcorr_extractor object
14
15- Dataset Augmentation and training (check out arguments to each of the following)
16    1. Run data_augmentation.py
17    2. Run training.py using augmented data
18    3. Run experiments.py
19