diff --git a/README.md b/README.md index d22d58f3a0514c7ab184a0aa956d5de0c9e97d5c..04b43ea3eeac6eb6cdaf19d7e733738be4f5a9a6 100644 --- a/README.md +++ b/README.md @@ -28,6 +28,7 @@ An introduction to neural networks and deep learning: ### Machine Learning & Deep Learning methods [Using CNNs on spectrogram images](https://www.ofai.at/~jan.schlueter/pubs/2014_icassp.pdf) (Schlüter, Böck, 2014) : + - [MADMOM implementation](https://madmom.readthedocs.io/en/v0.16/modules/features/onsets.html) - Python implementation for Taiko rythm games : https://github.com/seiichiinoue/odcnn ### Other methods @@ -45,7 +46,9 @@ If we ever want to use an AI to identify syllables without a reference lyrics fi - MKVToolnix (at least the CLI utils) - Python >= 3.8 -- PyTorch : follow the instructions [here](https://pytorch.org/get-started/locally/) + +Optional : +- PyTorch for custom model training : follow the instructions [here](https://pytorch.org/get-started/locally/) All other python modules can be installed directly through pip, see further. @@ -62,35 +65,12 @@ $ pip install -r requirements.txt $ deactivate ``` -Having a CUDA-capable GPU is optional, but can greatly reduce processing time. +Having a CUDA-capable GPU is optional, but can greatly reduce processing time in some situations. # Use - -## Training - -To extract vocals and ASS from MKV video files: -```bash -$ ./process_train_data video_folder train_folder -``` - -To prepare the training data for the model : -```bash -$ python cnn_prepare_data.py train train_folder -``` - -Prepared data will be stored in `./data/pickles/train_data.pickle` - -To train the model on the prepared data : -```bash -$ python cnn_train.py -``` - -The model will be written to `./models/model.pth` - - -## Infer +## Inference To execute AutoKara on a MKV video file : ```bash diff --git a/requirements.txt b/requirements.txt index 4a1d9cbfadfab62043cc52f5798be97e66bc7b97..770a61b27edbe79b68aa14593a431c0a841062bf 100644 --- a/requirements.txt +++ b/requirements.txt @@ -5,4 +5,8 @@ soundfile sklearn matplotlib numpy -tqdm \ No newline at end of file +tqdm +scipy +cython +mido +git+https://github.com/CPJKU/madmom.git \ No newline at end of file