diff --git a/README.md b/README.md index b37bfd3dfd7df975752fce83206754c68fb4e3c2..9cb74c77394e54d7767a17d090b2e92b5a691a2f 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ # AutoKara -AI that can time automatically karas. +Experiment in automatic karaoke timing. # Some documentation first @@ -22,11 +22,20 @@ Get a data buffer from an audio file: - https://stackoverflow.com/questions/49279425/extract-human-vocals-from-song - https://www.geeksforgeeks.org/python-speech-recognition-on-large-audio-files/ +### Extracting vocals from music + +- https://github.com/deezer/spleeter/ + +### Syllable segmentation and recognition + + - [Syllable segmentation](https://www.sciencedirect.com/science/article/pii/S1877050916319068/pdf?md5=abc426e84a71cd4f5c0e6bef9713643e&pid=1-s2.0-S1877050916319068-main.pdf&_valck=1) + - [Syllable segmentation and recognition](https://cdn.intechopen.com/pdfs/15947/InTech-Syllable_based_speech_recognition.pdf) + # Installation ## Python -You need python 3 with the tensorflow library. We will set up a virtual envireonment to not mess up with the system. +You need python 3 with the tensorflow library. We will set up a virtual environment to not mess up with the system. To do that you will need [virtualenv](https://virtualenv.pypa.io/en/stable/) and [virtualenvwrapper](https://virtualenvwrapper.readthedocs.io/en/latest/).