From 5a0c2981dad4422c5d003e99da840f1724ec9934 Mon Sep 17 00:00:00 2001 From: Sting <loic.allegre@ensiie.fr> Date: Tue, 20 Jun 2023 11:02:32 +0200 Subject: [PATCH] Update README.md --- README.md | 13 +++++++++++-- 1 file changed, 11 insertions(+), 2 deletions(-) diff --git a/README.md b/README.md index b37bfd3..9cb74c7 100644 --- a/README.md +++ b/README.md @@ -1,6 +1,6 @@ # AutoKara -AI that can time automatically karas. +Experiment in automatic karaoke timing. # Some documentation first @@ -22,11 +22,20 @@ Get a data buffer from an audio file: - https://stackoverflow.com/questions/49279425/extract-human-vocals-from-song - https://www.geeksforgeeks.org/python-speech-recognition-on-large-audio-files/ +### Extracting vocals from music + +- https://github.com/deezer/spleeter/ + +### Syllable segmentation and recognition + + - [Syllable segmentation](https://www.sciencedirect.com/science/article/pii/S1877050916319068/pdf?md5=abc426e84a71cd4f5c0e6bef9713643e&pid=1-s2.0-S1877050916319068-main.pdf&_valck=1) + - [Syllable segmentation and recognition](https://cdn.intechopen.com/pdfs/15947/InTech-Syllable_based_speech_recognition.pdf) + # Installation ## Python -You need python 3 with the tensorflow library. We will set up a virtual envireonment to not mess up with the system. +You need python 3 with the tensorflow library. We will set up a virtual environment to not mess up with the system. To do that you will need [virtualenv](https://virtualenv.pypa.io/en/stable/) and [virtualenvwrapper](https://virtualenvwrapper.readthedocs.io/en/latest/). -- GitLab