Audio recognition and fingerprint using sklean & librosa [closed]

Question

Closed. This question needs to be more focused. It is not currently accepting answers. Want to improve this question? Update the question so it focuses on one problem only by editing this post. Closed last year. Improve this question I want to create a model that can predict who has speak with different word.…

Accepted Answer

For the sound processing and feature extraction part, librosa is definitely going to provide you all you need.For the machine learning part however, speaker identification (also called &#8220;voice recognition&#8221;) is a relatively complex task. You probably will get more success using techniques from deep learning. You can certainly try to use random forests if you like, but you&#8217;ll probably get a lower accuracy and will have to spend more time doing feature engineering. In fact, it will be a good exercise for you to compare the results you can get with the various techniques.For an example tutorial on speaker identification using Keras, see e.g. this article.

Advertisement

Answer