Google speech recognition not recognizing certain words / phrases like um and er | python

Question

So it seems google speech recognition is taking out certain parts of my speech like um, er and ahh. The problem is I want these to be recognized, I can not seem to figure out how to enable this. Here is the code: It works as wanted just google takes out the vocal imperfections. Does anyone know how to enable

Accepted Answer

I took a look at the Google Cloud Speech-to-text API docs and didn&#8217;t see anything relevant (as of March 2022). I also came across these related resources:Detecting filler words in speech-to-textHow can I detect filler words like &#8220;ah, um&#8221; using a speech-to-text API like Google Speech API? (Quora)FillerWordShock &#8211; one person&#8217;s research on this topicAll evidence suggests that it isn&#8217;t possible to use the Google Cloud Speech-to-text service (at this time), and that you&#8217;ll have to seek alternative services. I won&#8217;t rehash the alternatives listed in the resources, but several are provided and you&#8217;ll have to pick which one best suits your particular needs.Also, you may already know this (so apologies if you do), but these types of words are typically called &#8220;filler&#8221; and/or &#8220;hesitation&#8221; words. That might be helpful to you while researching the topic.The good news is that the SpeechRecognition module (I think that&#8217;s what you&#8217;re using based on your code) supports several different engines, so hopefully one of those provides filler words.

Advertisement

Answer