IBM Cloud Speech-to-Text: Two Implementations

Thomas David Kehoe
10 min readApr 2, 2020

Three years ago I implemented IBM Cloud Speech-to-Text into my web app LanguageTwo. It was the worst part of my app. The streaming WebSockets connection was unreliable at best. I’ve spent three weeks fixing it.

My project uses AngularJS and Firebase.

My first plan was to discontinue the WebSockets streaming and instead record each audiofile, save it to the database, send the file to the IBM Cloud for processing, and then send the response to the browser to display to the user.

--

--

Thomas David Kehoe

I make technology for speech clinics to treat stuttering and other disorders. I like backpacking with my dog, competitive running, and Russian jokes.