Nao microphones and Google API in ROS

asked 2014-04-10 17:11:59 -0500

Mehdi.

updated 2014-04-10 17:13:42 -0500


I would like to implement speech detection that sends spoken sentences to the Google API and gets the text back. It would listen permanently and would not need a predefined vocabulary. Before starting, I thought it would be better to ask whether somebody has already done something like this.

I'm looking for something similar. Have you tried the audio_common package? It should capture the sound from the microphones and publish it on the /audio topic. Let us know if it works.

bapclenet ( 2014-06-07 10:50:14 -0500 )

2 Answers

answered 2014-05-27 01:10:50 -0500

rmb209

Something similar has been done here in Python.

This wasn't on a Nao but our own proprietary robot.

Hope it may be of use.

answered 2014-05-27 05:57:01 -0500

severin

For a simpler library, you may want to check gspeet: it's a small Python library that listens (continuously or not) on the microphone using ALSA (so it should work on Nao), encodes the speech with the Speex encoder (actually a Google variant of it, though it still requires libspeex), sends it to the Google speech API, and returns the result.

Integration in a ROS node should be trivial, but the library itself is independent of ROS.
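To give an idea of what that integration could look like, here is a minimal sketch, not tested on a Nao. The names `speech_to_topic`, `make_ros_publisher`, the node name `speech_recognizer` and the topic `speech_text` are made up for illustration. The `recognize` argument is a hypothetical stand-in for whatever call your speech library exposes; it is not the library's actual API. Only the `rospy` and `std_msgs` calls are real ROS ones.

```python
def speech_to_topic(utterances, recognize, publish):
    """Send each captured utterance to a recognizer and publish the text.

    utterances: iterable of raw audio buffers (bytes), one per utterance.
    recognize:  callable(bytes) -> str or None. Hypothetical stand-in for
                the library call that queries the Google speech API.
    publish:    callable(str). In a ROS node this wraps Publisher.publish.
    Returns the number of sentences published.
    """
    count = 0
    for audio in utterances:
        text = recognize(audio)
        if not text:
            # Nothing was recognized for this utterance: skip it.
            continue
        publish(text)
        count += 1
    return count


def make_ros_publisher(topic="speech_text"):
    """Return a publish(text) callable backed by a std_msgs/String topic.

    Needs a working ROS environment. The node and topic names are
    assumptions chosen for this example.
    """
    # Imported lazily so speech_to_topic can be tried without ROS installed.
    import rospy
    from std_msgs.msg import String

    rospy.init_node("speech_recognizer")
    pub = rospy.Publisher(topic, String, queue_size=10)
    return lambda text: pub.publish(String(data=text))
```

In a real node you would call `speech_to_topic(audio_source, recognizer, make_ros_publisher())`, where `audio_source` and `recognizer` come from your audio capture and speech library. Keeping the loop free of ROS imports makes it easy to try with fake data first.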

