Best Channel Selection for Distant Speech Recognition
- Status
- Finished
- Type
- Master Project
- Announcement date
- 03 Oct 2016
- Student
- Jonas Helm
- Mentors
- Research Areas
Short Description
Conventionally multi-channel microphones acquisition is used for beamforming to reduce interfering sound and reverberation. An alternative is choosing the best channel. This work well if the microphones are not organised as a single array, but as a network of microphones. In recent years several algorithms have been proposed to determine the best microphone channel, that is then used for automatic speech recognition. One approach is e.g. based on the envelope of the speech signal, which gets smeared by reverberation and is there a cue to the quality of the detected signal. A more sophisticated approach uses a frequency dependent analysis, the modulation frequency, just to mention a few.
At the SPSC Lab, we have built a live voice controlled home automation system for Austrian German in our meeting room and kitchen. Microphones are distributed over several walls and the ceiling. We want to have channel selection as an option for our system as a pre-processing step for automatic speech recognition.
Your Tasks
- Review of recent literature on channel selection for ASR
- Choose at least one promising approach
- Real-time implementation of this approach preferably in Python
- Documentation
Your Profile
- Motivation and interest in the topic
- Strong background in (Speech) Signal Processing
- Experience in Python or Matlab
- Speech Communication 1 completed
Contact
Martin Hagmüller (hagmueller@tugraz.at or 0316/873 4377)
References
Martin Wolf, Climent Nadeu, Channel selection measures for multi-microphone speech recognition, Speech Communication, Volume 57,pp 170-180 http://dx.doi.org/10.1016/j.specom.2013.09.015