Implementation of a Maximum Negentropy Beamformer

home › theses & projects › Implementation of a Maximum Negentropy Beamformer

Implementation of a Maximum Negentropy Beamformer

Status

Open

Type

Master Project

Announcement date

28 Sep 2015

Mentors

Hannes Pessentheiner
Martin Hagmüller

Research Areas

Speech Communication

Description

A distant speech recognition system is fundamental for voice-operated ambient assisted living facilities. A common purpose of such a system is to capture the wave field, locate a target, e.g., a speaker in a crowded room, set a focus (a virtual beam) on it, and enhance its speech. The same applies to close-talking systems, e.g., a speaker using a hand-held device in a crowded public transport vehicle.

In both cases, the system consists of several modules, whereas one module is related to spatial filtering, i.e., beamforming. A beamformer exploits directional or positional information provided by a target localizer or tracker. It enables us to spatially filter a wave field captured by a microphone array in, e.g., a room so that we can eliminate signal components of interfering targets or noise sources. The result is an enhanced monaural signal with dominating target signal components and attenuated interfering target or noise components. Such a signal can be fed into, e.g., a word recognizer to transform the target signal into symbols or words.

We are currently working on distant- and close-talking speech recognition applications, where beamforming is an indispensable tool. By now, we’ve evaluated the performances of common beamformers, e.g., the delay and sum (DS) beamformer, the minimum variance distortionless response (MVDR) beamformer, and our recently introduced convex-optimized (CVX) beamformer. However, there is still a bunch of beamformers we have not evaluated yet, e.g., the maximum negentropy (MN) beamformer, etc. We need to implement this beamformer to do more comprehensive performance evaluations and to be able to apply more suitable beamformers to specific problems.

Tasks

Literature review of the MN beamformer.
Implementation in MATLAB.
Evaluation of its performance (speech database and speech recognizer provided).
Report about beamformer, experiments, and findings (in English).

Your Profile / Requirements

The candidate should be interested in literature reviews (papers), spatial filtering (beamforming), digital signal processing, and MATLAB programming.

References

Tashev, I., ``Sound Capture and Processing: Practical Approaches,’’ Wiley, August, 2009.
Kumatani, K., McDonough, J., Raj, B., ``Microphone Array Processing for Distant Speech Recognition: From close-talking microphones to far-field sensors,’’ IEEE Signal Processing Magazine, 29(6):127-140, November, 2012.