Signal Processing and Speech Communication Laboratory
homeresearch projects › OneVoice - Speech Enhancement for Dictation Systems

OneVoice - Speech Enhancement for Dictation Systems

2006 — 2008
Österreichische Forschungsförderungsgesellschaft mbH , FFG
  • Österreichische Akademie der Wissenschaften, ÖAW
  • Philips Austria GmbH, Speech Processing - Dictation Systems
Research Areas

      Speech recording is a common practice in daily professional activities, such as for lawyers, physicians, journalists and architects, among others. The combination of dictation systems with automatic speech recognition (ASR) is being demanded today as the natural procedure to take over their daily transcription routines. However, in those working environments (e.g. hospital, court of law, street, etc.), it is not always possible to record in silent or noise-free conditions, this fact causing ASR to become unreliable. The researchers in oneVoice have developed several novel signal processing-based techniques for analyzing speech with natural intonation. These methods represent the scientific basis of the project outcome, namely, a new single-channel speech enhancement/coding system that removes the background interferences present in the recording.