Speech Communication
Speech Communication covers speech production and speech perception of the sounds used in spoken human language. It is a highly interdisciplinary field that is studied by several academic disciplines including acoustics, psychology, speech pathology, linguistics, cognitive science, communication studies, computer science, and signal processing. Research in Speech Communication at our lab focuses on automatic speech processing techniques for human-machine-interaction, for enhancing speech transmission, and for improving life quality of disabled persons. This involves research topics in speech signal processing, like speech analysis, speech enhancement and transmission, as well as research topics in automatic speech communication, like acoustic source localization, speech and speaker recognition, and several language technologies. To this end, our lab provides the Speech Communication research area with an ideal combination of other research areas, like nonlinear signal processing, and intelligent systems.
Finished PhD Theses:
- 2025: What's so complex about conversational speech? Prosodic prominence and speech recognition challenges — Julian Linke
- 2021: Towards the Evolution of Neural Acoustic Beamformers — Lukas Pfeifenberger
- 2019: Speech Enhancement Using Deep Neural Beamformers — Matthias Zöhrer
- 2019: Contributions to Single-Channel Speech Enhancement with a Focus on the Spectral Phase — Johannes Stahl
- 2017: Localization, Characterization, and Tracking of Harmonic Sources: With Applications to Speech Signal Processing — Hannes Pessentheiner
- 2015: The Bionic Electro-Larynx Speech System - Challenges, Investigations, and Solutions — Anna Katharina Fuchs
- 2014: Diplophonic Voice: Definitions, models, and detection — Philipp Aichinger
- 2013: Kernel PCA and Pre-Image Iterations for Speech Enhancemen — Christina Leitner
- 2012: Probabilistic Model-Based Multiple Pitch Tracking of Speech — Michael Wohlmayr
- 2011: Auditory Inspired Methods for Multiple Speaker Localization and Tracking Using a Circular Microphone Array — Tania Habib
- 2010: Source-Filter Model Based Single Channel Speech Separation — Michael Stark
- 2010: Phonetic Similarity Matching of Non-Literal Transcripts in Automatic Speech Recognition — Stefan Petrik
- 2009: Speech Enhancement for Disordered and Substitution Voices — Martin Hagmüller
- 2009: Speech Watermarking and Air Traffic Control — ~Konrad Hofbauer
- 2007: Variable Delay Speech Communication over Packet-Switched Networks — ~Muhammad Sarwar Ehsan
- 2007: Semantic Similarity in Automatic Speech Recognition for Meetings — Michael Pucher
- 2007: Wavelet Analysis For Robust Speech Processing and Applications — Van Tuan Pham
- 2006: Quality Aspects of Packet-Based Interactive Speech Communication — Florian Hammer
- 2005: Sparse Pulsed Auditory Representations For Speech and Audio Coding — Christian Feldbauer
- 2003: Improving automatic speech recognition for pluricentric languages exemplified on varieties of German — ~Micha Baum