Welcome!
In 2000, the Signal Processing and Speech Communication Laboratory (SPSC Lab) of Graz University of Technology (TU Graz) was founded as a research and education center in nonlinear signal processing and computational intelligence, algorithm engineering, as well as circuits & systems modeling and design. It covers applications in wireless communications, speech/audio communication, and telecommunications.
If you want to learn more about Signal Processing, click: What is Signal Processing?
The Research of SPSC Lab addresses fundamental and applied research problems in five scientific areas:
Result of the Month
On Disfluency and Non-lexical Sound Labeling for End-to-end Automatic Speech Recognition [link]
Abstract: Spontaneous speech contains a significant amount of disfluencies and non-lexical sounds (e.g., backchannels, filled pauses), which are often difficult to transcribe. Disfluency labeling for automatic speech recognition (ASR) aims at editing these phenomena in the transcription to improve overall recognition accuracy. Such labeling techniques typically delete nonlexical/disfluent labels from the prediction, where classical ASR techniques either ignore or treat them as lexical items. Our results, obtained by systematic comparison and detailed evaluation of various disfluency labeling methods on two different language conversational corpora, suggest that neither of the previous approaches are optimal. We propose to distinguish between filled pauses and meaningful conversational grunts and show that keeping the non-lexical labels is not only possible but as low as 7% label error rates can be achieved for highly important categories (including ’mhm’) while preserving a decent WER.
Index Terms: end-to-end speech recognition, disfluency, conversational speech, filled pauses, Hungarian, Austrian German
Konferenz: Proc. of Interspeech 2024, pp. 1270–1274, 2024. (1-5 September 2024, Kos, Greece)
Contact: Julian Link, Barbara SchupplerLatest News
17 Jul 2024 Research and Teaching Associate (Pre-Doc) in Signal Processing and Speech Communication
04 Jun 2024 Menschliche Gespräche mit einem Roboterkopf
08 Sep 2023 Research and Teaching Associate in Signal Processing and Speech Communicartion (Two PhD position)
06 Oct 2022 Student Projects Information Event: 14.10., 15:00 (BSc SP; MSc Project and Master Theses)
22 Mar 2022 Two PhD Positions in Wireless Communications and Positioning
03 Mar 2022 Press release on AI based denoising filters
03 Mar 2022 Christian Knoll received the Josef Krainer Award for his PhD Thesis
20 Apr 2021 Press release covering the H2020 project REINDEER has been published
01 Mar 2021 Course "Array Signal Processing" starting end of May 2021
01 Jan 2021 H2020 project REINDEER has been started
Check older news here.