Signal Processing and Speech Communication Laboratory
hometheses & projects › Automatic Speech Recognition for Dementia Prediction

Automatic Speech Recognition for Dementia Prediction

Status
Open
Type
Master Thesis
Announcement date
18 Oct 2025
Mentors
Research Areas

Short Description

Language is increasingly recognized as a powerful behavioural marker for Alzheimer’s disease (AD). Yet, most research so far relies on highly controlled speaking tasks and manual analyses that are impractical for clinical use. Discourse analysis, for example, has huge diagnostic potential, but the current gold-standard—manual verbatim transcription—can take 4 to 60 hours per hour of recordings.

The long-term vision is to automatically identify linguistic markers of AD and evaluating their potential to differentiate Alzheimer’s-related decline from depression-related cognitive impairment (pseudodementia). To make this clinically feasible, our goal is to develop a clinic-friendly linguistic screening tool that could be applied in real-world diagnostic routines.

The aim of this project is to build a robust automatic speech recognition (ASR) systems for spontaneous, disfluent speech. Unlike standard ASR tasks, this involves handling noisy clinical data, atypical speech patterns, and relatively small datasets, requiring innovative solutions in self-supervised learning, data augmentation and transfer learning.

Your Tasks (depending on specific project):

  • Literature research
  • Data preparation
  • Conduction of ASR Experiments with wav2vec and Whisper
  • Automatic analysis of hesitation markers
  • Analysis and reporting your results (thesis writing)

Your Profile

  • basic knowlegde of sound engineering and/or speech communication
  • good knowledge of programming (e.g., Python)

Contact:

Barbara Schuppler (b.schuppler@tugraz.at)