Signal Processing and Speech Communication Laboratory
hometheses & projects › Automatic Pitch Accent Annotation of Conversational German

Automatic Pitch Accent Annotation of Conversational German

Status
Finished
Type
Bachelor Project
Announcement date
08 Oct 2023
Student
Maximilian Karner and Adam Toeroek
Mentors
Research Areas

The nature of conversational speech compared to read speech poses a significant challenge for Automatic Speech Recog- nition (ASR) systems. There are several existing studies that focus on improving the performance of ASR by us- ing prosodic features. The accurate annotation of prosodic characteristics like stress and intonation is a key part of this process. This bachelor project focuses on categorizing words into different pitch accent classes according to the Kiel Intonation Model (KIM), and investigates the influence of diverse acoustic features on the accuracy of the classification, focusing on the impact of entropy-based chroma features. The classification experiments consist of training and testing a Random Forest classifier with multiple combinations of pitch accent categories and features, and comparing them based on their impact on classification performance. The experiments show that classical prosodic features generally outperform chroma features in pitch accent classification, with F0-based features being the most important. The highest classification accuracy was achieved in binary classification tasks involving semantically distinct classes, followed by groupings based on semantic-functional similarity. Groupings based on contour shapes delivered moderate performance, with entropy based chroma features showing minimal impact across all setups.