Sample Based Glottal Excitation Signal Database

home › theses & projects › Sample Based Glottal Excitation Signal Database

Sample Based Glottal Excitation Signal Database

Status

Open

Type

Master Thesis

Announcement date

06 Mar 2013

Mentors

Research Areas

Audio and Acoustics

Short Description

A widely used model for natural speech production is the source-filter model. It is described by the larynx and the containing vocal folds which represents the source of excitation and the vocal tract as the linear acoustic filter. There are several approaches to model the glottal excitation signal including simple impulse trains, physical models (One-Mass model) and waveform models (Liljencrants-Fant model).
In speech synthesis that is based on the source-filter theory, it is known that the quality suffers due to an over-simplified excitation model. It can be shown that glottal source pulses computed from real speech using inverse filtering improves the task compared to the models mentioned above [1].
This leads to the conclusion that also electro-larynx speech, where the source of human speech is replaced by a hand-held, battery-driven device, can profit from an excitation signal from a database with real glottal source pulses.

Your Tasks

Review of literature
Creating glottal excitation signal database
Comparison to other approaches
Documentation

Your Profile/Prerequisites

Motivation and interest in the topic
Background information in Matlab and (speech) signal processing

References

K. Matsui, S. Person, K. Hata, and T. Kamai, “Improving naturalness in text-to-speech synthesis using natural glottal source,” in Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Converence on, pp. 769-772 vol.2.