Version control for speech corpora

home › theses & projects › Version control for speech corpora

Version control for speech corpora

Status

Finished

Type

Master Thesis

Announcement date

08 Jan 2021

Student

Vlad Dumitru

Mentors

Research Areas

Speech Communication

A speech corpus consists of a set of audio recordings and, for each of the recordings, a set of annotation layers representing various features of the audio signal.

Developing such a corpus is an expensive task in terms of the amount of work required, and a usual approach is having multiple annotators work in parallel on the same recordings.

As the size and complexity of such a corpus increases, two problems require addressing:

Storing the contents of the corpus in an efficient manner, such that it can be both served as a static package, and allow analysis on the contents, such as by performing queries on the various layers of annotations;
Supporting collaborative work, allowing annotators to work in parallel, by merging their partial results.

In addition, easy interoperability with external tools is desirable.