Signal Processing and Speech Communication Laboratory
hometheses & projects › Hybrid ML for Music Analysis

Hybrid ML for Music Analysis

Status
Open
Type
Master Thesis
Announcement date
22 Oct 2024
Mentors
Research Areas

Description

Hybrid machine-learning (ML) approaches that combine deep-learning with model based approaches promises the “best of both worlds.” While some methods can be combined in a common framework, e.g. mean-field variational message passing and variational autoencoders [1], realizing such a hybrid methods is not trivial. Challenges arise e.g. due to the computational complexity of model-based algorithms which slows down the training of the ML part. Thus, the aim of this thesis is to investigate hybrid inference methods that combine deep-learning with model-based approaches, e.g. in the context of multi-pitch estimation [2] where the signals from tonal instruments can be well-modeled as a periodic signal (e.g. using Fourier series) but non-tonal instruments like a drum kit or other percussive instruments cannot be modeled in the same way.

Your Tasks:

  • Literature research on hybrid ML methods.
  • Discuss/propose an architecture of a ``neural enhanced’’ multi-pitch detection algorithm.
  • Implement and train the proposed algorithm/architecture.
  • Evaluate the performance and compare it against other methods.
  • Write your thesis.

What we expect from you:

  • Familiar with (statistical) signal processing.
  • Experience with training of ML-methods (particularly variational autoencoders) is beneficial.
  • Experience with Tensorflow, PyTorch or similar ML-toolchains is also beneficial.

Contact

If you are interested and want to know more about it, send an email to either jakob.moederl@tugraz.at or erik.leitinger@tugraz.at, or visit us in our offices at Inffeldgasse 12/I room HF01042 or Inffeldgasse 16c/EG room IDEG128.

References

  1. M. J. Johnson, D. K. Duvenaud, A. Wiltschko, R. P. Adams, and S. R. Datta. “Composing graphical models with neural networks for structured representations and fast inference,” in Advances in Neural Information Processing Systems 29, 2016.
  2. J. Möderl, F. Pernkopf, K. Witrisal, and E. Leitinger, “Variational Inference of Structured Line Spectra Exploiting Group-Sparsity,” arXiv preprint, 2023, doi: 10.48550/arXiv.2303.03017