University of Surrey

Test tubes in the lab Research in the ATI Dance Research

Low rank modelling for polyphonic music analysis.

O'Brien, Cian (2020) Low rank modelling for polyphonic music analysis. Doctoral thesis, University of Surrey.

[img]
Preview
Text
thesis.pdf - Version of Record
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (3MB) | Preview

Abstract

This thesis deals with the problem of Automatic Music Transcription (AMT), which aims to extract the pitch and timing information from recorded music signals. AMT is a challenging problem that is closely related to source separation and sparse mod- eling. Many approaches use latent variable models where the goal is to extract the underlying explanatory factors (musical pitches) which best explain the signal in question. A fundamental technique is the Nonnegative Matrix Factorization (NMF) algorithm which seeks to decompose the signal into a linear combination of nonneg- ative templates. However, NMF fails to account for the structure of music signals such as time smoothness. We introduce extensions of NMF to more accurately model music signals. The motivating assumption is that good transcriptions tend to have a low-rank structure, which when taken into account can improve the transcription performance. First, we extend classical NMF to a Low-Rank NMF model, based on work in low- rank matrix completion. We explore the connection between optimization of the matrix nuclear norm and proximal algorithms to derive a model that results in low- rank transcriptions. The nuclear norm approach is then extended to non-convex penalties which more accurately reflect the desired low-rank assumption. Next, we extend these ideas to deal with models in which the resulting transcription is locally low-rank, which we argue is a better model of music signals. An algorithm based on NMF and submodular function optimization is introduced, which learns a collection of local models. It is shown that this leads to further improvement for the AMT task. Finally, we develop a probabilistic framework that represents the signal using a hi- erarchy of local models. and discuss the interpretation of the proposed approaches as hard and soft clustering methods. We find that the proposed probabilistic “soft clustering” algorithm leads to further performance gains for the AMT task, outper- forming comparable state-of-the-art AMT systems which are based on NMF.

Item Type: Thesis (Doctoral)
Divisions : Theses
Authors : O'Brien, Cian
Date : 31 July 2020
Funders : EU 7th Framework Programme
DOI : 10.15126/thesis.00858084
Contributors :
ContributionNameEmailORCID
http://www.loc.gov/loc.terms/relators/THSPlumbley, Markm.plumbley@surrey.ac.uk
http://www.loc.gov/loc.terms/relators/THSWang, Wenwuw.wang@surrey.ac.uk
Depositing User : O'Brien O'Brien
Date Deposited : 29 Jul 2020 17:10
Last Modified : 29 Jul 2020 17:10
URI: http://epubs.surrey.ac.uk/id/eprint/858084

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year


Information about this web site

© The University of Surrey, Guildford, Surrey, GU2 7XH, United Kingdom.
+44 (0)1483 300800