Publikationsdetails

Zitat:	Matthias Dorfer, Florian Henkel, Gerhard Widmer, "Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game." : Proceedings of 19th International Society for Music Information Retrieval Conference (ISMIR), 2018
Original Titel:	Learning to Listen, Read, and Follow: Score Following as a Reinforcement Learning Game.
Sprache des Titels:	Englisch
Original Buchtitel:	Proceedings of 19th International Society for Music Information Retrieval Conference (ISMIR)
Original Kurzfassung:	Score following is the process of tracking a musical performance (audio) with respect to a known symbolic representation (a score). We start this paper by formulating score following as a multimodal Markov Decision Process, the mathematical foundation for sequential decision making. Given this formal definition, we address the score following task with state-of-the-art deep reinforcement learning (RL) algorithms such as synchronous advantage actor critic (A2C). In particular, we design multimodal RL agents that simultaneously learn to listen to music, read the scores from images of sheet music, and follow the audio along in the sheet, in an end-to-end fashion. All this behavior is learned entirely from scratch, based on a weak and potentially delayed reward signal that indicates to the agent how close it is to the correct position in the score. Besides discussing the theoretical advantages of this learning paradigm, we show in experiments that it is in fact superior compared to previously proposed methods for score following in raw sheet music images.
Sprache der Kurzfassung:	Englisch
Erscheinungsjahr:	2018
Anzahl der Seiten:	8
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Matthias Dorfer, Florian Henkel, Gerhard Widmer
Forschungseinheiten:	Institut für Computational Perception

Wissenschaftsgebiete:	Informatik (ÖSTAT:102) Artificial Intelligence (ÖSTAT:102001) Bildverarbeitung (ÖSTAT:102003) Informationssysteme (ÖSTAT:102015) Audiovisuelle Medien (ÖSTAT:202002)

Forschungsprojekte:	Con Espressione - Getting at the Heart of Things: Towards Expressivity-aware Computer Systems in Music (ERC Advanced Grant) (Anfangsjahr: 2016)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum