Publikationsdetails

Zitat:	Jan Schlüter, "Learning to Pinpoint Singing Voice from Weakly Labeled Examples" : Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR, 8-2016
Original Titel:	Learning to Pinpoint Singing Voice from Weakly Labeled Examples
Sprache des Titels:	Englisch
Original Buchtitel:	Proceedings of the 17th International Society for Music Information Retrieval Conference (ISMIR
Original Kurzfassung:	Building an instrument detector usually requires temporally accurate ground truth that is expensive to create. However, song-wise information on the presence of instruments is often easily available. In this work, we investigate how well we can train a singing voice detection system merely from song-wise annotations of vocal presence. Using convolutional neural networks, multipleinstance learning and saliency maps, we can not only detect singing voice in a test signal with a temporal accuracy close to the state-of-the-art, but also localize the spectral bins with precision and recall close to a recent source separation method. Our recipe may provide a basis for other sequence labeling tasks, for improving source separation or for inspecting neural networks trained on auditory spectrograms.
Sprache der Kurzfassung:	Englisch
Erscheinungsmonat:	8
Erscheinungsjahr:	2016
Anzahl der Seiten:	7
URL zu weiteren Infos:	http://ofai.at/~jan.schlueter/pubs/2016_ismir.pdf
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Jan Schlüter
Forschungseinheiten:	Institut für Computational Perception

Wissenschaftsgebiete:	Informatik (ÖSTAT:102) Artificial Intelligence (ÖSTAT:102001) Bildverarbeitung (ÖSTAT:102003) Informationssysteme (ÖSTAT:102015) Audiovisuelle Medien (ÖSTAT:202002)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum