Publikationsdetails

Zitat:	Jan Schlüter, Reinhard Sonnleitner, "Unsupervised Feature Learning for Speech and Music Detection in Radio Broadcasts." : Proceedings of the 15th Int. Conference on Digital Audio Effects (DAFx-12),, 9-2012
Original Titel:	Unsupervised Feature Learning for Speech and Music Detection in Radio Broadcasts.
Sprache des Titels:	Englisch
Original Buchtitel:	Proceedings of the 15th Int. Conference on Digital Audio Effects (DAFx-12),
Original Kurzfassung:	Detecting speech and music is an elementary step in extracting information from radio broadcasts. Existing solutions either rely on general-purpose audio features, or build on features specifically engineered for the task. Interpreting spectrograms as images, we can apply unsupervised feature learning methods from computer vision instead. In this work, we show that features learned by a mean-covariance Restricted Boltzmann Machine partly resemble engineered features, but outperform three hand-crafted feature sets in speech and music detection on a large corpus of radio recordings. Our results demonstrate that unsupervised learning is a powerful alternative to knowledge engineering.
Sprache der Kurzfassung:	Englisch
Erscheinungsmonat:	9
Erscheinungsjahr:	2012
Anzahl der Seiten:	8
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Jan Schlüter, Reinhard Sonnleitner
Forschungseinheiten:	Institut für Computational Perception

Wissenschaftsgebiete:	Informatik (ÖSTAT:1108) Artificial Intelligence (ÖSTAT:1122) Digitale Bildverarbeitung und Graphik (ÖSTAT:1124)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum