Publikationsdetails

Zitat:	Paul Primus, Hamid Eghbal-Zadeh, David Eitelsebner, Khaled Koutini, Andreas Arzt, Gerhard Widmer, "Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification" : Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019), 2019
Original Titel:	Exploiting Parallel Audio Recordings to Enforce Device Invariance in CNN-based Acoustic Scene Classification
Sprache des Titels:	Englisch
Original Buchtitel:	Proceedings of the Detection and Classification of Acoustic Scenes and Events 2019 Workshop (DCASE2019)
Original Kurzfassung:	Distribution mismatches between the data seen at training and at application time remain a major challenge in all application areas of machine learning. We study this problem in the context of ma-chine listening (Task 1b of the DCASE 2019 Challenge). We pro-pose a novel approach to learn domain-invariant classifiers in an end-to-end fashion by enforcing equal hidden layer representations for domain-parallel samples, i.e. time-aligned recordings from different recording devices. No classification labels are needed for our domain adaptation (DA) method, which makes the data collection process cheaper. We show that our method improves the tar-get domain accuracy for both a toy dataset and an urban acoustic scenes dataset. We further compare our method to Maximum Mean Discrepancy-based DA and find it more robust to the choice of DA parameters. Our submission, based on this method, to DCASE 2019Task 1b gave us the 4th place in the team ranking.
Sprache der Kurzfassung:	Englisch
Erscheinungsjahr:	2019
Anzahl der Seiten:	5
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Paul Primus, Hamid Eghbal-Zadeh, David Eitelsebner, Khaled Koutini, Andreas Arzt, Gerhard Widmer
Forschungseinheiten:	Institut für Computational Perception

Wissenschaftsgebiete:	Informatik (ÖSTAT:102) Artificial Intelligence (ÖSTAT:102001) Bildverarbeitung (ÖSTAT:102003) Informationssysteme (ÖSTAT:102015) Audiovisuelle Medien (ÖSTAT:202002)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum