Publication Details

Citation:	Markus Hofmarcher, Thomas Schmied, Fabian Paischer, Razvan Pascanu, Sepp Hochreiter, "Learning to Modulate pre-trained Models in RL", in: Workshop on Reincarnating Reinforcement Learning at ICLR 2023, 2023
Original title:	Learning to Modulate pre-trained Models in RL
Language of the title:	English
Original book title:	Workshop on Reincarnating Reinforcement Learning at ICLR 2023
Original abstract:	Reinforcement Learning (RL) has experienced great success in complex games and simulations. However, RL agents are often highly specialized for a particular task, and it is difficult to adapt a trained agent to a new task. In supervised learning, an established paradigm is multi-task pre-training followed by fine-tuning. A similar trend is emerging in RL, where agents are pre-trained on data collections that comprise a multitude of tasks. Despite these developments, it remains an open challenge how to adapt such pre-trained agents to novel tasks while retaining performance on the pre-training tasks. In this regard, we pre-train an agent on a set of tasks from the Meta-World benchmark suite and adapt it to tasks from Continual-World. We conduct a comprehensive comparison of fine-tuning methods originating from supervised learning in our setup. Our findings show that fine-tuning is feasible, but for existing methods, performance on previously learned tasks often deteriorates. Therefore, we propose a novel approach that avoids forgetting by modulating the information flow of the pre-trained model. Our method outperforms existing fine-tuning approaches, and achieves state-of-the-art performance on the Continual-World benchmark. To facilitate future research in this direction, we collect datasets for all Meta-World tasks and make them publicly available.
Language of the abstract:	English
Year of publication:	2023
Number of pages:	33
URL for further information:	https://openreview.net/forum?id=Us6BtPZGei3
Reach:	international
Publication type:	Article / paper in conference proceedings (refereed)
Authors:	Markus Hofmarcher, Thomas Schmied, Fabian Paischer, Razvan Pascanu, Sepp Hochreiter
Research units:	Institute of Signal Processing; Institute for Machine Learning; LIT Artificial Intelligence Lab

Fields of science:	Biomathematics (ÖSTAT:101004); Numerical mathematics (ÖSTAT:101014); Operations Research (ÖSTAT:101015); Optimisation (ÖSTAT:101016); Game theory (ÖSTAT:101017); Statistics (ÖSTAT:101018); Stochastics (ÖSTAT:101019); Probability theory (ÖSTAT:101024); Time series analysis (ÖSTAT:101026); Dynamical systems (ÖSTAT:101027); Mathematical modelling (ÖSTAT:101028); Mathematical statistics (ÖSTAT:101029); Approximation theory (ÖSTAT:101031); Computer sciences (ÖSTAT:102); Artificial Intelligence (ÖSTAT:102001); Image processing (ÖSTAT:102003); Bioinformatics (ÖSTAT:102004); Human-Computer Interaction (ÖSTAT:102013); Artificial neural networks (ÖSTAT:102018); Machine Learning (ÖSTAT:102019); Computational Intelligence (ÖSTAT:102032); Data Mining (ÖSTAT:102033); Statistical physics (ÖSTAT:103029); Bioinformatics (ÖSTAT:106005); Biostatistics (ÖSTAT:106007); Embedded Systems (ÖSTAT:202017); Robotics (ÖSTAT:202035); Sensor systems (ÖSTAT:202036); Signal processing (ÖSTAT:202037); Computer-aided diagnosis and therapy (ÖSTAT:305901); Medical informatics (ÖSTAT:305905); Medical statistics (ÖSTAT:305907)

Research projects:	JKU LIT SAL eSPML Lab (start year: 2020)
