Publikationsdetails

Zitat:	Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter, "Semantic HELM: A Human-Readable Memory for Reinforcement Learning" : Conference Neural Information Processing Systems Foundation (NeurIPS 2023), 2023
Original Titel:	Semantic HELM: A Human-Readable Memory for Reinforcement Learning
Sprache des Titels:	Englisch
Original Buchtitel:	Conference Neural Information Processing Systems Foundation (NeurIPS 2023)
Original Kurzfassung:	Reinforcement learning agents deployed in the real world often have to cope with partially observable environments. Therefore, most agents employ memory mechanisms to approximate the state of the environment. Recently, there have been impressive success stories in mastering partially observable environments, mostly in the realm of computer games like Dota 2, StarCraft II, or MineCraft. However, existing methods lack interpretability in the sense that it is not comprehensible for humans what the agent stores in its memory. In this regard, we propose a novel memory mechanism that represents past events in human language. Our method uses CLIP to associate visual inputs with language tokens. Then we feed these tokens to a pretrained language model that serves the agent as memory and provides it with a coherent and human-readable representation of the past. We train our memory mechanism on a set of partially observable environments and find that it excels on tasks that require a memory component, while mostly attaining performance on-par with strong baselines on tasks that do not. On a challenging continuous recognition task, where memorizing the past is crucial, our memory mechanism converges two orders of magnitude faster than prior methods. Since our memory mechanism is human-readable, we can peek at an agent's memory and check whether crucial pieces of information have been stored. This significantly enhances troubleshooting and paves the way toward more interpretable agents.
Sprache der Kurzfassung:	Englisch
Erscheinungsjahr:	2023
Anzahl der Seiten:	29
URL zu weiteren Infos:	https://arxiv.org/abs/2306.09312
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Fabian Paischer, Thomas Adler, Markus Hofmarcher, Sepp Hochreiter
Forschungseinheiten:	Institut für Machine Learning LIT Artificial Intelligence Lab

Wissenschaftsgebiete:	Biomathematik (ÖSTAT:101004) Numerische Mathematik (ÖSTAT:101014) Operations Research (ÖSTAT:101015) Optimierung (ÖSTAT:101016) Spieltheorie (ÖSTAT:101017) Statistik (ÖSTAT:101018) Stochastik (ÖSTAT:101019) Wahrscheinlichkeitstheorie (ÖSTAT:101024) Zeitreihenanalyse (ÖSTAT:101026) Dynamische Systeme (ÖSTAT:101027) Mathematische Modellierung (ÖSTAT:101028) Mathematische Statistik (ÖSTAT:101029) Approximationstheorie (ÖSTAT:101031) Informatik (ÖSTAT:102) Artificial Intelligence (ÖSTAT:102001) Bildverarbeitung (ÖSTAT:102003) Bioinformatik (ÖSTAT:102004) Human-Computer Interaction (ÖSTAT:102013) Künstliche Neuronale Netze (ÖSTAT:102018) Machine Learning (ÖSTAT:102019) Computational Intelligence (ÖSTAT:102032) Data Mining (ÖSTAT:102033) Statistische Physik (ÖSTAT:103029) Bioinformatik (ÖSTAT:106005) Biostatistik (ÖSTAT:106007) Embedded Systems (ÖSTAT:202017) Robotik (ÖSTAT:202035) Sensorik (ÖSTAT:202036) Signalverarbeitung (ÖSTAT:202037) Computerunterstützte Diagnose und Therapie (ÖSTAT:305901) Medizinische Informatik (ÖSTAT:305905) Medizinische Statistik (ÖSTAT:305907)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum