Publikationsdetails

Zitat:	Fabian Paischer, Thomas Adler, Andreas Radler, Markus Hofmarcher, Sepp Hochreiter, "Foundation Models for History Compression in Reinforcement Learning" : Neural Information Processing Systems Foundation (NeurIPS 2022), 2022
Original Titel:	Foundation Models for History Compression in Reinforcement Learning
Sprache des Titels:	Englisch
Original Buchtitel:	Neural Information Processing Systems Foundation (NeurIPS 2022)
Original Kurzfassung:	Agents interacting under partial observability require access to past observations via a memory mechanism in order to approximate the true state of the environment. Recent work suggests that leveraging language as abstraction provides benefits for creating a representation of past events. History Compression via Language Models (HELM) leverages a pretrained Language Model (LM) for representing the past. It relies on a randomized attention mechanism to translate environment observations to token embeddings. In this work, we show that the representations resulting from this attention mechanism can collapse under certain conditions. This causes blindness of the agent to subtle changes in the environment that may be crucial for solving a certain task. We propose a solution to this problem consisting of two parts. First, we improve upon HELM by substituting the attention mechanism with a feature-wise centering-and-scaling operation. Second, we take a step toward semantic history compression by leveraging foundation models, such as CLIP, to encode observations, which further improves performance. By combining foundation models, our agent is able to solve the challenging MiniGrid-Memory environment. Surprisingly, however, our experiments suggest that this is not due to the semantic enrichment of the representation presented to the LM, but rather due to the discriminative power provided by CLIP. We make our code publicly available at https://github.com/ml-jku/helm.
Sprache der Kurzfassung:	Englisch
Erscheinungsjahr:	2022
Anzahl der Seiten:	90
Reichweite:	international
Publikationstyp:	Aufsatz / Paper in Tagungsband (referiert)
Autoren:	Fabian Paischer, Thomas Adler, Andreas Radler, Markus Hofmarcher, Sepp Hochreiter
Forschungseinheiten:	Institut für Machine Learning LIT Artificial Intelligence Lab Institut für Signalverarbeitung

Wissenschaftsgebiete:	Biomathematik (ÖSTAT:101004) Numerische Mathematik (ÖSTAT:101014) Operations Research (ÖSTAT:101015) Optimierung (ÖSTAT:101016) Spieltheorie (ÖSTAT:101017) Statistik (ÖSTAT:101018) Stochastik (ÖSTAT:101019) Wahrscheinlichkeitstheorie (ÖSTAT:101024) Zeitreihenanalyse (ÖSTAT:101026) Dynamische Systeme (ÖSTAT:101027) Mathematische Modellierung (ÖSTAT:101028) Mathematische Statistik (ÖSTAT:101029) Approximationstheorie (ÖSTAT:101031) Informatik (ÖSTAT:102) Artificial Intelligence (ÖSTAT:102001) Bildverarbeitung (ÖSTAT:102003) Bioinformatik (ÖSTAT:102004) Human-Computer Interaction (ÖSTAT:102013) Künstliche Neuronale Netze (ÖSTAT:102018) Machine Learning (ÖSTAT:102019) Computational Intelligence (ÖSTAT:102032) Data Mining (ÖSTAT:102033) Statistische Physik (ÖSTAT:103029) Bioinformatik (ÖSTAT:106005) Biostatistik (ÖSTAT:106007) Embedded Systems (ÖSTAT:202017) Robotik (ÖSTAT:202035) Sensorik (ÖSTAT:202036) Signalverarbeitung (ÖSTAT:202037) Computerunterstützte Diagnose und Therapie (ÖSTAT:305901) Medizinische Informatik (ÖSTAT:305905) Medizinische Statistik (ÖSTAT:305907)

Forschungsprojekte:	JKU LIT SAL eSPML Lab (Anfangsjahr: 2020)

fodok.jku.at

Benutzerbetreuung: Sandra Winzer, letzte Änderung:

Johannes Kepler Universität (JKU) Linz, Altenbergerstr. 69, A-4040 Linz, Austria
Telefon + 43 732 / 2468 - 9121, Fax + 43 732 / 2468 - 29121, Internet www.jku.at, Impressum