Generalized Hindsight for Reinforcement Learning

Li, Alexander C.; Pinto, Lerrel; Abbeel, Pieter

Computer Science > Machine Learning

arXiv:2002.11708 (cs)

[Submitted on 26 Feb 2020]

Title:Generalized Hindsight for Reinforcement Learning

Authors:Alexander C. Li, Lerrel Pinto, Pieter Abbeel

View PDF

Abstract:One of the key reasons for the high sample complexity in reinforcement learning (RL) is the inability to transfer knowledge from one task to another. In standard multi-task RL settings, low-reward data collected while trying to solve one task provides little to no signal for solving that particular task and is hence effectively wasted. However, we argue that this data, which is uninformative for one task, is likely a rich source of information for other tasks. To leverage this insight and efficiently reuse data, we present Generalized Hindsight: an approximate inverse reinforcement learning technique for relabeling behaviors with the right tasks. Intuitively, given a behavior generated under one task, Generalized Hindsight returns a different task that the behavior is better suited for. Then, the behavior is relabeled with this new task before being used by an off-policy RL optimizer. Compared to standard relabeling techniques, Generalized Hindsight provides a substantially more efficient reuse of samples, which we empirically demonstrate on a suite of multi-task navigation and manipulation tasks. Videos and code can be accessed here: this https URL.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE); Robotics (cs.RO); Machine Learning (stat.ML)
Cite as:	arXiv:2002.11708 [cs.LG]
	(or arXiv:2002.11708v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2002.11708

Submission history

From: Alexander Li [view email]
[v1] Wed, 26 Feb 2020 18:57:05 UTC (8,431 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2020-02

Change to browse by:

cs
cs.AI
cs.NE
cs.RO
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Alexander C. Li
Lerrel Pinto
Pieter Abbeel

export BibTeX citation

Computer Science > Machine Learning

Title:Generalized Hindsight for Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Generalized Hindsight for Reinforcement Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators