RL Unplugged is suite of benchmarks for offline reinforcement learning. The RL Unplugged is designed around the following considerations: to facilitate ease of use, we provide the datasets with a unified API which makes it easy for the practitioner to work with all data in the suite once a general pipeline has been established. This is a dataset accompanying the paper RL Unplugged: Benchmarks for Offline Reinforcement Learning.
In this suite of benchmarks, we try to focus on the following problems:
-High dimensional action spaces, for example the locomotion humanoid domains, we have 56 dimensional actions.
-High dimensional observations.
-Partial observability, observations have egocentric vision.
-Difficulty of exploration, using states of the art algorithms and imitation to generate data for difficult environments.
-Real world challenges.