Asynchronous Methods for Deep Reinforcement Learning.
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu: Asynchronous Methods for Deep Reinforcement Learning. CoRR...
Asynchronous Methods for Deep Reinforcement Learning.
Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu: Asynchronous Methods for Deep Reinforcement Learning. ICML 2016:...
Hybrid computing using a neural network with dynamic external memory.
Alex Graves, Greg Wayne, Malcolm Reynolds, Tim Harley, Ivo Danihelka, Agnieszka Grabska-Barwinska, Sergio Gomez Colmenarejo, Edward Grefenstette, Tiago Ramalho, John P. Agapiou, Adrià Puigdomènech...
Imagination-Augmented Agents for Deep Reinforcement Learning.
Theophane Weber, Sébastien Racanière, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter W....
Neural Episodic Control.
Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech Badia, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell: Neural Episodic Control. CoRR abs/1703.01988 (2017)
Imagination-Augmented Agents for Deep Reinforcement Learning.
Sébastien Racanière, Theophane Weber, David P. Reichert, Lars Buesing, Arthur Guez, Danilo Jimenez Rezende, Adrià Puigdomènech Badia, Oriol Vinyals, Nicolas Heess, Yujia Li, Razvan Pascanu, Peter W....
Neural Episodic Control.
Alexander Pritzel, Benigno Uria, Sriram Srinivasan, Adrià Puigdomènech Badia, Oriol Vinyals, Demis Hassabis, Daan Wierstra, Charles Blundell: Neural Episodic Control. ICML 2017: 2827-2836
Memory-based Parameter Adaptation.
Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell: Memory-based Parameter...
Memory-based Parameter Adaptation.
Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell: Memory-based Parameter...
Generalization of Reinforcement Learners with Working and Episodic Memory.
Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adrià Puigdomènech Badia, Gavin Buttimore, Charlie Deck, Joel Z. Leibo, Charles Blundell: Generalization of Reinforcement Learners with...
Generalization of Reinforcement Learners with Working and Episodic Memory.
Meire Fortunato, Melissa Tan, Ryan Faulkner, Steven Hansen, Adrià Puigdomènech Badia, Gavin Buttimore, Charlie Deck, Joel Z. Leibo, Charles Blundell: Generalization of Reinforcement Learners with...
Agent57: Outperforming the Atari Human Benchmark.
Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, Charles Blundell: Agent57: Outperforming the Atari Human Benchmark. CoRR abs/2003.13350...
Never Give Up: Learning Directed Exploration Strategies.
Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell:...
MEMO: A Deep Network for Flexible Combination of Episodic Memories.
Andrea Banino, Adrià Puigdomènech Badia, Raphael Köster, Martin J. Chadwick, Vinícius Flores Zambaldi, Demis Hassabis, Caswell Barry, Matthew M. Botvinick, Dharshan Kumaran, Charles Blundell: MEMO: A...
Agent57: Outperforming the Atari Human Benchmark.
Adrià Puigdomènech Badia, Bilal Piot, Steven Kapturowski, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, Charles Blundell: Agent57: Outperforming the Atari Human Benchmark. ICML 2020: 507-517
MEMO: A Deep Network for Flexible Combination of Episodic Memories.
Andrea Banino, Adrià Puigdomènech Badia, Raphael Köster, Martin J. Chadwick, Vinícius Flores Zambaldi, Demis Hassabis, Caswell Barry, Matthew M. Botvinick, Dharshan Kumaran, Charles Blundell: MEMO: A...
Never Give Up: Learning Directed Exploration Strategies.
Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Zhaohan Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martín Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell:...
CoBERL: Contrastive BERT for Reinforcement Learning.
Andrea Banino, Adrià Puigdomènech Badia, Jacob C. Walker, Tim Scholtes, Jovana Mitrovic, Charles Blundell: CoBERL: Contrastive BERT for Reinforcement Learning. CoRR abs/2107.05431 (2021)
Coverage as a Principle for Discovering Transferable Behavior in...
Víctor Campos, Pablo Sprechmann, Steven Hansen, André Barreto, Steven Kapturowski, Alex Vitvitskyi, Adrià Puigdomènech Badia, Charles Blundell: Coverage as a Principle for Discovering Transferable...
Human-level Atari 200x faster.
Steven Kapturowski, Víctor Campos, Ray Jiang, Nemanja Rakicevic, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia: Human-level Atari 200x faster. CoRR abs/2209.07550 (2022)
The CLRS Algorithmic Reasoning Benchmark.
Petar Velickovic, Adrià Puigdomènech Badia, David Budden, Razvan Pascanu, Andrea Banino, Misha Dashevskiy, Raia Hadsell, Charles Blundell: The CLRS Algorithmic Reasoning Benchmark. CoRR abs/2205.15659...
Retrieval-Augmented Reinforcement Learning.
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Ksenia Konyushkova, Michal Valko, Simon Osindero, Timothy P....
The CLRS Algorithmic Reasoning Benchmark.
Petar Velickovic, Adrià Puigdomènech Badia, David Budden, Razvan Pascanu, Andrea Banino, Misha Dashevskiy, Raia Hadsell, Charles Blundell: The CLRS Algorithmic Reasoning Benchmark. ICML 2022: 22084-22102
Retrieval-Augmented Reinforcement Learning.
Anirudh Goyal, Abram L. Friesen, Andrea Banino, Theophane Weber, Nan Rosemary Ke, Adrià Puigdomènech Badia, Arthur Guez, Mehdi Mirza, Peter C. Humphreys, Ksenia Konyushkova, Michal Valko, Simon...
CoBERL: Contrastive BERT for Reinforcement Learning.
Andrea Banino, Adrià Puigdomènech Badia, Jacob C. Walker, Tim Scholtes, Jovana Mitrovic, Charles Blundell: CoBERL: Contrastive BERT for Reinforcement Learning. ICLR 2022
Human-level Atari 200x faster.
Steven Kapturowski, Víctor Campos, Ray Jiang, Nemanja Rakicevic, Hado van Hasselt, Charles Blundell, Adrià Puigdomènech Badia: Human-level Atari 200x faster. ICLR 2023