Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback

Shah, Rohin; Wang, Steven H.; Wild, Cody; Milani, Stephanie; Kanervisto, Anssi; Goecks, Vinicius G.; Waytowich, Nicholas; Watkins-Valls, David; Prakash, Bharat; Mills, Edmund; Garg, Divyansh; Fries, Alexander; Souly, Alexandra; Chan, Jun Shern; Castillo, Daniel del; Lieberum, Tom

dc.contributor.author	Shah, Rohin
dc.contributor.author	Wang, Steven H.
dc.contributor.author	Wild, Cody
dc.contributor.author	Milani, Stephanie
dc.contributor.author	Kanervisto, Anssi
dc.contributor.author	Goecks, Vinicius G.
dc.contributor.author	Waytowich, Nicholas
dc.contributor.author	Watkins-Valls, David
dc.contributor.author	Prakash, Bharat
dc.contributor.author	Mills, Edmund
dc.contributor.author	Garg, Divyansh
dc.contributor.author	Fries, Alexander
dc.contributor.author	Souly, Alexandra
dc.contributor.author	Chan, Jun Shern
dc.contributor.author	Castillo, Daniel del
dc.contributor.author	Lieberum, Tom
dc.date.accessioned	2022-08-02T21:12:29Z
dc.date.available	2022-08-02T21:12:29Z
dc.date.issued	2022-07
dc.description.abstract	We held the first-ever MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) Competition at the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS 2021). The goal of the competition was to promote research towards agents that use learning from human feedback (LfHF) techniques to solve open-world tasks. Rather than mandating the use of LfHF techniques, we described four tasks in natural language to be accomplished in the video game Minecraft, and allowed participants to use any approach they wanted to build agents that could accomplish the tasks. Teams developed a diverse range of LfHF algorithms across a variety of possible human feedback types. The three winning teams implemented significantly different approaches while achieving similar performance. Interestingly, their approaches performed well on different tasks, validating our choice of tasks to include in the competition. While the outcomes validated the design of our competition, we did not get as many participants and submissions as our sister competition, MineRL Diamond. We speculate about the causes of this problem and suggest improvements for future iterations of the competition.	en_US
dc.description.uri	https://proceedings.mlr.press/v176/shah22a.html	en_US
dc.format.extent	14 pages	en_US
dc.genre	conference papers and proceedings	en_US
dc.identifier	doi:10.13016/m2gbci-wyhb
dc.identifier.citation	Shah, R., Wang, S.H., Wild, C., Milani, S., Kanervisto, A., Goecks, V.G., Waytowich, N., Watkins-Valls, D., Prakash, B., Mills, E., Garg, D., Fries, A., Souly, A., Chan, J.S., del Castillo, D. & Lieberum, T. (2022). Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback. Proceedings of the NeurIPS 2021 Competitions and Demonstrations Track, in Proceedings of Machine Learning Research 176:259-272. Available from https://proceedings.mlr.press/v176/shah22a.html.	en_US
dc.identifier.uri	http://hdl.handle.net/11603/25281
dc.language.iso	en_US	en_US
dc.publisher	Proceedings of Machine Learning Research	en_US
dc.relation.isAvailableAt	The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof	UMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartof	UMBC Student Collection
dc.rights	This work was written as part of one of the author's official duties as an Employee of the United States Government and is therefore a work of the United States Government. In accordance with 17 U.S.C. 105, no copyright protection is available for such works under U.S. Law.	en_US
dc.rights	Public Domain Mark 1.0	*
dc.rights.uri	http://creativecommons.org/publicdomain/mark/1.0/	*
dc.title	Retrospective on the 2021 MineRL BASALT Competition on Learning from Human Feedback	en_US
dc.type	Text	en_US

Files

Original bundle

Name: shah22a.pdf
Size: 1.37 MB
Format: Adobe Portable Document Format

Collections

UMBC Computer Science and Electrical Engineering Department
UMBC Student Collection