Using Web Images & Natural Language for Object Localization in a Robotics Environment
dc.contributor.advisor | Matuszek, Cynthia | |
dc.contributor.author | Rokisky, Justin Douglass | |
dc.contributor.department | Computer Science and Electrical Engineering | |
dc.contributor.program | Computer Science | |
dc.date.accessioned | 2021-09-01T13:55:24Z | |
dc.date.available | 2021-09-01T13:55:24Z | |
dc.date.issued | 2020-01-20 | |
dc.description.abstract | Letting humans interact with robots through language would make human-robot interaction more natural. To this end, I introduce a novel approach that allows robots to localize objects from an unbounded set of classes given only a description of the target object. The first part of this work is a performance analysis of current state-of-the-art object detectors and a region proposal approach \cite{UijlingsIJCV2013} on the Autonomous Robot Indoor Dataset \cite{arid}. The second part introduces a three-stage, natural-language-guided webly object localization approach and the experiments used to evaluate its performance. The first stage generates a webly dataset from a human description of the target object, without any manual curation. The second stage uses the webly dataset to train a binary classifier for the target object. In the third stage, region proposals from selective search \cite{UijlingsIJCV2013} are input to the webly supervised binary classifier, and the region proposal with the highest confidence score is returned as the prediction. | |
dc.format | application:pdf | |
dc.genre | theses | |
dc.identifier | doi:10.13016/m2vkrv-8xtp | |
dc.identifier.other | 12291 | |
dc.identifier.uri | http://hdl.handle.net/11603/22833 | |
dc.language | en | |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department Collection | |
dc.relation.ispartof | UMBC Theses and Dissertations Collection | |
dc.relation.ispartof | UMBC Graduate School Collection | |
dc.relation.ispartof | UMBC Student Collection | |
dc.source | Original File Name: Rokisky_umbc_0434M_12291.pdf | |
dc.subject | Computer Vision | |
dc.subject | Natural Language | |
dc.subject | Object Localization | |
dc.subject | Robotics | |
dc.subject | Webly Supervised | |
dc.title | Using Web Images & Natural Language for Object Localization in a Robotics Environment | |
dc.type | Text | |
dcterms.accessRights | Distribution Rights granted to UMBC by the author. | |
dcterms.accessRights | This item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu |