Using Web Images & Natural Language for Object Localization in a Robotics Environment

dc.contributor.advisorMatuszek, Cynthia
dc.contributor.authorRokisky, Justin Douglass
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2021-09-01T13:55:24Z
dc.date.available2021-09-01T13:55:24Z
dc.date.issued2020-01-20
dc.description.abstractThe ability for humans to interact with robots via language would allow for more natural interactions between robots and humans. To this end, in this work I introduce a novel approach that allows robots to localize objects from an unbounded set of classes given only a description of a target object. The first part of this work is a performance analysis of current state of the art object detectors and a region proposal approach \cite{UijlingsIJCV2013} on the Autonomous Robot Indoor Dataset \cite{arid}. The second part of this work introduces a three stage natural language guided webly object localization approach and associated experiments to evaluate its performance. The first stage of the approach generates a webly dataset without any manual curation from a human description of the target object. The second stage of the approach uses the webly dataset to train a binary classifier for the target object. Finally, region proposals from selective search \cite{UijlingsIJCV2013} are input to the webly supervised binary classifier and the region proposal with the highest confidence score is returned as the prediction.
dc.formatapplication:pdf
dc.genretheses
dc.identifierdoi:10.13016/m2vkrv-8xtp
dc.identifier.other12291
dc.identifier.urihttp://hdl.handle.net/11603/22833
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.sourceOriginal File Name: Rokisky_umbc_0434M_12291.pdf
dc.subjectComputer Vision
dc.subjectNatural Language
dc.subjectObject Localization
dc.subjectRobotics
dc.subjectWebly Supervised
dc.titleUsing Web Images & Natural Language for Object Localization in a Robotics Environment
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.
dcterms.accessRightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Rokisky_umbc_0434M_12291.pdf
Size:
5.8 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Rokisky-Justin_Open.pdf
Size:
120.51 KB
Format:
Adobe Portable Document Format
Description: