Towards Deployment of Computer Vision Neural Networks for Scene Understanding

dc.contributor.advisorOates, Tim
dc.contributor.authorHumes, Edward Steven
dc.contributor.departmentComputer Science and Electrical Engineering
dc.contributor.programComputer Science
dc.date.accessioned2025-09-24T14:07:05Z
dc.date.issued2025-01-01
dc.description.abstractScene understanding is a cornerstone of autonomous operation for robotics and edge computing platforms. However, deploying advanced computer vision neural networks on these platforms presents two central challenges: the need for vast amounts of meticulously labeled training data, and the stringent energy and compute constraints imposed by embedded hardware. Meeting these requirements demands models that achieve both high accuracy and efficiency, balancing performance with limited latency, memory, and power budgets. This thesis addresses both of these barriers to real-world deployment. First, we propose a novel synthetic-to-real domain adaptation framework that substantially reduces the need for large volumes of labeled real-world data, enabling effective image segmentation and robust scene understanding with minimal annotation effort. Second, we introduce Squeezed Edge YOLO, a lightweight object detector architecture specifically designed to operate within the tight latency and energy budgets of edge computing platforms. Both the domain adaptation framework and the object detector demonstrate strong empirical performance. Our domain adaptation approach is validated on the challenging synthetic-to-real ”SYNTHIA → Cityscapes” and ”GTAV → Cityscapes” benchmarks, where we outperform the previous state of the art, HALO. To evaluate Squeezed Edge YOLO, we deploy it on a nano-UAV and collect real-world measurements, achieving real-time object detection at approximately 8 inferences per second with low power consumption. Together, these contributions advance the deployment of deep neural scene understanding on resource constrained robotic and edge platforms.
dc.formatapplication:pdf
dc.genrethesis
dc.identifierdoi:10.13016/m2dfxs-kco3
dc.identifier.other13096
dc.identifier.urihttp://hdl.handle.net/11603/40260
dc.languageen
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department Collection
dc.relation.ispartofUMBC Theses and Dissertations Collection
dc.relation.ispartofUMBC Graduate School Collection
dc.relation.ispartofUMBC Student Collection
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by UMBC for non-commercial research and education. For permission to publish or reproduce, please see http://aok.lib.umbc.edu/specoll/repro.php or contact Special Collections at speccoll(at)umbc.edu
dc.sourceOriginal File Name: Humes_umbc_0434M_13096.pdf
dc.subjectedge
dc.subjectmachine learning
dc.subjectsyn2real
dc.subjecttinyml
dc.titleTowards Deployment of Computer Vision Neural Networks for Scene Understanding
dc.typeText
dcterms.accessRightsDistribution Rights granted to UMBC by the author.

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Humes_umbc_0434M_13096.pdf
Size:
14.73 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Humes-Edward_Open.pdf
Size:
532.96 KB
Format:
Adobe Portable Document Format
Description: