Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
dc.contributor.author | Mousavian, Arsalan | |
dc.contributor.author | Pirsiavash, Hamed | |
dc.contributor.author | Košecká, Jana | |
dc.date.accessioned | 2019-07-01T18:05:07Z | |
dc.date.available | 2019-07-01T18:05:07Z | |
dc.date.issued | 2016-12-19 | |
dc.description.abstract | Multi-scale deep CNNs have been used successfully for problems that map each pixel to a label, such as depth estimation and semantic segmentation. It has also been shown that such architectures are reusable and can be applied to multiple tasks. These networks are typically trained independently for each task by varying the output layer(s) and training objective. In this work, we present a new model for simultaneous depth estimation and semantic segmentation from a single RGB image. Our approach demonstrates the feasibility of training parts of the model for each task and then fine-tuning the full, combined model on both tasks simultaneously using a single loss function. Furthermore, we couple the deep CNN with a fully connected CRF, which captures the contextual relationships and interactions between the semantic and depth cues, improving the accuracy of the final results. The proposed model is trained and evaluated on the NYUDepth V2 dataset [23], outperforming the state-of-the-art methods on semantic segmentation and achieving comparable results on the task of depth estimation. | en_US |
dc.description.sponsorship | We also acknowledge support from NSF NRI grant 1527208. Some of the experiments were run on ARGO, a research computing cluster provided by the Office of Research Computing at George Mason University. | en_US |
dc.description.uri | https://ieeexplore.ieee.org/abstract/document/7785137 | en_US |
dc.format.extent | 9 pages | en_US |
dc.genre | conference papers and proceedings preprints | en_US |
dc.identifier | doi:10.13016/m2uswx-e8bj | |
dc.identifier.citation | Arsalan Mousavian, et al., Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks, 2016 Fourth International Conference on 3D Vision (3DV), DOI: 10.1109/3DV.2016.69 | en_US |
dc.identifier.uri | https://doi.org/10.1109/3DV.2016.69 | |
dc.identifier.uri | http://hdl.handle.net/11603/14328 | |
dc.language.iso | en_US | en_US |
dc.publisher | IEEE | en_US |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department Collection | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.rights | This item is likely protected under Title 17 of the U.S. Copyright Law. Unless on a Creative Commons license, for uses protected by Copyright Law, contact the copyright holder or the author. | |
dc.rights | © 2016 IEEE | |
dc.subject | convolution | en_US |
dc.subject | image segmentation | en_US |
dc.subject | learning (artificial intelligence) | en_US |
dc.subject | neural nets | en_US |
dc.subject | joint semantic segmentation | en_US |
dc.subject | deep convolutional networks | en_US |
dc.subject | multiscale deep CNNs | en_US |
dc.subject | pixel mapping | en_US |
dc.subject | independent training | en_US |
dc.subject | contextual relationships | en_US |
dc.title | Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks | en_US |
dc.type | Text | en_US |