Halluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships

dc.contributor.authorKulkarni, Kuldeep
dc.contributor.authorGokhale, Tejas
dc.contributor.authorSingh, Rajhans
dc.contributor.authorTuraga, Pavan
dc.contributor.authorSankaranarayanan, Aswin
dc.date.accessioned2025-06-05T14:02:32Z
dc.date.available2025-06-05T14:02:32Z
dc.date.issued2021-05-21
dc.descriptionAI for Content Creation Workshop @CVPR 2021
dc.description.abstractRecently, there has been substantial progress in image synthesis from semantic labelmaps. However, methods used for this task assume the availability of complete and unambiguous labelmaps, with instance boundaries of objects, and class labels for each pixel. This reliance on heavily annotated inputs restricts the application of image synthesis techniques to real-world applications, especially under uncertainty due to weather, occlusion, or noise. On the other hand, algorithms that can synthesize images from sparse labelmaps or sketches are highly desirable as tools that can guide content creators and artists to quickly generate scenes by simply specifying locations of a few objects. In this paper, we address the problem of complex scene completion from sparse labelmaps. Under this setting, very few details about the scene (30\% of object instances) are available as input for image synthesis. We propose a two-stage deep network based method, called `Halluci-Net', that learns co-occurence relationships between objects in scenes, and then exploits these relationships to produce a dense and complete labelmap. The generated dense labelmap can then be used as input by state-of-the-art image synthesis techniques like pix2pixHD to obtain the final image. The proposed method is evaluated on the Cityscapes dataset and it outperforms two baselines methods on performance metrics like Fr\'echet Inception Distance (FID), semantic segmentation accuracy, and similarity in object co-occurrences. We also show qualitative results on a subset of ADE20K dataset that contains bedroom images.
dc.description.sponsorshipThis work was supported by a gift from Adobe Inc. to the Geometric Media Lab. The work of KK, TG, and AS was supported by ARO Grant W911NF-16-1-0441
dc.description.urihttp://arxiv.org/abs/2004.08614
dc.format.extent10 pages
dc.genreconference papers and proceedings
dc.genrepostprints
dc.identifierdoi:10.13016/m2pm6b-mf8t
dc.identifier.urihttps://doi.org/10.48550/arXiv.2004.08614
dc.identifier.urihttp://hdl.handle.net/11603/38538
dc.language.isoen_US
dc.relation.isAvailableAtThe University of Maryland, Baltimore County (UMBC)
dc.relation.ispartofUMBC Computer Science and Electrical Engineering Department
dc.rightsAttribution 4.0 International
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/deed.en
dc.subjectComputer Science - Machine Learning
dc.subjectComputer Science - Computer Vision and Pattern Recognition
dc.titleHalluci-Net: Scene Completion by Exploiting Object Co-occurrence Relationships
dc.typeText
dcterms.creatorhttps://orcid.org/0000-0002-5593-2804

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
200408614v2.pdf
Size:
9 MB
Format:
Adobe Portable Document Format