Automated Virtual Product Placement and Assessment in Images using Diffusion Models

dc.contributor.author: Alam, Mohammad Mahmudul
dc.contributor.author: Sokhandan, Negin
dc.contributor.author: Goodman, Emmett
dc.date.accessioned: 2024-05-29T14:38:03Z
dc.date.available: 2024-05-29T14:38:03Z
dc.date.issued: 2024-05-02
dc.description: 6th AI for Content Creation (AI4CC) workshop at CVPR 2024
dc.description.abstract: In Virtual Product Placement (VPP) applications, the discrete integration of specific brand products into images or videos has emerged as a challenging yet important task. This paper introduces a novel three-stage fully automated VPP system. In the first stage, a language-guided image segmentation model identifies optimal regions within images for product inpainting. In the second stage, Stable Diffusion (SD), fine-tuned with a few example product images, is used to inpaint the product into the previously identified candidate regions. The final stage introduces an ‘Alignment Module’, which is designed to effectively sieve out low-quality images. Comprehensive experiments demonstrate that the Alignment Module ensures the presence of the intended product in every generated image and enhances the average quality of images by 35%. The results presented in this paper demonstrate the effectiveness of the proposed VPP system, which holds significant potential for transforming the landscape of virtual advertising and marketing strategies.
dc.description.uri: http://arxiv.org/abs/2405.01130
dc.format.extent: 9 pages
dc.genre: conference papers and proceedings
dc.genre: postprints
dc.identifier: doi:10.13016/m2zohj-kwd1
dc.identifier.uri: https://doi.org/10.48550/arXiv.2405.01130
dc.identifier.uri: http://hdl.handle.net/11603/34295
dc.language.iso: en_US
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Student Collection
dc.relation.ispartof: UMBC Computer Science and Electrical Engineering Department
dc.rights: CC BY 4.0 DEED Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.subject: Computer Science - Computer Vision and Pattern Recognition
dc.title: Automated Virtual Product Placement and Assessment in Images using Diffusion Models
dc.type: Text
dcterms.creator: https://orcid.org/0009-0004-3054-5914

Files

Original bundle

Name: 2405.01130v1.pdf
Size: 3.54 MB
Format: Adobe Portable Document Format