Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models
| dc.contributor.author | Baker, James | |
| dc.date.accessioned | 2024-11-14T15:18:37Z | |
| dc.date.available | 2024-11-14T15:18:37Z | |
| dc.date.issued | 2024-10-02 | |
| dc.description.abstract | Teaching text-to-image models to be creative involves using style ambiguity loss. In this work, we explore using the style ambiguity training objective, used to approximate creativity, on a diffusion model. We then experiment with forms of style ambiguity loss that do not require training a classifier or a labeled dataset, and find that the models trained with style ambiguity loss can generate better images than the baseline diffusion models and GANs. Code is available at https://github.com/jamesBaker361/clipcreate. | |
| dc.description.uri | http://arxiv.org/abs/2410.02055 | |
| dc.format.extent | 27 pages | |
| dc.genre | journal articles | |
| dc.genre | preprints | |
| dc.identifier | doi:10.13016/m26cjd-odma | |
| dc.identifier.uri | https://doi.org/10.48550/arXiv.2410.02055 | |
| dc.identifier.uri | http://hdl.handle.net/11603/36943 | |
| dc.language.iso | en_US | |
| dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
| dc.relation.ispartof | UMBC Student Collection | |
| dc.relation.ispartof | UMBC Computer Science and Electrical Engineering Department | |
| dc.rights | Attribution 4.0 International CC BY 4.0 Deed | |
| dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | Computer Science - Computer Vision and Pattern Recognition | |
| dc.title | Using Style Ambiguity Loss to Improve Aesthetics of Diffusion Models | |
| dc.type | Text |
Files
Original bundle
1 - 1 of 1
