Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems
dc.contributor.author | Elhenawy, Mohammed | |
dc.contributor.author | Abdelhay, Ahmed | |
dc.contributor.author | Alhadidi, Taqwa I. | |
dc.contributor.author | Ashqar, Huthaifa | |
dc.contributor.author | Jaradat, Shadi | |
dc.contributor.author | Jaber, Ahmed | |
dc.contributor.author | Glaser, Sebastien | |
dc.contributor.author | Rakotonirainy, Andry | |
dc.date.accessioned | 2024-10-28T14:31:09Z | |
dc.date.available | 2024-10-28T14:31:09Z | |
dc.date.issued | 2024-06-11 | |
dc.description.abstract | Multimodal Large Language Models (MLLMs) have demonstrated proficiency in processing di-verse modalities, including text, images, and audio. These models leverage extensive pre-existing knowledge, enabling them to address complex problems with minimal to no specific training examples, as evidenced in few-shot and zero-shot in-context learning scenarios. This paper investigates the use of MLLMs' visual capabilities to 'eyeball' solutions for the Traveling Salesman Problem (TSP) by analyzing images of point distributions on a two-dimensional plane. Our experiments aimed to validate the hypothesis that MLLMs can effectively 'eyeball' viable TSP routes. The results from zero-shot, few-shot, self-ensemble, and self-refine zero-shot evaluations show promising outcomes. We anticipate that these findings will inspire further exploration into MLLMs' visual reasoning abilities to tackle other combinatorial problems. | |
dc.description.uri | https://arxiv.org/abs/2406.06865v1 | |
dc.format.extent | 19 pages | |
dc.genre | journal articles | |
dc.genre | preprints | |
dc.identifier | doi:10.13016/m2pojb-6umd | |
dc.identifier.uri | https://doi.org/10.48550/arXiv.2406.06865 | |
dc.identifier.uri | http://hdl.handle.net/11603/36802 | |
dc.language.iso | en_US | |
dc.relation.isAvailableAt | The University of Maryland, Baltimore County (UMBC) | |
dc.relation.ispartof | UMBC Faculty Collection | |
dc.relation.ispartof | UMBC Data Science | |
dc.rights | Attribution 4.0 International | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.title | Eyeballing Combinatorial Problems: A Case Study of Using Multimodal Large Language Models to Solve Traveling Salesman Problems | |
dc.type | Text | |
dcterms.creator | https://orcid.org/0000-0002-6835-8338 |
Files
Original bundle
1 - 1 of 1