Translation with LLMs through Prompting with Long-Form Context

dc.contributor.author: Ashqar, Huthaifa
dc.contributor.author: Tami, Mohammad
dc.date.accessioned: 2025-10-16T15:27:08Z
dc.date.issued: 2025-08-02
dc.description: ACL 2023, 61st Annual Meeting of the Association for Computational Linguistics, July 9th - 14th, 2023, Toronto, Canada
dc.description.abstract: Stable generation of text in low-resource languages is an unsolved issue in large language models. While Large Language Models (LLMs) can often produce good translations despite not being explicitly trained for this task, this does not hold for low-resource languages. LLMs are more likely to generate off-target text (text in a language other than the intended one) when prompted to translate into a low-resource language, and they show increased instability in translation quality across prompt templates for low-resource languages. This study implemented a prompting method that prepends monolingual text in the target language and used a context- and topic-aware few-shot machine translation (MT) approach. We quantified these methods for low-, mid-, and high-resource languages using OpenAI GPT-4o-mini and Google Gemini-1.5-flash. Gemini results showed that the use of context- and topic-aware few-shot MT (CTAFSMT) significantly boosted performance for all three language categories. However, this was not consistently observed in the case of ChatGPT. It was found that the significance of the results depended on the language itself rather than on its resource level. This study is part of Stanford University's meta-study on whether LLMs can generate novel research ideas. The code, prompts, and results of the study can be found at https://github.com/HuthaifaAshqar/Translationwith-LLMs
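The prompting setup described in the abstract can be sketched as follows. This is an illustrative reconstruction, not the authors' released code (that is linked in the record above); the function name, example sentences, topic label, and the Swahili context line are all hypothetical placeholders.

```python
# Illustrative sketch of a context- and topic-aware few-shot MT (CTAFSMT)
# prompt of the kind the abstract describes: monolingual target-language
# context is prepended, a topic is stated, and few-shot example pairs
# precede the sentence to translate. All concrete strings are hypothetical.

def build_ctafsmt_prompt(src_lang, tgt_lang, topic, context, few_shots, source_sentence):
    """Assemble a translation prompt with prepended monolingual context,
    a topic hint, and few-shot (source, target) example pairs."""
    lines = [
        f"Target-language context ({tgt_lang}): {context}",
        f"Topic: {topic}",
        f"Translate from {src_lang} to {tgt_lang}. Examples:",
    ]
    for src, tgt in few_shots:
        lines.append(f"{src_lang}: {src}")
        lines.append(f"{tgt_lang}: {tgt}")
    # The sentence to translate, left open for the model to complete.
    lines.append(f"{src_lang}: {source_sentence}")
    lines.append(f"{tgt_lang}:")
    return "\n".join(lines)

prompt = build_ctafsmt_prompt(
    src_lang="English",
    tgt_lang="Swahili",
    topic="weather",
    context="Mvua inanyesha leo.",  # hypothetical monolingual context
    few_shots=[("Good morning.", "Habari za asubuhi.")],
    source_sentence="It is raining today.",
)
print(prompt)
```

The assembled string would then be sent as a user message to a chat model such as GPT-4o-mini or Gemini-1.5-flash via the respective API.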
dc.description.sponsorship: We would like to acknowledge that the original idea was submitted by Elizabeth Salesky from Google DeepMind for the purpose of this study, which is part of the funded Stanford University's meta-study on whether LLMs can generate novel research ideas.
dc.description.uri: https://www.authorea.com/users/884736/articles/1318311-translation-with-llms-through-prompting-with-long-form-context
dc.format.extent: 7 pages
dc.genre: conference papers and proceedings
dc.genre: preprints
dc.identifier: doi:10.13016/m2iz7n-bfyh
dc.identifier.uri: https://doi.org/10.36227/techrxiv.175416071.14800516/v1
dc.identifier.uri: http://hdl.handle.net/11603/40436
dc.language.iso: en
dc.relation.isAvailableAt: The University of Maryland, Baltimore County (UMBC)
dc.relation.ispartof: UMBC Data Science
dc.rights: Attribution 4.0 International
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: Translation with LLMs through Prompting with Long-Form Context
dc.type: Text
dcterms.creator: https://orcid.org/0000-0002-6835-8338

Files

Original bundle

Name: ACL_2023_Proceedings_LLMsTranslation_Preprint.pdf
Size: 176.47 KB
Format: Adobe Portable Document Format