Your First 10 Rides Discounted 🚕
Use "WEB" as your referral code, for a free 50 Toko bonus.
Our experience of translating over 140 Albanian words with the newest LLMs. Uncover insights on translation nuances and AI performance.
Our team at Patoko, based in Tirana, wanted to see how four state-of-the-art language models handle common Albanian words.
We selected over 140 words, aiming to cover a broad range of everyday usage. For each word, we ran the same prompt through different API providers. We first asked for a direct translation of the word, then asked the model to craft a sentence using that word. The results highlight where each model excels and where it falls short. Want to try for yourself? Try this for English, and this for other languages.
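For readers who want to reproduce the procedure, here is a minimal sketch of the two-step loop described above. It is not our production script: the prompt wording, the English-to-Albanian direction, and the names `evaluate_word`, `evaluate_all`, and `stub_model` are all assumptions made for illustration. Each "model" is simply a function that takes a prompt and returns text, so you can plug in whichever API client you use.

```python
from typing import Callable, Dict, List

# Hypothetical prompt wording; the exact prompts are not published in this post.
TRANSLATE_PROMPT = "Translate the English word '{word}' into Albanian. Reply with the translation only."
SENTENCE_PROMPT = "Write one short Albanian sentence that uses '{translation}'."

# A "model" is any function that takes a prompt string and returns the reply text.
Model = Callable[[str], str]


def evaluate_word(word: str, models: Dict[str, Model]) -> List[dict]:
    """Send the same two prompts for one word to every model and collect the replies."""
    rows = []
    for name, ask in models.items():
        # Step 1: ask for a direct translation.
        translation = ask(TRANSLATE_PROMPT.format(word=word)).strip()
        # Step 2: ask for a sentence that uses that translation.
        sentence = ask(SENTENCE_PROMPT.format(translation=translation)).strip()
        rows.append(
            {"word": word, "model": name, "translation": translation, "sentence": sentence}
        )
    return rows


def evaluate_all(words: List[str], models: Dict[str, Model]) -> List[dict]:
    """Run every word through every model, keeping one row per (word, model) pair."""
    results = []
    for word in words:
        results.extend(evaluate_word(word, models))
    return results


def stub_model(prompt: str) -> str:
    """Offline stand-in for a real API call so the sketch runs without network access."""
    if prompt.startswith("Translate"):
        return " faleminderit "
    return "Faleminderit shumë!"


if __name__ == "__main__":
    rows = evaluate_all(["thank you"], {"model_a": stub_model, "model_b": stub_model})
    for row in rows:
        print(row)
```

Keeping the prompts fixed and swapping only the model function is what makes the outputs comparable across providers. To test real models, replace `stub_model` with a wrapper around your provider's client.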
Albanian is spoken by millions but is often underrepresented in AI datasets. Its grammatical rules and cultural nuances offer a strong test for any language model. By focusing on Albanian, we get a clearer sense of how these models might work (or struggle) in real-world tasks that require more inclusive language support. Consider this version 1 of our LLM leaderboard.
Here’s a table that breaks down how each model performed on each word, for both the translation and the sentence it created. We encourage you to explore this table to see the detailed differences:
Category | Words |
---|---|
Greetings and Courtesies | thank you, please, sir |
Common Verbs | is, be, have, do, think, go, see, look, come, wait, says |
Pronouns | I, you, me, it, they, we, she, who |
Adjectives | good, bad, fair, ready, fast |
Adverbs | really, just, well, beautifully, never |
Prepositions | by, from, within, outside, after, before |
Conjunctions | and, or, because, thus |
Time-related Words (Great for Taxis) | today, day, times, then, last |
Quantifiers and Determiners | all, any, more, less, lot, some |
Question Words | what, whether |
Interjections and Expressions | damn, come on, heck, forgive |
Miscellaneous Nouns | people, man, country, church, houses, age, lord, providence |
Other Common Words | this, that, there, here, up, down, front, still, maybe, only, everything, nothing, something |
Accurate handling of Albanian by language models matters for connectivity. It helps in education, translation services, and daily life. We’re working to close the gap between languages for visitors to Tirana, along with many other places in the world.
If you plan to rely on one of these models longer term, weigh our test results against your translation needs. If you translate business documents, look at how each model handled formal vocabulary and complex sentences. For social media content, check their performance with colloquial terms and idioms. Consider your budget alongside your accuracy needs: GPT-4o Mini is fairly cheap per million tokens, while DeepSeek v3 shows stronger results with technical terms at a higher cost. For basic conversations, a model that excels at common vocabulary might work well, even if it struggles with specialized terms.
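To put the budget side into numbers, here is a rough back-of-the-envelope sketch. The function name `estimated_cost_usd`, the token count per request, and the price used in the example are placeholders, not real provider prices or figures from our tests. Check each provider's current pricing page, and note that many providers charge different rates for input and output tokens, which this sketch ignores.

```python
def estimated_cost_usd(
    num_requests: int,
    tokens_per_request: int,
    price_per_million_tokens: float,
) -> float:
    """Estimate spend from a flat per-million-token price (input and output lumped together)."""
    total_tokens = num_requests * tokens_per_request
    return total_tokens / 1_000_000 * price_per_million_tokens


if __name__ == "__main__":
    # Placeholder values: 140 words x 2 prompts each, an assumed 60 tokens per request,
    # and a made-up price of 1.00 USD per million tokens.
    cost = estimated_cost_usd(
        num_requests=280, tokens_per_request=60, price_per_million_tokens=1.00
    )
    print(f"Estimated cost: {cost:.4f} USD")
```

For a word list this small, the per-token price barely matters. It becomes significant only when you translate at volume, which is when the accuracy trade-off deserves a closer look.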
In future work, we plan to add more state-of-the-art models like Claude Sonnet, OpenAI’s o1, Perplexity’s custom models, and others. As more LLMs enter the field, our goal is to expand this testing and maintain an overview of how AI adapts to languages like Albanian. Want to work with us on this? Don’t hesitate to reach out.