Kirill Semenov

@kiryukhasemenov

PhD student at the University of Zurich. Trying to get to know what LLMs know🤔

53 Followers · 48 Following · 5 Posts · Joined 05.06.2025
Latest posts by Kirill Semenov @kiryukhasemenov

A spectre is haunting NLP...

(from arxiv.org/pdf/2408.01416)

09.02.2026 15:42 👍 1 🔁 0 💬 0 📌 0

Let's meet at #EMNLP and talk about multilingual knowledge benchmarks!

⚠️MLAMA is full of disfluent sentences
❓Reason: templated translation
💡Simple full-sentence translation improves factual retrieval by up to 25%
🙌Remember to check your benchmarks with speakers!

Link: arxiv.org/pdf/2510.15115

28.10.2025 21:09 👍 1 🔁 1 💬 0 📌 0

🎉 Terminology Shared Task @WMT25: Paper Out 🎉
Highlights:
- sentence translation seems solvable, document translation is still challenging
- better systems benefit more from proper terminologies
- term-based metrics correlate poorly with general translation quality

www2.statmt.org/wmt25/pdf/20...

24.10.2025 19:57 👍 2 🔁 2 💬 0 📌 0
GitHub - Kiryukhasemenov/InFlags: Python package for dictionary-based inline tokenization preprocessing

Our paper at TokShop

InCa and InDia: more stable and interpretable tokenizer preprocessing that handles casing and diacritization!

Check out our:
💻package: github.com/Kiryukhaseme...
🎥video: www.youtube.com/watch?v=XgDP...
📝paper: openreview.net/pdf?id=9GwVW...

25.07.2025 12:54 👍 1 🔁 0 💬 0 📌 0
Terminology Translation Task

📣Take part in the 3rd Terminology Shared Task @WMT!📣
This year:
👉5 language pairs: EN->{ES, RU, DE, ZH},
👉2 tracks - sentence-level and doc-level translation,
👉authentic data from 2 domains: finance and IT!

www2.statmt.org/wmt25/termin...

Don't miss the opportunity - we only do this once every two years😏

06.06.2025 15:54 👍 3 🔁 2 💬 0 📌 2