Matthias Hagen's Avatar

Matthias Hagen

@matthias-hagen

Professor of "Databases and Information Systems" at Friedrich-Schiller-Universität Jena, Germany, and member of @webis.de. Research in information retrieval and natural language processing.

246
Followers
262
Following
1
Posts
15.11.2024
Joined
Posts Following

Latest posts by Matthias Hagen @matthias-hagen

Congrats, Leonie!

11.12.2025 22:00 👍 1 🔁 0 💬 1 📌 0
Post image

Honored to win the ICTIR Best Paper Honorable Mention Award for "Axioms for Retrieval-Augmented Generation"!
Our new axioms are integrated with ir_axioms: github.com/webis-de/ir_...
Nice to see axiomatic IR gaining momentum.

18.07.2025 14:18 👍 16 🔁 6 💬 1 📌 0
Post image Post image

Now @fschlatt.bsky.social presents "TITE: Token-Independent Text Encoder for Information Retrieval" at #SIGIR2025

Paper: webis.de/publications...

16.07.2025 09:08 👍 8 🔁 3 💬 0 📌 0
Post image

Want to know how to make bi-encoders more than 3x faster with a new backbone encoder model? Check out our talk on the Token-Independent Text Encoder (TITE) #SIGIR2025 in the efficiency track. It pools vectors within the model to improve efficiency dl.acm.org/doi/10.1145/...

16.07.2025 07:28 👍 10 🔁 5 💬 0 📌 0
Post image

Happy to share that our paper "The Viability of Crowdsourcing for RAG Evaluation" received the Best Paper Honourable Mention at #SIGIR2025! Very grateful to the community for recognizing our work on improving RAG evaluation.

 📄 webis.de/publications...

16.07.2025 21:04 👍 27 🔁 10 💬 2 📌 1
Post image

Thank you Carlos for the shout-out of Lightning IR in the LSR tutorial at #SIGIR2025

If you want to fine your own LSR models, check out our framework at github.com/webis-de/lig...

13.07.2025 14:41 👍 7 🔁 5 💬 0 📌 0
Dory from finding nemo with the quote: "I remember it like it was yesterday. Of course, I dont remember yesterday."

Dory from finding nemo with the quote: "I remember it like it was yesterday. Of course, I dont remember yesterday."

Do not forget to participate in the #TREC2025 Tip-of-the-Tongue (ToT) Track :)

The corpus and baselines (with run files) are now available and easily accessible via the ir_datasets API and the HuggingFace Datasets API.

More details are available at: trec-tot.github.io/guidelines

27.06.2025 14:46 👍 11 🔁 7 💬 0 📌 0

🧵 4/4 The shared task continues the research on LLM-based advertising. Participants can submit systems for two sub-tasks: First, generate responses with and without ads. Second, classify whether a response contains an ad.
Submissions are open until May 10th and we look forward to your contributions.

30.04.2025 11:17 👍 2 🔁 1 💬 0 📌 0
Post image

🧵 3/4 In a lot of cases, survey participants did not notice brand or product placements in the responses. As a first step towards ad-blockers for LLMs, we created a dataset of responses with and without ads and trained classifiers on the task of identifying the ads.
dl.acm.org/doi/10.1145/...

30.04.2025 11:17 👍 3 🔁 1 💬 1 📌 0
Post image

🧵 2/4 Given the high operating costs of LLMs, they require a business model to sustain them and advertising is a natural candidate.
Hence, we have analyzed how well LLMs can blend product placements with "organic" responses and whether users are able to identify the ads.
dl.acm.org/doi/10.1145/...

30.04.2025 11:17 👍 2 🔁 1 💬 1 📌 0
Post image

Can LLM-generated ads be blocked? With OpenAI adding shopping options to ChatGPT, this question gains further importance.
If you are interested in contributing to the research on LLM-based advertising, please check out our shared task: touche.webis.de/clef25/touch...

More details below.

30.04.2025 11:17 👍 8 🔁 5 💬 1 📌 1
Post image Post image Post image Post image

The Workshop on Open Web Search at #ECIR2025 just starts with a keynote by @claclarke.bsky.social on Annotative Indexing. #WOWS25 #WOWS2025 #ECIR25

10.04.2025 07:16 👍 10 🔁 5 💬 0 📌 0
Post image Post image Post image Post image

The Workshop on Open Web Search just finished #WOWS2025 #ECIR2025.

It was a very cool experience with many interesting talks. Lets hope we can do it again next year at #ECIR2026 in Delft :)

10.04.2025 15:05 👍 8 🔁 5 💬 0 📌 0
Post image

Today I had the pleasure to talk about child-safe search at #ECIR2025. We created an cranfield-style evaluation dataset to contrast relevance with harm in web search scenarios.

Details: webis.de/publications...

10.04.2025 15:14 👍 6 🔁 2 💬 0 📌 0
Post image Post image Post image Post image

Now we have @fschlatt.bsky.social on the #ECIR2025 stage predenting the research on the Set-Encoder.

The paper is online at: webis.de/publications...

09.04.2025 08:00 👍 9 🔁 3 💬 0 📌 1
Webis Publications Publications by the Webis group

Short Paper: Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking webis.de/publications...

Full Paper: Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders webis.de/publications...

09.04.2025 12:37 👍 4 🔁 2 💬 0 📌 0
Post image

Honored to receive the best short paper award and best paper honourable mention award at #ECIR2025. Thank you to all co-authors @maik-froebe.bsky.social, @hscells.bsky.social, Shengyao Zhuang, @bevankoopman.bsky.social, Guido Zuccon, Benno Stein, @martin-potthast.com, @matthias-hagen.bsky.social 🥳

09.04.2025 12:36 👍 17 🔁 4 💬 1 📌 0
Post image Post image Post image Post image

I was very happy to talk about corpus subsampling at #ECIR2025 today.

Please find the paper at webis.de/publications...

And lat bur not least, here are some of my favorite impressions of the first day of ECIR :)

07.04.2025 22:30 👍 8 🔁 2 💬 0 📌 0
Post image

📢 Our paper "The Viability of Crowdsourcing for RAG Evaluation" has been accepted to #SIGIR2025 !
We compared how good humans and LLMs are at writing and judging RAG responses, assembling 1800+ responses across 3 styles, and 47K+ pairwise judgments in 7 quality dimensions. 🧵➡️

07.04.2025 15:33 👍 12 🔁 7 💬 1 📌 0

Interested in joining our research group or do you know someone who might be interested?
We have a new vacancy: Research position at the Webis group on Watermarking for Large Language Models.
More information:
webis.de/for-students...

17.02.2025 08:55 👍 7 🔁 4 💬 0 📌 0