Giorgos Tolias (@gtolias)

Let me introduce our new paper: Multimodal Large Language Models as Image Classifiers

❓ Multimodal LLMs are increasingly used for visual tasks, but evaluating their image classification ability has produced conflicting conclusions.

Link: arxiv.org/html/2603.06...

09.03.2026 20:08 👍 11 🔁 3 💬 2 📌 1

We validate EP across diverse pretrained backbones. It complements LoRA tuning and delivers improved object localization through its internal attention maps.

Simple idea. Strong gains. Broad applicability.

arxiv: arxiv.org/pdf/2506.10178

23.02.2026 10:00 👍 3 🔁 0 💬 0 📌 0

EP becomes especially effective when the backbone is pretrained for local representation learning, such as MAE. If your downstream task requires global prediction, EP bridges that gap. MAE-style models can, in fact, excel at global tasks when paired with the right probe.

23.02.2026 10:00 👍 3 🔁 0 💬 1 📌 0

Efficient Probing will be presented at ICLR 2026.

We introduce EP, an attentive probing method that consistently outperforms linear probing and prior attentive approaches. It's a simple, intuitive design that avoids over-parameterization compared to the black-box use of standard components.

23.02.2026 10:00 👍 12 🔁 1 💬 1 📌 0
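
To make the contrast between linear and attentive probing in the post above concrete, here is a minimal sketch in plain Python. It is a generic single-query attention pooling probe, not EP's actual design, which these posts do not specify. All names (`linear_probe`, `attentive_pool`, `attentive_probe`) and the toy numbers are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    # Subtract the max for numerical stability.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def linear_probe(feature, weights, bias):
    """Linear probing: class logits from one global feature vector."""
    return [dot(w, feature) + b for w, b in zip(weights, bias)]


def attentive_pool(patch_features, query):
    """Pool frozen patch features using attention from one learnable query.

    Assumption: a single query and scaled dot-product attention; the
    returned weights play the role of an attention map over patches.
    """
    scale = math.sqrt(len(query))
    attn = softmax([dot(query, p) / scale for p in patch_features])
    dim = len(patch_features[0])
    pooled = [sum(a * p[d] for a, p in zip(attn, patch_features))
              for d in range(dim)]
    return pooled, attn


def attentive_probe(patch_features, query, weights, bias):
    """Attentive probing: attention pooling followed by a linear classifier."""
    pooled, attn = attentive_pool(patch_features, query)
    return linear_probe(pooled, weights, bias), attn


# Toy example: three 2-D patch features from a frozen backbone (made up).
patches = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]
query = [0.0, 4.0]                  # would be learned in practice
weights = [[1.0, 0.0], [0.0, 1.0]]  # would be learned in practice
bias = [0.0, 0.0]
logits, attn = attentive_probe(patches, query, weights, bias)
```

In this toy case the query aligns with the second and third patches, so they receive most of the attention weight and dominate the pooled feature. In a real probe only the query and the classifier would be trained while the backbone stays frozen.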

A new proceedings series has a low chance of being indexed by Scopus/Web of Science from its first year, with the consequence that it is not recognized by some grant agencies, for example in the Czech Republic. I recall the NeurIPS datasets track was not indexed from year 1.

22.02.2026 17:25 👍 0 🔁 0 💬 0 📌 0

03.02.2026 08:28 👍 0 🔁 0 💬 0 📌 0

ČVUT Starting Grant 2026: Call for Proposals - Public web - Czech technical university in Prague

www.cvut.cz/en/cvut-star...

03.02.2026 08:16 👍 1 🔁 0 💬 0 📌 0

The new CTU Rector begins their term in office with strong support for excellence. CTU has just launched a Starting Grant to attract outstanding early‑career researchers who wish to join CTU and establish their own research group. Funding: up to €160k per year for 3 years. Deadline: 30 March 2026.

03.02.2026 08:16 👍 10 🔁 3 💬 2 📌 0

Clarifications for eligibility: 3 papers in total with each one being either a CORE A*/A conference or a journal with IF.

08.01.2026 13:28 👍 1 🔁 0 💬 0 📌 0

Start date is negotiable. Gross salary is 75 000 CZK, plus the possibility of up to 20% extra in bonuses.

08.01.2026 11:11 👍 0 🔁 0 💬 0 📌 0

Postdoctoral research position in Instance-level visual generation Czech Technical University in Prague (CTU) offers a fellowship program, the CTU Global Postdoc Fellowship. This new and attractive two-year fellowship-program offers excellent researchers who have rec...

I have an opening for a two-year post-doc position on instance-level (personalized) visual generation. Eligibility: (i) <=7 years from Ph.D. (ii) studies or 1 year outside of Czechia (iii) >=3 papers in journals with IF or CORE A*/A conferences. Deadline: 15 Feb.
Details: www.euraxess.cz/jobs/399390

08.01.2026 11:11 👍 12 🔁 10 💬 2 📌 1

🚀New task: Instance-level Image+Text→Image Retrieval

🔎Given a query image + an edit (“during night”), retrieve the same specific instance after the change — not just any similar object.

🛢New dataset on HF: i-CIR huggingface.co/datasets/bil...

🔥Download, run, and share results!

06.01.2026 20:00 👍 12 🔁 5 💬 0 📌 0

billpsomas/icir · Datasets at Hugging Face We’re on a journey to advance and democratize artificial intelligence through open source and open science.

📣 i-CIR dataset (NeurIPS 25) is now on @hf.co.

🚀Easier download + better discoverability + WebDataset shards for large-scale use (~750K images).

🤗 Grab it here: huggingface.co/datasets/bil...

#computervision #retrieval #datasets #huggingface #NeurIPS

20.12.2025 18:42 👍 5 🔁 1 💬 0 📌 0

1/n REGLUE Your Latents! 🚀

We introduce REGLUE: a unified framework that entangles VAE latents ➕ Global ➕ Local semantics for faster, higher-fidelity image generation.

Links (paper + code) at the end👇

27.12.2025 10:26 👍 14 🔁 4 💬 1 📌 0

This is a very serious initiative. While AGI risk debates get much attention, we should worry more about the immediate danger from AI’s role in automating war and surveillance.

16.12.2025 19:00 👍 10 🔁 2 💬 1 📌 0

maybe it's time for a larger cvpr in Paris?

10.12.2025 08:36 👍 7 🔁 0 💬 1 📌 0

It was a big pleasure to be on Nicolas's committee. Congratulations to Nicolas for the great work, and congratulations to the advisors too!

28.11.2025 11:49 👍 5 🔁 1 💬 0 📌 0

Prof. @tokehoye.bsky.social (Aarhus University) and I have an open PhD position (jointly advised) on biodiversity monitoring with camera trap networks. Deadline: 15-Jan-2026

Please help us share this post among students you know with an interest in Machine Learning and Biodiversity! 🤖🪲🌱

11.11.2025 13:12 👍 20 🔁 11 💬 1 📌 2

This is a paper that will be presented next month at #NeurIPS2025. The dataset and code are already publicly available.

06.11.2025 14:12 👍 4 🔁 0 💬 0 📌 0

The studied setting allows exploring large image collections in flexible and creative ways: query with an image showing a particular object and add a text query to transform aspects like context, environment, lighting conditions, object state, and more.

06.11.2025 14:12 👍 2 🔁 0 💬 1 📌 0
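
The image-plus-text query described above can be sketched as a simple score fusion in plain Python. This is only an illustrative baseline, not the method from the paper. It assumes images and text are embedded in a shared CLIP-like space, and the function names and hand-made toy vectors are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    if na == 0.0 or nb == 0.0:
        return 0.0
    return sum(x * y for x, y in zip(a, b)) / (na * nb)


def rank_composed(query_image_emb, query_text_emb, database):
    """Rank database images by the product of two similarities.

    Assumption: `database` maps an image name to an embedding in the same
    space as both queries. The product is high only when an image matches
    the query instance AND the text modification.
    """
    scored = []
    for name, emb in database.items():
        img_sim = max(cosine(query_image_emb, emb), 0.0)
        txt_sim = max(cosine(query_text_emb, emb), 0.0)
        scored.append((img_sim * txt_sim, name))
    scored.sort(key=lambda t: t[0], reverse=True)
    return [name for _, name in scored]


# Toy embedding axes (made up): [instance A, instance B, "night"].
query_image = [1.0, 0.0, 0.0]   # instance A photographed by day
query_text = [0.0, 0.0, 1.0]    # text edit: "during night"
database = {
    "A_day": [1.0, 0.0, 0.0],
    "A_night": [1.0, 0.0, 1.0],
    "B_night": [0.0, 1.0, 1.0],
}
ranking = rank_composed(query_image, query_text, database)
top_result = ranking[0]
```

With these toy vectors, only `A_night` matches both the instance and the requested change, so it ranks first. The same instance by day and a different object at night both score zero.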

A lot of work has been done recently on composed image retrieval, but we felt that none of the existing benchmarks reflects the real-world challenges and applications. So, we created a new test benchmark for instance-level composed image retrieval.

06.11.2025 14:12 👍 11 🔁 0 💬 1 📌 0

Looking for a PhD program? It all starts with great supervision. Choose wisely.

www.nature.com/articles/d41...

01.11.2025 19:26 👍 50 🔁 11 💬 2 📌 0

AnyUp is great. We are already using it flawlessly.

29.10.2025 16:31 👍 3 🔁 0 💬 0 📌 0

Armed police handcuff teen after AI mistakes crisp packet for gun in US Taki Allen, 16, said he was eating a bag of Doritos after football practice before being handcuffed by police.

This is the real-world harm of computer vision: false accusations of gun possession over a bag of crisps. Deploying this in public is reckless.
www.bbc.com/news/article...

26.10.2025 18:37 👍 7 🔁 2 💬 0 📌 0

Honored to receive a Google award to support research on vision-language models for retrieval. Grateful for the opportunity to strengthen our collaboration with Google researchers, especially Ahmet Iscen.

29.10.2025 08:28 👍 28 🔁 0 💬 3 📌 0

All slides for the RANSAC in 2025 tutorial are online
#ICCV2025
danini.github.io/ransac-2025-...

21.10.2025 18:44 👍 6 🔁 1 💬 0 📌 1

Today at #ICCV2025 (afternoon poster session): see how sensitive some foundational models are to non-semantic cues like JPEG compression and camera model. Such cues can heavily distort their semantic predictions.

22.10.2025 19:58 👍 13 🔁 2 💬 0 📌 0

This is the 7th edition of a workshop series that started from landmark recognition alone (CVPR18, CVPR19) and later broadened its scope to instance-level recognition (ECCV20, ICCV21, ECCV22, ECCV24). This year we are expanding to include the so-called personalized (instance-level) generation models.

16.10.2025 06:53 👍 2 🔁 0 💬 0 📌 0

Join our Instance-level Recognition and Generation workshop at #ICCV2025 with keynote and oral/poster presentations on image object recognition and generation at its finest granularity; each unique object of the physical world forms its own class.

16.10.2025 06:53 👍 4 🔁 0 💬 1 📌 0

The colloquium at CTU in Prague had 6 great talks and a lot of discussions before, during and after the event. The slides are now shared online. It was the 50th edition, and our administrators surprised us with a huge Czech cake - Koláč. See you in April again!

15.10.2025 07:53 👍 8 🔁 0 💬 0 📌 0
