Alexandre Boulch (@alexandreboulch)

GitHub - valeoai/muddos: Official repository of the BMVC 2025 paper "Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift" Official repository of the BMVC 2025 paper "Improving Multimodal Distillation for 3D Semantic Segmentation under Domain Shift" - valeoai/muddos

For more details
📝 Paper: bmva-archive.org.uk/bmvc/2025/a...
💻 Code: github.com/valeoai/muddos

This is a joint work with my great co-authors @alexandreboulch.bsky.social, @gillespuy.bsky.social, @tuanhungvu.bsky.social, Renaud Marlet, @ncourty.bsky.social and myself.

24.11.2025 05:00 👍 1 🔁 1 💬 0 📌 0

Need pixel-level features from your backbone (DINOv3, CLIP, RADIO, FRANCA...)?

🚀Introducing NAF: A universal, zero-shot feature upsampler.

It turns low-res ViT features into pixel-perfect maps.

-⚡ Model-agnostic
-🥇 SoTA results
-🚀 4× faster than SoTA
-📈 Scales up to 2K res

25.11.2025 10:44 👍 16 🔁 3 💬 1 📌 2

Évaluation des générateurs d'images à partir de peu d'exemples : calculer le FID avec 10 fois moins d'images, c'est possible La distance Inception de Fréchet (Fréchet Inception Distance ou FID) est une métrique standard pour l'évaluation des modèles génératifs images. Construite sur la distance de Wasserstein, le FID mesure...

Pour les collègues francophones, vous saviez que le FID était tout cassé ? Moi non plus. Pourtant, si on s'y prend bien, on peut calculer le FID avec moins de 1000 images.

J'en parlerai au GRETSI fin août : hal.science/hal-05142942 👀

09.07.2025 14:11 👍 3 🔁 3 💬 0 📌 0

How to make your DINOv2 excel at dense in-context scene understanding tasks.
Check out DIP an effective post-training strategy by @ssirko.bsky.social @spyrosgidaris.bsky.social ‬
@vobeckya.bsky.social ‬@abursuc.bsky.social and Nicolas Thome 👇
#iccv2025

25.06.2025 19:35 👍 6 🔁 2 💬 0 📌 0

We just released the code of #LiDPM, go ahead and play with it (and don't forget to star 🤭🤩)!

Training and inference code available, along with the model checkpoint.

Github repo: github.com/astra-vision...

#IV2025

25.06.2025 20:05 👍 6 🔁 3 💬 1 📌 0

Presenting our project #LiDPM in the afternoon oral session at #IV2025!

Project page: astra-vision.github.io/LiDPM/

w/ @gillespuy.bsky.social, @alexandreboulch.bsky.social, Renaud Marlet, Raoul de Charette

Also, see our poster at 3pm in the Caravaggio room and AMA 😉

23.06.2025 10:12 👍 10 🔁 3 💬 1 📌 1

Okay that was stressful 🥲

23.06.2025 11:18 👍 7 🔁 1 💬 1 📌 0

🚀Thrilled to introduce JAFAR—a lightweight, flexible, plug-and-play module that upsamples features from any Foundation Vision Encoder to any desired output resolution (1/n)

Paper : arxiv.org/abs/2506.11136
Project Page: jafar-upsampler.github.io
Github: github.com/PaulCouairon...

16.06.2025 13:58 👍 26 🔁 6 💬 1 📌 0

🚗 Ever wondered if an AI model could learn to drive just by watching YouTube? 🎥👀

We trained a 1.2B parameter model on 1,800+ hours of raw driving videos.

No labels. No maps. Just pure observation.

And it works! 🤯

🧵👇 [1/10]

24.02.2025 12:53 👍 25 🔁 7 💬 1 📌 2

This amazing team ❤️

27.01.2025 17:01 👍 19 🔁 3 💬 1 📌 0

Check out our new work with @gastruc.bsky.social and @nicaogr.bsky.social and Clément Mallet! The one-stop shop for multimodal Earth Observation 🤩

19.12.2024 10:53 👍 12 🔁 3 💬 0 📌 0

Airborne #LiDAR has revolutionized the study of ancient rainforest civilizations by seeing through dense canopies. Yet archaeologists still annotate their data manually. Introducing Archaeoscape at #NeurIPS2024 —the first deep learning-scale, open-access archaeological dataset🧵👇

09.12.2024 09:47 👍 27 🔁 8 💬 1 📌 0

Motion Modes: What Could Happen Next? Motion Modes is the first training-free method to generate multiple plausible yet distinct motions for a given object, disentangled from the motion of other objects, camera and other scene changes, fr...

I could easily spend an afternoon looking at the results of this paper: motionmodes.github.io
or this paper: rollingdepth.github.io
or this paper: romosfm.github.io

vision is cool 😎

05.12.2024 11:23 👍 20 🔁 8 💬 1 📌 0

At INRIA Paris for @anhquancao.bsky.social for his PhD defense. Subject is Learning Semantics and Geometry for Scene Understanding.
anhquancao.github.io

05.12.2024 13:26 👍 6 🔁 0 💬 0 📌 0

Alexandre Boulch

Latest posts by Alexandre Boulch @alexandreboulch