Need pixel-level features from your backbone (DINOv3, CLIP, RADIO, FRANCA...)?
๐Introducing NAF: A universal, zero-shot feature upsampler.
It turns low-res ViT features into pixel-perfect maps.
-โก Model-agnostic
-๐ฅ SoTA results
-๐ 4ร faster than SoTA
-๐ Scales up to 2K res
25.11.2025 10:44
๐ 16
๐ 3
๐ฌ 1
๐ 2
How to make your DINOv2 excel at dense in-context scene understanding tasks.
Check out DIP an effective post-training strategy by @ssirko.bsky.social @spyrosgidaris.bsky.social โฌ
@vobeckya.bsky.social โฌ@abursuc.bsky.social and Nicolas Thome ๐
#iccv2025
25.06.2025 19:35
๐ 6
๐ 2
๐ฌ 0
๐ 0
We just released the code of #LiDPM, go ahead and play with it (and don't forget to star ๐คญ๐คฉ)!
Training and inference code available, along with the model checkpoint.
Github repo: github.com/astra-vision...
#IV2025
25.06.2025 20:05
๐ 6
๐ 3
๐ฌ 1
๐ 0
Presenting our project #LiDPM in the afternoon oral session at #IV2025!
Project page: astra-vision.github.io/LiDPM/
w/ @gillespuy.bsky.social, @alexandreboulch.bsky.social, Renaud Marlet, Raoul de Charette
Also, see our poster at 3pm in the Caravaggio room and AMA ๐
23.06.2025 10:12
๐ 10
๐ 3
๐ฌ 1
๐ 1
Okay that was stressful ๐ฅฒ
23.06.2025 11:18
๐ 7
๐ 1
๐ฌ 1
๐ 0
๐Thrilled to introduce JAFARโa lightweight, flexible, plug-and-play module that upsamples features from any Foundation Vision Encoder to any desired output resolution (1/n)
Paper : arxiv.org/abs/2506.11136
Project Page: jafar-upsampler.github.io
Github: github.com/PaulCouairon...
16.06.2025 13:58
๐ 26
๐ 6
๐ฌ 1
๐ 0
๐ Ever wondered if an AI model could learn to drive just by watching YouTube? ๐ฅ๐
We trained a 1.2B parameter model on 1,800+ hours of raw driving videos.
No labels. No maps. Just pure observation.
And it works! ๐คฏ
๐งต๐ [1/10]
24.02.2025 12:53
๐ 25
๐ 7
๐ฌ 1
๐ 2
This amazing team โค๏ธ
27.01.2025 17:01
๐ 19
๐ 3
๐ฌ 1
๐ 0
Check out our new work with @gastruc.bsky.social and @nicaogr.bsky.social and Clรฉment Mallet! The one-stop shop for multimodal Earth Observation ๐คฉ
19.12.2024 10:53
๐ 12
๐ 3
๐ฌ 0
๐ 0
Airborne #LiDAR has revolutionized the study of ancient rainforest civilizations by seeing through dense canopies. Yet archaeologists still annotate their data manually. Introducing Archaeoscape at #NeurIPS2024 โthe first deep learning-scale, open-access archaeological dataset๐งต๐
09.12.2024 09:47
๐ 27
๐ 8
๐ฌ 1
๐ 0
At INRIA Paris for @anhquancao.bsky.social for his PhD defense. Subject is Learning Semantics and Geometry for Scene Understanding.
anhquancao.github.io
05.12.2024 13:26
๐ 6
๐ 0
๐ฌ 0
๐ 0