We have open PhD positions in Computer Vision & Machine Learning at @tuda.bsky.social and @hessianai.bsky.social within the Reasonable AI Cluster of Excellence, supervised by @stefanroth.bsky.social, @simoneschaub.bsky.social and many others!
www.career.tu-darmstadt.de/tu-darmstadt...
04.11.2025 14:04
[6/8] Motion-Refined DINOSAUR for Unsupervised Multi-Object Discovery (Oral at ILR+G Workshop)
by Xinrui Gong*, @olvrhhn.bsky.social*, @christophreich.bsky.social, Krishnakant Singh, @simoneschaub.bsky.social, @dcremers.bsky.social, @stefanroth.bsky.social
19.10.2025 15:35
[3/8] Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
by @jev-aleks.bsky.social*, @christophreich.bsky.social*, @fwimbauer.bsky.social, @olvrhhn.bsky.social, Christian Rupprecht, @stefanroth.bsky.social, @dcremers.bsky.social
visinf.github.io/scenedino/
19.10.2025 15:35
Interested in 3D DINO features from a single image or unsupervised scene understanding?
Come by our SceneDINO poster at NeuSLAM today 14:15 (Kamehameha II) or Tue, 15:15 (Ex. Hall I 627)!
W/ Jevtić, @fwimbauer.bsky.social, @olvrhhn.bsky.social, Rupprecht, @stefanroth.bsky.social, @dcremers.bsky.social
19.10.2025 20:38
ELLIS PhD Program: Call for Applications 2025
The ELLIS mission is to create a diverse European network that promotes research excellence and advances breakthroughs in AI, as well as a pan-European PhD program to educate the next generation of AI...
Looking for a PhD position in computer vision? Apply to the European Laboratory for Learning & Intelligent Systems (ELLIS) and work with @stefanroth.bsky.social & @simoneschaub.bsky.social! Join the info session on Oct 1.
@ellis.eu @tuda.bsky.social
ellis.eu/news/ellis-p...
29.09.2025 09:34
Check out our blog post about SceneDINO
For more details, check out our project page, demo, and #ICCV2025 paper.
Project page: visinf.github.io/scenedino/
Demo: visinf.github.io/scenedino/
Paper: arxiv.org/abs/2507.06230
@jev-aleks.bsky.social
24.07.2025 13:16
The code for our #CVPR2025 paper, PRaDA: Projective Radial Distortion Averaging, is now out!
It turns out that distortion calibration from multiview 2D correspondences can be fully decoupled from 3D reconstruction, greatly simplifying the problem.
arxiv.org/abs/2504.16499
github.com/DaniilSinits...
09.07.2025 13:54
SceneDINO
Feed-forward SceneDINO: Single input image -> 3D geometry and features -> unsupervised semantics (SSC).
Project Page: visinf.github.io/scenedino/
Paper: arxiv.org/abs/2507.06230
Code: github.com/tum-vision/s...
Demo: huggingface.co/spaces/jev-a...
09.07.2025 13:17
Work by: @jev-aleks.bsky.social*, @christophreich.bsky.social*, @olvrhhn.bsky.social, Christian Rupprecht, @stefanroth.bsky.social, and @dcremers.bsky.social @ TUM CVG, @visinf.bsky.social, @munichcenterml.bsky.social, @zuseschooleliza.bsky.social, and @hessianai.bsky.social
09.07.2025 13:17
SceneDINO offers refined, high-resolution, and multi-view consistent (rendered) 2D features.
09.07.2025 13:17
SceneDINO outperforms our unsupervised baseline (S4C + STEGO) in unsupervised SSC accuracy.
Linear probing our feature field leads to an SSC accuracy on par with 2D supervised S4C.
09.07.2025 13:17
Distilling and clustering SceneDINO's feature field in 3D results in unsupervised semantic scene completion predictions.
09.07.2025 13:17
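To make the "clustering a feature field" step concrete, here is a minimal sketch of running plain k-means over a voxel grid of feature vectors. It is an illustration under stated assumptions, not SceneDINO's actual pipeline: the grid shape, the synthetic features, and the helper names `kmeans` and `cluster_feature_field` are all hypothetical, and the distillation step mentioned in the post is not modeled.

```python
import numpy as np


def kmeans(features, k, iters=20, seed=0):
    """Plain k-means on an (N, C) array; returns (labels, centers)."""
    rng = np.random.default_rng(seed)
    # Initialise centers from k distinct data points (fancy indexing copies).
    centers = features[rng.choice(len(features), size=k, replace=False)]
    for _ in range(iters):
        # Distance from every feature vector to every center: shape (N, k).
        dists = np.linalg.norm(features[:, None] - centers[None], axis=-1)
        labels = dists.argmin(axis=1)
        for c in range(k):
            members = features[labels == c]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[c] = members.mean(axis=0)
    return labels, centers


def cluster_feature_field(field, k):
    """Cluster an (X, Y, Z, C) feature grid into an (X, Y, Z) grid of pseudo-labels."""
    x, y, z, c = field.shape
    labels, _ = kmeans(field.reshape(-1, c), k)
    return labels.reshape(x, y, z)


# Synthetic stand-in for a 3D feature field (assumption): two regions whose
# features differ by a large offset, plus a little noise.
rng = np.random.default_rng(1)
field = rng.normal(scale=0.1, size=(4, 4, 2, 8))
field[2:] += 10.0

voxel_labels = cluster_feature_field(field, k=2)
```

Each voxel ends up with a cluster index rather than a named class; mapping cluster indices to semantic categories for evaluation is a separate step not shown here.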
SceneDINO is trained to estimate an expressive 3D feature field using multi-view self-supervision and 2D DINO features.
09.07.2025 13:17
SceneDINO is unsupervised and infers 3D geometry and features from a single image in a feed-forward manner. Distilling and clustering SceneDINO's 3D feature field lead to unsupervised semantic scene completion predictions.
09.07.2025 13:17
We present "Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion". #ICCV2025
Project page: visinf.github.io/scenedino/
Paper: arxiv.org/abs/2507.06230
Demo: huggingface.co/spaces/jev-a...
@jev-aleks.bsky.social @fwimbauer.bsky.social @olvrhhn.bsky.social @stefanroth.bsky.social @dcremers.bsky.social
09.07.2025 13:17
Aleksandar Jevtić, Christoph Reich, Felix Wimbauer, Oliver Hahn, Christian Rupprecht, Stefan Roth, Daniel Cremers
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
https://arxiv.org/abs/2507.06230
09.07.2025 05:20
Got a strong XAI paper rejected from ICCV? Submit it to our ICCV eXCV Workshop today. We welcome high-quality work!
Submissions open until June 26 AoE.
Got accepted to ICCV? Congrats! Consider our non-proceedings track.
#ICCV2025 @iccv.bsky.social
26.06.2025 09:21
Scene-Centric Unsupervised Panoptic Segmentation
by @olvrhhn.bsky.social, @christophreich.bsky.social, @neekans.bsky.social, @dcremers.bsky.social, Christian Rupprecht, and @stefanroth.bsky.social
Sunday, 8:30 AM, ExHall D, Poster 330
Project Page: visinf.github.io/cups
11.06.2025 20:56
Can we match vision and language representations without any supervision or paired data?
Surprisingly, yes!
Our #CVPR2025 paper with @neekans.bsky.social and @dcremers.bsky.social shows that the pairwise distances in both modalities are often enough to find correspondences.
⬇️ 1/4
03.06.2025 09:27
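As a toy illustration of matching by pairwise distances alone, the sketch below finds the permutation that best aligns two distance matrices by brute force. This is an assumption-laden example, not the method from the paper: the embeddings are synthetic, the second "modality" is just a rotated and shuffled copy of the first, and the helper names `pairwise_distances` and `match_by_distances` are hypothetical. Brute force only scales to a handful of items.

```python
import itertools

import numpy as np


def pairwise_distances(x):
    """Euclidean distance matrix for an (N, D) array."""
    diff = x[:, None, :] - x[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def match_by_distances(a, b):
    """Return perm such that a[i] is matched to b[perm[i]].

    Only the two intra-set distance matrices are compared; no paired
    data and no cross-set distances are used.
    """
    da, db = pairwise_distances(a), pairwise_distances(b)
    best_perm, best_cost = None, np.inf
    for perm in itertools.permutations(range(len(a))):
        p = list(perm)
        # Reorder rows and columns of db by the candidate matching.
        cost = np.abs(da - db[np.ix_(p, p)]).sum()
        if cost < best_cost:
            best_perm, best_cost = p, cost
    return best_perm


rng = np.random.default_rng(0)
a = rng.normal(size=(6, 4))                    # toy embeddings, "modality A"
q, _ = np.linalg.qr(rng.normal(size=(4, 4)))   # random orthogonal map
shuffle = rng.permutation(6)
b = (a @ q)[shuffle]                           # same geometry, unknown order

perm = match_by_distances(a, b)
```

Because the orthogonal map preserves distances, the recovered `perm` undoes `shuffle` in this idealized setup. Real vision and language embeddings only share geometry approximately, so the alignment cost would be nonzero and an approximate solver would be needed.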
Can you train a model for pose estimation directly on casual videos without supervision?
Turns out you can!
In our #CVPR2025 paper AnyCam, we directly train on YouTube videos and achieve SOTA results by using an uncertainty-based flow loss and monocular priors!
⬇️
13.05.2025 08:11
Check out our recent #CVPR2025 paper AnyCam, a fast method for pose estimation in casual videos!
1️⃣ Can be directly trained on casual videos without the need for 3D annotation.
2️⃣ Based around a feed-forward transformer and light-weight refinement.
Code and more info: fwmb.github.io/anycam/
23.04.2025 15:52
Check out our recent #CVPR2025 #highlight paper on unsupervised panoptic segmentation
visinf.github.io/cups/
04.04.2025 13:45
Check out the #MCML blog post on our recent #CVPR2025 #highlight paper
04.04.2025 13:36
Nice one! Have you tried instance segmentation?
28.03.2025 16:25
Check out the recent CVG papers at #CVPR2025, including our (@olvrhhn.bsky.social, @neekans.bsky.social, @dcremers.bsky.social, Christian Rupprecht, and @stefanroth.bsky.social) work on unsupervised panoptic segmentation. The paper will soon be available on arXiv.
13.03.2025 15:49
⛷️ Looking back on a fantastic week full of talks, research discussions, and skiing in the Austrian mountains!
31.01.2025 19:38