Check out this new paper from Willis on inference time scaling of diffusion models!
Check out this new paper from Willis on inference time scaling of diffusion models!
two exciting directions for diffusion models in 2025: either going (extremely) small or going (extremely) big with your steps
Visual-spatial intelligence–we rely on it to perceive, interact, and navigate our everyday spaces. To what capacity do MLLMs possess it? Do they mirror how humans think and reason about space?
Presenting “Thinking in Space: How Multimodal Models See, Remember, and Recall Spaces”! [1/n]