Agneet Chatterjee's Avatar

Agneet Chatterjee

@agneet

Image and Video generation. https://agneetchatterjee.com/

154
Followers
37
Following
2
Posts
20.11.2024
Joined
Posts Following

Latest posts by Agneet Chatterjee @agneet

Preview
REVISION: Rendering Tools Enable Spatial Fidelity in Vision-Language Models Text-to-Image (T2I) and multimodal large language models (MLLMs) have been adopted in solutions for several computer vision and multimodal learning tasks. However, it has been found that such vision-l...

We also develop a benchmark to evaluate spatial understanding of VLM's. The core idea is to use synthetic images which avoids any possibility of test time leakage: arxiv.org/abs/2408.02231

26.11.2024 15:26 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 0 ๐Ÿ“Œ 0

@csprofkgd.bsky.social could you add me too? Thank you!

24.11.2024 21:11 ๐Ÿ‘ 1 ๐Ÿ” 0 ๐Ÿ’ฌ 1 ๐Ÿ“Œ 0