Trending
Vedant Shah's Avatar

Vedant Shah

@veds12

Research at Mila / UdeM https://veds12.github.io/

198
Followers
94
Following
3
Posts
20.11.2024
Joined
Posts Following

Latest posts by Vedant Shah @veds12

N E R D

09.01.2026 13:06 πŸ‘ 1 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image

(1/n)🚨Train a model solving DFT for any geometry with almost no training data
Introducing Self-Refining Training for Amortized DFT: a variational method that predicts ground-state solutions across geometries and generates its own training data!
πŸ“œ arxiv.org/abs/2506.01225
πŸ’» github.com/majhas/self-...

10.06.2025 19:49 πŸ‘ 12 πŸ” 4 πŸ’¬ 1 πŸ“Œ 1

hank you to our funders for this project: CIFAR, NSERC, and Abundant Intelligences. Thank you also for meeting me with the rich discussions @tyrellturing.bsky.social, @veds12.bsky.social, @mnoukhov.bsky.social and @arnaghosh.bsky.social that gave clarity to the problem.

05.06.2025 15:32 πŸ‘ 6 πŸ” 1 πŸ’¬ 1 πŸ“Œ 0
Post image

New preprint! πŸ§ πŸ€–

How do we build neural decoders that are:
⚑️ fast enough for real-time use
🎯 accurate across diverse tasks
🌍 generalizable to new sessions, subjects, and even species?

We present POSSM, a hybrid SSM architecture that optimizes for all three of these axes!

🧡1/7

06.06.2025 17:40 πŸ‘ 54 πŸ” 24 πŸ’¬ 2 πŸ“Œ 8
Post image

I will be presenting our work at the MATH-AI workshop at #NeurIPS2024 today.

Location: West Meeting Room 118-120
Time: 11:00 AM - 12:30 PM; 4:00 PM - 5:00 PM

Come by if you want to chat about designing difficult evaluation benchmarks, follow-up work, and mathematical reasoning in LLMs!

14.12.2024 16:53 πŸ‘ 5 πŸ” 1 πŸ’¬ 0 πŸ“Œ 0
Post image

I will be at #NeurIPS2024 this week and will be presenting our work

"AI-Assisted Generation of Difficult Math Questions"

at the MATH-AI Workshop on Saturday πŸš€!

Would love to chat if you are interested in topics related to LLM reasoning and systematic generalization!

arxiv.org/abs/2407.21009

09.12.2024 22:45 πŸ‘ 3 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Re: the scale is dead debate. Isn't it pretty obvious that just scaling is never going to work if your method breaks down on OOD inputs? The world is non-stationary, so it's constantly presenting new OOD inputs.

20.11.2024 16:55 πŸ‘ 45 πŸ” 4 πŸ’¬ 6 πŸ“Œ 1