Your data is low-rank, so stop wasting compute! In our new paper on low-rank thinning, we share one weird trick to speed up Transformer inference, SGD training, and hypothesis testing at scale. Come by ICML poster W-1012 Tuesday at 4:30!
So you want to skip our thinning proofs, but you'd still like our out-of-the-box attention speedups? I'll be presenting the Thinformer at two ICML workshop posters tomorrow!
Catch me at Es-FoMo (1-2:30, East hall A) and at LCFM (10:45-11:30 & 3:30-4:30, West 202-204)
new working paper! we (me, Su Lin Blodgett, @ninamarkl.bsky.social) examine how recent marketing of LLMs extends older discourses that cast workers as bundles of skills, and unpack the false promises of empowerment these discourses embed, in times of precarity
tisjune.github.io/papers/aarhu...
Looking forward to this year's edition! With great speakers: Ryan McDonald, Yulan He, @vn-ml.bsky.social, @antonisa.bsky.social, Raquel Fernandez, @annarogers.bsky.social, Preslav Nakov, @mohitbansal.bsky.social, @eunsol.bsky.social, Marie-Catherine de Marneffe!
my lab (lacns.github.io) at @mpi-nl.bsky.social and @dondersinst.bsky.social is recruiting for two PhD and two postdoctoral positions funded by an @erc.europa.eu Consolidator - come join us!
PhD: www.mpi.nl/career-educa...
Postdoc: www.mpi.nl/career-educa...
(please share widely)
Does the "agreement" part refer only to the previous question or to something else, and does the answer there have any consequences for the review process (can we review regardless of the option? can we submit papers regardless of the option?)
thanks in advance!
The "attribution" section only has an option "yes", signaling agreement to deanonymize your reviews. Is there an option to say no? (eg. by not selecting anything?) This is not communicated and is different from most other entries in the form.
hi, i'm struggling with the author registration form. i can't work out how to navigate the dark design patterns used in the "attribution" and "agreement" part of the form.
could you please provide some details about those choices?
most importantly: are there choices that result in a desk reject?
Schematic illustration of a scalar-valued residual deep GP with L hidden layers. The last layer is a scalar-valued GP on the manifold. If it is not present, the model is manifold-valued. If it is replaced with a Gaussian vector field (GVF), the model is a vector field on the manifold.
Excited to share our ICLR 2025 oral "Residual Deep Gaussian Processes on Manifolds"!
With @vabor112.bsky.social & @arkrause.bsky.social, we introduce manifold-to-manifold GPs that can be composed together, generalising deep GPs to manifolds. Applications include wind prediction & Bayes opt! 1/n
i can't believe how long we've spent fooling ourselves about the value of fully specified, massive matmuls instead of embracing the gods of sparsity
Recruiting a PhD candidate at the U. of Amsterdam (funded, 4yr). We will use ML & NLP, probabilistic models, and user studies to build adaptive scientific-assistant systems that communicate & justify decisions in ways helpful to experts.
More: vene.ro/jobs.html
Apply by May 18: werkenbij.uva.nl/en/vacancies...
Variational approximation with Gaussian mixtures is looking cute! So here it's just gradient descent on KL(q||p) for optimising the mixture's means & covariances & weights...
@lacerbi.bsky.social
This review paper by @guillaume-garrigos.com on SGD-related algorithms is a fantastic resource, offering elegant, self-contained, and concise proofs in a single, accessible reference. arxiv.org/pdf/2301.11235
These phenomena have been observed since early vision systems. It is important to report these things, though. Maybe it will permeate and we won't keep making the same mistakes over and over
This is such a beautiful algorithm (and a nice analysis): to check if an array is sorted vs. far from being sorted (many entries need to be changed), just:
- pick an element uniformly at random in the array
- "forget" where it was
- try to find it again via binary search
Repeat this a few times.
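The steps above can be sketched in a few lines of Python. This is a minimal sketch assuming distinct elements; the names `binary_search`, `looks_sorted`, and `trials` are my own, not from the post:

```python
import random

def binary_search(arr, x):
    """Standard binary search; returns the index where x is found, else -1."""
    lo, hi = 0, len(arr) - 1
    while lo <= hi:
        mid = (lo + hi) // 2
        if arr[mid] == x:
            return mid
        elif arr[mid] < x:
            lo = mid + 1
        else:
            hi = mid - 1
    return -1

def looks_sorted(arr, trials=10):
    """Spot-check sortedness: pick a random element, 'forget' its position,
    and try to rediscover it via binary search.  On a sorted array every
    trial succeeds.  If the array is eps-far from sorted (an eps fraction
    of entries must change), each trial fails with probability at least
    eps, so a handful of trials exposes it with high probability."""
    for _ in range(trials):
        i = random.randrange(len(arr))          # pick a random index
        if binary_search(arr, arr[i]) != i:     # search didn't lead back to i
            return False                        # witness that arr isn't sorted
    return True
```

The nice part of the analysis: the set of indices where the search succeeds is itself an increasing subsequence, which is why far-from-sorted arrays have many failing indices.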
I and hundreds of other workers at the University of Amsterdam are on strike with @fnv.bsky.social
www.linkedin.com/pulse/our-ha...
"AI can be bad but also it can be good" is just a really dumb way to talk about anything...it's the grade-school exercise of "make a list of pros and cons" but pressed into service for producing a sense of inevitability and making the medicine go down
OpenAI in 2024:
"No AI for weapons or military"
"Don't use our AI to make weapons to hurt yourself or others"
"Military is fine, but no AI for weapons"
"Sure, put it on battlefield drones"
www.technologyreview.com/2024/12/04/1...
Blue skies 🦋, hot (?) takes 🔥
Constrained output for LLMs, e.g., the outlines library for vLLM, which forces models to output JSON/Pydantic schemas, is cool!
But because output tokens cost much more latency than input tokens, if speed matters, bespoke low-token output formats are often better.
I hope I am not late to the party (was away post-quals chilling) but here are some thoughts on why this is bad IMO:
First, a disclaimer that I am writing this as an African who is a speaker of multiple African languages, NLP researcher of African languages, and HCI researcher focusing broadly on..