The new conformal prediction book now seems to be final after a bunch of updates: arxiv.org/abs/2411.118...
In my #OTD threads I often try to draw attention to important scientists who historically haven't received the same recognition as their peers.
A science education involves a lot of lore: Stories and anecdotes meant to flesh out a narrative about the development of a field...
I have added a new tutorial on discrete diffusion models:
github.com/gpeyre/ot4ml
Was saddened today to hear of the passing of Joseph Halpern. I knew him from my undergraduate days at Cornell, for part of which he was the department chair.
www.bangsfuneralhome.com/obituaries/j...
If you are interested in all things ✨causal inference✨, please join our multidisciplinary Causal Inference Interest Group (CIIG). We host monthly seminars featuring speakers with various academic backgrounds and research interests.
Links below.
cc @clscohorts.bsky.social @pwgtennant.bsky.social
We propose a method for estimating long-term treatment effects with many short-term proxy outcomes: a central challenge when experimenting on digital platforms. We formalize this challenge as a latent variable problem where observed proxies are noisy measures of a low-dimensional set of unobserved surrogates that mediate treatment effects. Through theoretical analysis and simulations, we demonstrate that regularized regression methods substantially outperform naive proxy selection. We show in particular that the bias of Ridge regression decreases as more proxies are added, with closed-form expressions for the bias-variance tradeoff. We illustrate our method with an empirical application to the California GAIN experiment.
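The core intuition can be illustrated with a toy simulation (this is my own minimal sketch, not the paper's estimator: the data-generating process, sample sizes, and variable names below are all invented for illustration). A latent surrogate S mediates the treatment effect; each observed proxy is S plus noise; a ridge regression of the long-term outcome on many proxies attenuates far less than a regression on a single proxy:

```python
import numpy as np

rng = np.random.default_rng(0)

def ridge(X, y, lam=1.0):
    # Closed-form ridge regression: w = (X'X + lam*I)^{-1} X'y.
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

def simulate(p, n=2000):
    # Historical data: a latent surrogate S drives both the long-term
    # outcome Y and p noisy short-term proxies X_j = S + noise.
    S = rng.normal(size=n)
    X = S[:, None] + rng.normal(size=(n, p))
    Y = S + 0.1 * rng.normal(size=n)
    w = ridge(X, Y)

    # Experimental data: treatment T shifts the surrogate by 1 unit,
    # so the true long-term effect of T on Y is 1.
    T = rng.integers(0, 2, size=n)
    S_exp = T + rng.normal(size=n)
    X_exp = S_exp[:, None] + rng.normal(size=(n, p))

    # Predict the long-term outcome from proxies, then contrast arms.
    Yhat = X_exp @ w
    return Yhat[T == 1].mean() - Yhat[T == 0].mean()

print("1 proxy   :", simulate(p=1))   # attenuated toward 0.5 by proxy noise
print("50 proxies:", simulate(p=50))  # much closer to the true effect of 1.0
```

With a single proxy whose noise variance equals the surrogate's variance, the population attenuation factor is 1/2; averaging information across 50 proxies shrinks it to roughly 50/51, matching the paper's qualitative claim that bias falls as proxies are added.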
arXiv:
Long-Term Causal Inference with Many Noisy Proxies
By Lal, Imbens, Hull
I've just started reading this new book by Shekhar Khare, a renowned number theorist at UCLA, and I highly recommend it. It takes the reader on an intellectual adventure, with very illuminating analogies and vivid storytelling. It's a book about some brilliant math, but it also has a great heart.
We introduce epiplexity, a new measure of information that provides a foundation for how to select, generate, or transform data for learning systems. We have been working on this for almost 2 years, and I cannot contain my excitement! arxiv.org/abs/2601.03220 1/7
New paper: Inference on Local Variable Importance Measures for Heterogeneous Treatment Effects (with Peter B. Gilbert & @alexluedtke.bsky.social).
Preprint: arxiv.org/abs/2510.18843.
Gradient optimization methods: the benefits of instability β Peter Bartlett, UC Berkeley
www.youtube.com/watch?v=wEgT...
#MathSky #SMRISeminar
Read last night. Very nice. arxiv.org/abs/2512.01868
TL;DR: The PSF has made the decision to put our community and our shared diversity, equity, and inclusion values ahead of seeking $1.5M in new revenue. Please read and share. pyfound.blogspot.com/2025/10/NSF-...
🧵
A nice variant of the kernel two-sample test. arxiv.org/abs/2510.11853
Sketch of the idea: The MMD (maximum mean discrepancy) is the core of a commonly used nonparametric two-sample test. It works by embedding distributions into an RKHS and comparing their mean embeddings. [+]
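The mean-embedding comparison can be sketched in a few lines (a minimal illustration with a Gaussian kernel and invented toy data, not the variant proposed in the paper):

```python
import numpy as np

def rbf_kernel(X, Y, sigma=1.0):
    # Gaussian (RBF) kernel matrix between the rows of X and the rows of Y.
    sq = (X**2).sum(1)[:, None] + (Y**2).sum(1)[None, :] - 2 * X @ Y.T
    return np.exp(-sq / (2 * sigma**2))

def mmd2_biased(X, Y, sigma=1.0):
    # Biased (V-statistic) estimate of MMD^2: the squared RKHS distance
    # between the mean embeddings of the two samples.
    return (rbf_kernel(X, X, sigma).mean()
            + rbf_kernel(Y, Y, sigma).mean()
            - 2 * rbf_kernel(X, Y, sigma).mean())

rng = np.random.default_rng(0)
X = rng.normal(0.0, 1.0, size=(200, 2))  # sample from P
Y = rng.normal(1.0, 1.0, size=(200, 2))  # sample from Q (shifted mean)
Z = rng.normal(0.0, 1.0, size=(200, 2))  # second sample from P

# Samples from different distributions give a larger MMD than matched ones.
print(mmd2_biased(X, Y) > mmd2_biased(X, Z))
```

In the full test, the observed statistic is compared against a permutation null (recomputing the MMD after shuffling the pooled sample) to get a p-value.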
As a grad student, the biologist Yitzhi "Patrick" Cai helped program E. coli bacteria to become a biosensor for arsenic contamination in drinking water. Today, he is leading a global effort to build the first-ever synthetic eukaryotic genome. www.quantamagazine.org/hes-gleaning...
We have two new mentees who are offering their time via office hours! Please show Sandeep Silwal and Kevin Tian some love and sign up to meet them!
let-all.com/officehours....
The paper this talk is based on is quite impressive (arxiv.org/abs/2507.04441): one of those cases where you see direct, real, actionable insight coming from the categorical hammer.
Today at IAS, I gave a 2-hour-15-minute lecture on why TIME[t] is in SPACE[√(t log t)]. You can watch it here!
www.youtube.com/watch?v=ThLv...
Aligning an AI with human preferences might be hard. But there is more than one AI out there, and users can choose which to use. Can we get the benefits of a fully aligned AI without solving the alignment problem? In a new paper we study a setting in which the answer is yes.
Sergei Drozdov has published his nice proof, using hyperbolic simplices, of the necessary and sufficient condition on the radii of the spheres inscribed in and circumscribed around a Euclidean simplex in any dimension.
www.sciencedirect.com/science/arti...
The workshops focused on (in chronological order):
- Variational Inference (youtube.com/playlist?lis...)
- Optimal Transport (youtube.com/playlist?lis...)
- Parallel Computing (youtube.com/playlist?lis...), and
- Computational Physics (youtube.com/playlist?lis...).
The mathematician Lingrui Ge recently helped find a new way to understand the solutions of almost-periodic operators, important equations that appear in quantum physics. The work has helped cement an intriguing connection between number theory and physics. www.quantamagazine.org/ten-martini-...
Pretty cool: the "Fundamental Examples" of independence structures in Non-Commutative Probability.
Carlos Cinelli, Avi Feller, Guido Imbens, Edward Kennedy, Sara Magliacane, Jose Zubizarreta
Challenges in Statistics: A Dozen Challenges in Causality and Causal Inference
https://arxiv.org/abs/2508.17099
Adi Shamir's advice to young researchers:
1. Read, read, read. Back in the eighties, I read every cryptography paper out there. Once that became impossible, I read the abstract of every paper. Now I read at least every title.
This week's post is about why spherical cows are physics' mascot ⚛️🧪
open.substack.com/pub/nirmalya...
Big fan of this perspective:
This is about one of my greatest inspirations. It would mean a lot to me if you gave it a watch
Regardless of whether you plan to use them in applications, everyone should learn about Gaussian processes and Bayesian methods. They provide a foundation for reasoning about model construction and all sorts of deep learning behaviour that would otherwise appear mysterious.
After a bit of a summer pause, I'm back to making episodes. In this episode, I explain the notion of confounding, and clarify why confounders should not be thought of as alternate explanations of an observed effect.
youtu.be/kAgS7cltBhM
Randomized controlled trials (RCTs) help evaluate whether deploying AI/ML systems actually improves outcomes (e.g., survival rates in a healthcare context).
But AI/ML systems can change: Do we need a new RCT every time we update the model? Not necessarily, as we show in our UAI paper! arxiv.org/abs/2502.09467