Stimulus reliability but not boundary distance manipulations violate the folded-X pattern of confidence
The folded-X pattern has been identified as a critical signature of confidence: as conditions become easier, confidence increases for correct trials b…
Our new paper is out in Cognition! What determines whether confidence follows the classic "folded-X" pattern vs. the "double-increase" pattern? The answer lies in the type of stimulus manipulation. Big thanks to my advisor Doby @dobyrahnev.bsky.social and co-first author @herrickfung.bsky.social !
25.02.2026 00:33
👍 21
🔁 7
💬 1
📌 1
More evidence for ‘task-defining’ vs. ‘auxiliary’ stimulus manipulations, each with distinct effects on confidence.
Check out my new paper with @dobyrahnev.bsky.social and @kaixue98.bsky.social, now in Cognition. Also check out our earlier sister paper on this matter!
Kai's thread 👇
25.02.2026 01:13
👍 5
🔁 1
💬 0
📌 0
Turns out that individual differences in accuracy, confidence, and RT among ANNs that only differ in their random initialization mimic the individual differences in humans.
It may be time for NeuroAI to take individual differences even more seriously.
Check out Herrick's thread 👇
12.02.2026 14:20
👍 14
🔁 3
💬 1
📌 0
Huge thanks to my two wonderful mentors @apurvaratan.bsky.social & @dobyrahnev.bsky.social
Preprint: www.biorxiv.org/content/10.6...
Data and code: github.com/herrickfung/...
8/8
12.02.2026 14:07
👍 2
🔁 0
💬 0
📌 0
Our work highlights an important but often overlooked dimension in human-model evaluations. We propose that a truly human-like model should not only exhibit strong alignment with the average human behavior, but also capture the structured idiosyncrasies that characterize individual humans.
7/8
12.02.2026 14:05
👍 2
🔁 1
💬 1
📌 0
We then conducted a preregistered human experiment (N = 60) and showed generalization of these findings to a more complex 10-choice object task, though the human-ANN mappings are generally less robust than the human-human benchmark, demonstrating partial mapping of individual differences.
6/8
12.02.2026 14:04
👍 0
🔁 0
💬 1
📌 0
Critically, these human-ANN mappings are consistent not only across splits of the data but also across behavioral metrics (accuracy, confidence, RT). This means that if an ANN instance is more similar to you in accuracy, it’s also more similar to you in confidence and RT.
5/8
12.02.2026 14:04
👍 0
🔁 0
💬 1
📌 0
To ensure robustness, we performed 1,000 bootstrap iterations of data splits and found that human–ANN mappings exhibit variability and consistency comparable to the human–human benchmark.
4/8
12.02.2026 14:03
👍 0
🔁 0
💬 1
📌 0
We found substantial individual differences in both humans and ANNs. Crucially, different ANN instances varied in how well they aligned with individual humans, with each instance providing a distinct fit to different subjects.
3/8
12.02.2026 14:03
👍 0
🔁 0
💬 1
📌 0
We reanalyzed a dataset of 60 humans performing an 8-choice digit task and, in parallel, trained 60 randomly initialized instances of each ANN architecture (RTNet, AlexNet, and ResNet18) on the same task. We then quantified human–ANN alignment at the individual level.
2/8
12.02.2026 14:02
👍 2
🔁 0
💬 1
📌 0
🚨 New preprint on individual differences in artificial neural networks and human behavior.
We show that individual differences among ANN instances trained with different random initializations capture the individual differences in human behavior.
1/8
12.02.2026 14:00
👍 9
🔁 7
💬 1
📌 1
OSF
New preprint: Confidence-accuracy dissociations in perceptual decision making. A review I was supposed to write 3 years ago for my VSS Young Investigator Award. Better late than never 😅 I tried to organize the literature and explore the likely mechanisms. Feedback welcome!
osf.io/preprints/ps...
13.01.2026 18:13
👍 63
🔁 23
💬 0
📌 0
31.12.2025 14:38
👍 6
🔁 2
💬 0
📌 0
"How participants create illusory experiences to help experimenters." A new preprint of a multi-study manuscript w/
@zoltandienes.bsky.social, @anilseth.bsky.social & Ryan Scott, extending previous work on demand characteristics & phenomenological control in psychological effects. osf.io/n2ukj
16.12.2025 17:45
👍 11
🔁 2
💬 0
📌 1
Excited to share my first paper: Model–Behavior Alignment under Flexible Evaluation: When the Best-Fitting Model Isn’t the Right One (NeurIPS 2025). link below.
20.11.2025 14:05
👍 18
🔁 4
💬 1
📌 2
Need more fMRI data (beyond the amazing NSD)? Introducing MOSAIC! Incredible effort led expertly by Ben Lahner, with help from grad student Mayukh Deb. Work in collaboration with the amazing Aude Oliva! @neurosky.bsky.social. More below..
04.12.2025 14:26
👍 26
🔁 9
💬 1
📌 3
Hi Tahereh—Just in case you missed my recent post, here's the link to my preprint: www.biorxiv.org/content/10.1...
26.10.2025 23:45
👍 1
🔁 0
💬 0
📌 0
So lucky to have two wonderful mentors behind this project @apurvaratan.bsky.social & @dobyrahnev.bsky.social
26.10.2025 23:41
👍 0
🔁 0
💬 0
📌 0
Human-like individual differences emerge from random weight initializations in neural networks
Much of AI research targets the behavior of an average human, a focus that traces to Turing's imitation game. Yet, no two human individuals behave exactly alike. In this study, we show that artificial...
No two humans behave exactly alike. But what about neural networks? We found early evidence that human-like individual differences in behavior emerge from networks trained with different initializations. Here’s a peek at our results—to be presented at UniReps & DBM @NeurIPS. Full paper on the way!
26.10.2025 23:39
👍 11
🔁 3
💬 2
📌 1
Hi Tahereh — Good to hear from you. We haven't posted it yet, but I'll put it on arXiv in the next few days. I'll send you the link once it's up, in case you'd like to add it now or later.
22.10.2025 22:22
👍 1
🔁 0
💬 0
📌 0
It looks like stimulus manipulations can be divided into "task-defining" and "auxiliary". The manipulations from each group have very different effects on accuracy vs. confidence. And all auxiliary manipulations seem to work in basically the same way. Really cool stuff by @herrickfung.bsky.social.
13.10.2025 12:09
👍 10
🔁 1
💬 0
📌 0
Huge thanks to Medha Shekhar, @kaixue98.bsky.social, @manurausch.bsky.social, & @dobyrahnev.bsky.social for their guidance and collaboration on this project.
Full article: www.sciencedirect.com/science/arti...
Data and code: github.com/herrickfung/...
n/n
12.10.2025 16:13
👍 2
🔁 1
💬 0
📌 0
These findings show a clear behavioral difference between manipulating the task-defining feature (orientation) and other auxiliary manipulations that only affect stimulus clarity, offering a way to predict how novel manipulations affect confidence and accuracy.
4/n
12.10.2025 16:12
👍 1
🔁 0
💬 1
📌 0
Manipulating other features—size, duration, spatial frequency, and noise—though visually distinct, showed strikingly similar effects: they (1) affected confidence more strongly than accuracy, (2) showed sub-additive or no interaction with other features, & (3) violated the Folded-X pattern.
3/n
12.10.2025 16:11
👍 1
🔁 0
💬 1
📌 0
In an orientation discrimination task, manipulating the “task-defining” tilt offset of the Gabor demonstrated three unique effects: it (1) affected accuracy more strongly than confidence, (2) showed supra-additive interaction with other features, & (3) exhibited the Folded-X pattern.
2/n
12.10.2025 16:11
👍 0
🔁 0
💬 1
📌 0
Similarities and differences in the effects of different stimulus manipulations on accuracy and confidence
Visual stimuli can vary in multiple dimensions that affect accuracy and confidence in a perceptual decision-making task. However, previous studies hav…
Glad to see my first-year project is out!
In two experiments, we manipulated multiple stimulus features in a perception task, yet their effects on confidence and accuracy fell into just two distinct behavioral patterns, offering a way to predict the effects of novel stimulus manipulations.
1/n
12.10.2025 16:10
👍 22
🔁 8
💬 1
📌 1