Dang Nguyen (@divingwithorcas)

The Mirage of Autonomous AI Scientists
Science as AI’s killer application cannot succeed without scientist-AI interaction: Introducing Hypogenic.ai.

AI can accelerate scientific discovery, but only if we get the scientist–AI interaction right.

The dream of “autonomous AI scientists” is tempting: machines that generate hypotheses, run experiments, and write papers. But science isn’t just automation.

cichicago.substack.com/p/the-mirage...
🧵

23.10.2025 18:55 👍 22 🔁 6 💬 2 📌 2

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions Understanding and mitigating biases is critical for the adoption of large language models (LLMs) in high-stakes decision-making. We introduce Admissions and Hiring, decision tasks with hypothetical ap...

Paper: arxiv.org/abs/2504.06303

08.10.2025 15:14 👍 0 🔁 0 💬 0 📌 0

📣 Announcing our poster session at COLM 2025:

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions

I will talk about biases in LLMs and how to mitigate them. Come say hi!

Poster #43, 4:30 PM

08.10.2025 15:13 👍 1 🔁 2 💬 1 📌 1

This game from UChicago is incredible! It might be a bit painful to play, especially for those of us who already spend too much time on email, but the concept and execution are brilliant!

03.10.2025 00:04 👍 3 🔁 1 💬 0 📌 0

HR Simulator™: Be the Person You Hate A game that will change how you write emails.

and yes, you can play on your mobile browser: hrsimulator.communicationgames.ai

29.09.2025 02:50 👍 0 🔁 0 💬 0 📌 0

Playing HR Simulator™: think I'm getting on Brittany's good side

This is what she says about my attempt to get Dave to return to in-person work.

Any big tech company wanna hire me for HR? 👀

#HRSimulator #RoastedByBrittany

29.09.2025 02:49 👍 0 🔁 0 💬 1 📌 0

Please use a VPN. We're sorry for any inconvenience!

27.09.2025 14:26 👍 0 🔁 0 💬 0 📌 0

Home-grown at CHAI and @uchicagoci.bsky.social!! The first ever AI-driven game from academia 🎮 Give it a go and let us know your rank on the leaderboard!

26.09.2025 18:51 👍 1 🔁 1 💬 0 📌 0

Stay tuned for more on communication games! Big thanks to @ari-holtzman.bsky.social @Harvey Fu @chenhaotan.bsky.social @Peter West for making this project happen!

26.09.2025 18:48 👍 1 🔁 0 💬 0 📌 0

hrsimulator.communicationgames.ai

We’re serious! Economic coordination happens via emails. How do humans fare against AIs in getting things done with words?

We see a genre co-emerging with LLMs: communication games, where communication is crucial and not just “cheap talk” like Mafia or Diplomacy.

26.09.2025 18:45 👍 2 🔁 0 💬 1 📌 0

HR Simulator™: a game where you gaslight, deflect, and “let’s circle back” your way to victory.
Every email a boss fight, every “per my last message” a critical hit… or maybe you just overplayed your hand 🫠
Can you earn Enlightened Bureaucrat status?

(link below!)

26.09.2025 18:41 👍 4 🔁 5 💬 2 📌 3

Prompting is our most successful tool for exploring LLMs, but the term evokes eye-rolls and grimaces from scientists. Why? Because prompting as scientific inquiry has become conflated with prompt engineering.

This is holding us back. 🧵 and new paper with @ari-holtzman.bsky.social.

09.07.2025 20:07 👍 37 🔁 15 💬 2 📌 0

When you walk into the ER, you could get a doc:
1. Fresh from a week of not working
2. Tired from working too many shifts

@oziadias.bsky.social has been both and thinks that they're different! But can you tell from their notes? Yes we can! Paper @natcomms.nature.com www.nature.com/articles/s41...

02.07.2025 19:22 👍 26 🔁 11 💬 1 📌 0

@chachachen.bsky.social @haokunliu.bsky.social @divingwithorcas.bsky.social present posters on human-AI decision making, hypothesis generation, interpretability and fairness at MMLS 2025!

24.06.2025 20:07 👍 6 🔁 3 💬 0 📌 0

Since @elenal3ai.bsky.social could not make it, I presented the poster on concept incongruence: arxiv.org/abs/2505.14905

23.06.2025 19:18 👍 7 🔁 2 💬 0 📌 0

🚨 New paper alert 🚨

Ever asked an LLM-as-Marilyn Monroe who the US president was in 2000? 🤔 Should the LLM answer at all? We call these clashes Concept Incongruence. Read on! ⬇️

1/n 🧵

27.05.2025 13:59 👍 28 🔁 17 💬 1 📌 1

1/n 🚀🚀🚀 Thrilled to share our latest work🔥: HypoEval - Hypothesis-Guided Evaluation for Natural Language Generation! 🧠💬📊
There’s a lot of excitement around using LLMs for automated evaluation, but many methods fall short on alignment or explainability — let’s dive in! 🌊

12.05.2025 19:23 👍 22 🔁 7 💬 1 📌 1

🧑‍⚖️How well can LLMs summarize complex legal documents? And can we use LLMs to evaluate?

Excited to be in Albuquerque presenting our paper this afternoon at @naaclmeeting 2025!

01.05.2025 19:25 👍 23 🔁 13 💬 2 📌 0

🚀🚀🚀Excited to share our latest work: HypoBench, a systematic benchmark for evaluating LLM-based hypothesis generation methods!

There is much excitement about leveraging LLMs for scientific hypothesis generation, but principled evaluations are missing - let’s dive into HypoBench together.

28.04.2025 19:35 👍 11 🔁 9 💬 1 📌 0

The Midwest Machine Learning Symposium will take place on June 23-24 on the University of Chicago campus in Chicago (midwest-ml.org/2025/). We have an amazing lineup of speakers: @profsanjeevarora.bsky.social from Princeton, Heng Ji from UIUC, Tuomas Sandholm from CMU, and @ravenben.bsky.social from UChicago.

21.04.2025 15:12 👍 3 🔁 4 💬 0 📌 3

Encourage your students to submit posters and register! Limited free housing is available for student participants only, by request, on a first-come, first-served basis.

We are also actively looking for sponsors. Reach out if you are interested!

Please repost! Help spread the word!

21.04.2025 15:12 👍 10 🔁 10 💬 2 📌 0

GitHub - ChicagoHAI/llm-prediction-bias

12/n

Big thanks to @chenhaotan.bsky.social for advice on the project, as well as helpful feedback from the wonderful members of the @chicagohai.bsky.social lab! Check out our code at github.com/ChicagoHAI/l....

DM me for any questions!

14.04.2025 20:13 👍 3 🔁 0 💬 0 📌 0

11/n

Strangely, changing the prompt can change how a model represents race. In some cases, then, the model’s representation may be sensitive to spurious prompt features, which poses a challenge to the generalizability of debiasing methods. Future work on debiasing should take this into account.

14.04.2025 20:12 👍 2 🔁 0 💬 1 📌 0

10/n

We found the race subspace generalizes cross-family (from admissions to hiring) and, to a lesser extent, cross-explicitness (from implicit race via name to explicit race), but it fails to generalize cross-prompt (from one prompt template to another).

14.04.2025 20:11 👍 1 🔁 0 💬 1 📌 0

9/n

So we were able to debias via interventions on the race subspaces, but do they generalize? Here, the story gets more complicated.

14.04.2025 20:10 👍 1 🔁 0 💬 1 📌 0

8/n

Race Averaging can reduce Gemma’s bias by 37-57% in admissions and hiring. Projecting out the race subspace is similarly effective.

We find more mixed results for LLaMA, where our methods reduce the bias by 33% in admissions, but fail to work in hiring.

14.04.2025 20:10 👍 1 🔁 0 💬 1 📌 0

7/n

With the race subspaces, we debias models’ decisions in two ways:
1. Race Averaging: we average the subspace representation across different races (see illustration).
2. Race Projection: we project out the race subspace altogether.

14.04.2025 20:08 👍 0 🔁 0 💬 1 📌 0
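
To make the two interventions in 7/n concrete, here is a rough linear-algebra sketch. It is not the paper's code. It assumes the race subspace has already been found (for example via Distributed Alignment Search) and is given as an orthonormal basis, and that hidden states are stored as row vectors. The function names, the shapes, and the use of per-race mean hidden states as the inputs to the average are all hypothetical choices made for illustration.

```python
import numpy as np


def project_out_subspace(hidden, basis):
    """Race Projection (sketch): remove the part of each hidden state
    that lies in the subspace spanned by the orthonormal columns of `basis`.

    hidden: (n, d) array of hidden states, one per row.
    basis:  (d, k) array with orthonormal columns (assumed given).
    """
    coords = hidden @ basis            # (n, k) coordinates inside the subspace
    return hidden - coords @ basis.T   # subtract the in-subspace component


def average_over_races(hidden, basis, race_means):
    """Race Averaging (sketch): replace each hidden state's in-subspace
    component with the average of the per-race subspace coordinates.

    race_means: (r, d) array, one representative hidden state per race
                (a hypothetical input; the thread does not specify this).
    """
    avg_coords = (race_means @ basis).mean(axis=0)   # (k,) averaged coordinates
    # Keep everything outside the subspace, then write in the averaged component.
    return project_out_subspace(hidden, basis) + avg_coords @ basis.T


# Toy usage with random data; dimensions are arbitrary.
rng = np.random.default_rng(0)
d, k = 16, 3
basis, _ = np.linalg.qr(rng.normal(size=(d, k)))  # orthonormal (d, k) basis
hidden = rng.normal(size=(5, d))
race_means = rng.normal(size=(4, d))

projected = project_out_subspace(hidden, basis)
averaged = average_over_races(hidden, basis, race_means)
```

In this sketch, both interventions leave the component of the hidden state outside the subspace untouched. They differ only in what is written back into the subspace: nothing for projection, a race-independent average for averaging.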

6/n

Turning away from prompt engineering, we used Distributed Alignment Search to find subspaces in model representations that encode an applicant’s race.

We found strong race representation at the last prompt token, layers 10-12 for Gemma, and layers 24-26 for LLaMA.

14.04.2025 20:07 👍 1 🔁 0 💬 1 📌 0

5/n

Despite LLMs’ instruction-following ability, we found that multiple prompting strategies all fail to promote fairness. Prompts either fail to reduce our Bias Score metric, or drastically alter the average acceptance rate.

14.04.2025 20:04 👍 1 🔁 0 💬 1 📌 0
