π¨New Preprint π¨
Current agent evals mostly measure competence, but miss behavior e.g. are their decisions stable, rational, manipulable, human-like?
We introduce ABxLAB, a framework for studying agent behavior. Using it we create an agentic consumer behavior benchmark.
π§΅1/9
23.10.2025 18:16
π 1
π 1
π¬ 1
π 1
Excited to share ABxLab, our new open-source framework that lets you run controlled experiments on AI agents in real web environments! We used it to study purchasing behavior, an emerging application for agents, with pretty wild results.
23.10.2025 18:44
π 2
π 0
π¬ 0
π 0
We are recruiting two postdoctoral scholars for a research project in human collective intelligence and creativity at UC Davis and Cornell. Joint project with @enfascination.com, @norijacoby.bsky.social, @oferon.bsky.social & Dalton Conley. Please forward this thread to relevant people. 1/n
29.04.2025 02:46
π 22
π 24
π¬ 2
π 2
β¨Contrastive Learning from Synthetic Audio DoppelgΓ€ngers #ICLR2025β¨ w/
@nikhilsinghmus.bsky.social
Our method learns useful audio representations with randomly synthesized sounds (often better than real data!)
πProject: doppelgangers.media.mit.edu
πPaper: arxiv.org/abs/2406.05923
π§΅1/3
12.03.2025 20:25
π 4
π 1
π¬ 1
π 0
Paper title: Superficial Alignment, Subtle Divergence, and Nudge Sensitivity in LLM Decision-Making; Authors: Manuel Cherep*, Nikhil Singh*, and Pattie Maes
Excited to present our new paper on nudging LLMs (ππ€) as a spotlight talk at the NeurIPS Behavioral ML Workshop! @neuripsconf.bsky.social
w/ Nikhil Singh* (@nikhilsinghmus.bsky.social) and Pattie Maes
π openreview.net/forum?id=chb...
π§΅ 1/3
26.11.2024 23:07
π 5
π 2
π¬ 1
π 0
I'm recruiting a PhD student to join the Human AI Collaboration lab at Kellogg, NU CS, and @nicoatnu.bsky.social
If you're excited about computational social science, LLMs, digital experiments, real-world problem solving, this could be a great fit
Please reshare!
Deets π
19.11.2024 21:28
π 26
π 22
π¬ 1
π 0