ConvAI had a great NeurIPS season with four papers accepted to the main conference! Find all the authors in San Diego this December.
Thrilled to announce our new survey that explores the exciting possibilities and troubling risks of computational persuasion in the era of LLMs 🤔💬
arXiv: arxiv.org/pdf/2505.07775
💻 GitHub: github.com/beyzabozdag/...
We won a Senior Area Chair Award at NAACL!! Many thanks again to my amazing coauthors Gaurav Kamath and @sivareddyg.bsky.social :-)
Congratulations to Mila members @adadtur.bsky.social, Gaurav Kamath and @sivareddyg.bsky.social for their SAC award at NAACL! Check out Ada's talk in Session I: Oral/Poster 6. Paper: arxiv.org/abs/2502.05670
Super excited that this is finally out! We evaluated leading LLM-based web agents from OpenAI, Anthropic, and more on our new benchmark SafeArena, and found that many are surprisingly compliant with malicious requests. Check out the leaderboard here: huggingface.co/spaces/McGil...
This work was done by an awesome team of authors: @adadtur.bsky.social, Nick, @arkil.bsky.social, @karstanczak.bsky.social, Esin, @spandanagella.bsky.social, and @sivareddyg.bsky.social.
It's also important to recognize the incredible prior works that helped us build SafeArena:
Agents like OpenAI Operator can solve complex computer tasks, but what happens when people use them to cause harm, e.g. to spread misinformation?
To find out, we introduce SafeArena (safearena.github.io), a benchmark to assess the capabilities of web agents to complete harmful web tasks. A thread.
While persuasive models are promising for social good, they can also be misused for harmful behavior. Recent work by @beyzabozdag.bsky.social and @shuhaib.bsky.social aims to assess LLM persuasiveness and susceptibility to persuasion.
[1/6] Can LLMs out-persuade each other? 🤔🧠💬
Introducing Persuade Me If You Can (PMIYC), a new framework to evaluate (1) how persuasive LLMs are and (2) how easily they can be persuaded!
arXiv: arxiv.org/abs/2503.01829
Project page: beyzabozdag.github.io/PMIYC/
Overview figure for the paper, showing creation of constituent-movement data and a three-step experimental pipeline: "Model Shifting Preference", "Motivating Factors of Model Preference", "Human-Model Preference Correlation"
Super excited to finally announce our NAACL 2025 main conference paper "Language Models Largely Exhibit Human-like Constituent Ordering Preferences"!
We examine constituent ordering preferences in humans and LLMs; we present two main findings… 🧵
Very excited about my new paper!
NN-CIFT slashes data valuation costs by 99% using tiny neural nets (205k parameters, just 0.0027% the size of an 8B LLM) while maintaining top-tier performance!
The secret sauce for this work is the ReAct-style training data preparation: "User-Thought1-Action/API-Observation-Thought2-Response". We transformed public dialogue datasets into this format for training. Congratulations to @emrecanacikgoz and the @convai_uiuc and Oumi teams!
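For illustration only, here is a minimal sketch of what serializing one dialogue turn into that ReAct-style schema could look like. The function and field names are hypothetical assumptions for this sketch, not the team's actual pipeline:

```python
# Hypothetical sketch: serialize one dialogue turn into the ReAct-style
# "User-Thought1-Action/API-Observation-Thought2-Response" format described
# in the post. All names here are illustrative, not the authors' code.

def to_react_example(user, thought1, api_call, observation, thought2, response):
    """Flatten one annotated turn into a single training string."""
    return "\n".join([
        f"User: {user}",
        f"Thought: {thought1}",
        f"Action: {api_call}",
        f"Observation: {observation}",
        f"Thought: {thought2}",
        f"Response: {response}",
    ])

example = to_react_example(
    user="Book a table for two at 7pm.",
    thought1="I should call the reservation API.",
    api_call="reserve(party_size=2, time='19:00')",
    observation="{'status': 'confirmed'}",
    thought2="The booking succeeded; confirm with the user.",
    response="Your table for two at 7pm is confirmed!",
)
print(example)
```

Each public dialogue would be mapped turn-by-turn into strings like this before fine-tuning.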
Instruction data can also be synthesized using feedback based on reference examples. Please check our recent work for more information. Thanks to @shuhaib.bsky.social, Xiusi Chen, and Heng Ji!
💡 Introducing Reference-Level Feedback: A new paradigm for using feedback to improve synthetic data!
shuhaibm.github.io/refed/
🧵 [1/n]
AI over-reliance is an important issue for conversational agents. Our work, supported mainly by the DARPA FACT program, proposes introducing positive friction to encourage users to think critically when making decisions. Great teamwork, all!
@convai-uiuc.bsky.social @gokhantur.bsky.social
‼️ Ever wish LLMs would just... slow down for a second?
In our latest work, "Better Slow than Sorry: Introducing Positive Friction for Reliable Dialogue Systems", we delve into how strategic delays can enhance dialogue systems.
Paper Website: merterm.github.io/positive-fri...
Seeing this year's ACL Fellows is like walking through the hallways of Microsoft Research, Building 99 in 2016. Congratulations to @dilekh.bsky.social, Scott Yih, Jianfeng Gao, and Lucy Vanderwende!
Nice overview of the ReSpAct framework for conversational task-completion agents @convai-uiuc.bsky.social
cobusgreyling.medium.com/building-con...
@chrupala.me please add me to the SLP pack