Trending
Asher Zheng's Avatar

Asher Zheng

@asher-zheng

PhD @ UT Linguistics Semantics/Pragmatics/NLP https://asherz720.github.io/ Prev.@UoEdinburgh @Hanyang

163
Followers
171
Following
15
Posts
22.11.2024
Joined
Posts Following

Latest posts by Asher Zheng @asher-zheng

Thank you venkat!

30.01.2026 16:46 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0

Thank you kanishka!

30.01.2026 15:14 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Strategic Dialogue Assessment: The Crooked Path to Innocence Language is often used strategically, particularly in high-stakes, adversarial settings, yet most work on pragmatics and LLMs centers on cooperative settings. This leaves a gap in the systematic under...

If you are interested in legal language, you should also check it out, as we focus on real courtroom cross-examination dialogues.

Here is the paper website: asherz720.github.io/SDA/

30.01.2026 03:02 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

We explore how strategic effectiveness can be quantified by a bunch of discourse properties and evaluate a suite of LLMs in terms of how they understand such strategic dialogues under adversarial settings.

30.01.2026 03:02 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

Very excited to share that the paper w/
@jessyjli.bsky.social @DavidBeaver
"Strategic Dialogue Assessment: The Crooked Path to Innocence" (used to have the name COBRA) was accepted by Dialogue and Discourse Vol 17 No.1. Check it out! πŸ‘‰https://journals.uic.edu/ojs/index.php/dad/article/view/14503

30.01.2026 03:02 πŸ‘ 6 πŸ” 3 πŸ’¬ 2 πŸ“Œ 1
Preview
CoBRA: Quantifying Strategic Language Use and LLM Pragmatics Language is often used strategically, particularly in high-stakes, adversarial settings, yet most work on pragmatics and LLMs centers on cooperativity. This leaves a gap in systematic understanding of...

Current LLMs are not able to jailbreak cooperative principles and still show limited understanding of strategic language. We believe this work lays foundations for sophisticated strategic reasoning and safety monitoring in downstream tasks.
πŸ“„: arxiv.org/abs/2506.01195

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0
Post image Post image

By analyzing model reasoning, we find extra reasoning introduces overcomplication (img left), misunderstanding, and internal inconsistency (img right). This shows the current LLMs still lack sophisticated pragmatic understanding in many ways.

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

We evaluate a range of LLMs in terms of how good they are at perceiving strategic language. We show models struggle with our metrics while showing an overall good understanding of Gricean principles. Model size tends to have a positive effect, while reasoning does not help.

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image

(2) BaT and PaT are valid terms that reflect strategic gains/losses, which can to some extent predict conversational outcomes. In addition, our metrics are more objective. When conditioned on cases where the outcome is made based on logical arguments, the predictive power rises.

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Post image

We also introduce CHARM, an annotated dataset of real legal cross-examination dialogues. By applying our framework, we show (1) (non-)cooperative discourse are distinct over the identified properties (img left), and BaT and PaT show such a distributional distinction (img right).

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Post image

Based on the components above, we introduce three metricsβ€”Benefit at Turn (BaT), Penalty at Turn (PaT), and Normalized Relative Benefit at Turn (NRBaT)β€”to measure the strategic gains, losses, and cumulative benefits at a turn.

03.06.2025 11:56 πŸ‘ 2 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

For example, one witness can make a commitment that leads to a win for her but violates the maxim of manner to make her less liable to the commitment. The commitment itself is beneficial, but the gains are reduced due to vagueness.

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0

We derive non-cooperativity from both Gricean and game-theoretic pragmatics. In our framework, a strategic move is evaluated based on two components: the commitment it expresses (base value) and the violation of maxims to maintain consistency (penalties/compensations).

03.06.2025 11:56 πŸ‘ 0 πŸ” 0 πŸ’¬ 1 πŸ“Œ 0
Post image Post image

Language is often strategic, but LLMs tend to play nice. How strategic are they really? Probing into that is key for future safety alignment.

πŸ‘‰Introducing CoBRA🐍, a framework that assesses strategic language.

Work with my amazing advisors @jessyjli.bsky.social and @David I. Beaver!

03.06.2025 11:56 πŸ‘ 11 πŸ” 3 πŸ’¬ 1 πŸ“Œ 1
Post image Post image

Have that eerie feeling of dΓ©jΓ  vu when reading model-generated text πŸ‘€, but can’t pinpoint the specific words or phrases πŸ‘€?

✨We introduce QUDsim, to quantify discourse similarities beyond lexical, syntactic, and content overlap.

21.04.2025 21:29 πŸ‘ 21 πŸ” 9 πŸ’¬ 2 πŸ“Œ 3

πŸ™‹β€β™‚οΈ

25.11.2024 14:39 πŸ‘ 0 πŸ” 0 πŸ’¬ 0 πŸ“Œ 0