๐ ๏ธ ๐ข๐ฟ๐ด๐ฎ๐ป๐ถ๐๐ฒ๐ฟ๐: @egorzverev.bsky.social, @aideenfay.bsky.social, myself, Mario Fritz, @thegruel.bsky.social
Looking forward to interesting discussions in Copenhagen!
#EurIPS2025 #LLMSafety #LLMSecurity #AIResearch #ELLIS #AISafety #EurIPS
09.10.2025 14:16
๐ 2
๐ 0
๐ฌ 0
๐ 0
We're hosting a poster session at the UnConference
๐ ๐ช๐ต๐ ๐ฃ๐ฟ๐ฒ๐๐ฒ๐ป๐?
- Connect with researchers working on LLM Safety and Security
- Share insights from your recently published research
- Get feedback and fresh perspectives
- Find new collaborators among participants
09.10.2025 14:16
๐ 1
๐ 0
๐ฌ 1
๐ 0
๐ข ๐๐ฎ๐น๐น ๐ณ๐ผ๐ฟ ๐ฃ๐ผ๐๐๐ฒ๐ฟ๐: ๐๐๐ ๐ฆ๐ฎ๐ณ๐ฒ๐๐ ๐ฎ๐ป๐ฑ ๐ฆ๐ฒ๐ฐ๐๐ฟ๐ถ๐๐ ๐ช๐ผ๐ฟ๐ธ๐๐ต๐ผ๐ฝ @ ๐๐๐๐๐ฆ ๐จ๐ป๐๐ผ๐ป๐ณ๐ฒ๐ฟ๐ฒ๐ป๐ฐ๐ฒ
๐
December 2, 2025
๐ Copenhagen
An opportunity to discuss your work with colleagues working on similar problems in LLM safety and security
09.10.2025 14:16
๐ 2
๐ 2
๐ฌ 1
๐ 0
(1/n) In our #ICLR2025 paper, we explore a fundamental issue that enables prompt injections: ๐๐๐๐ฌโ ๐ข๐ง๐๐๐ข๐ฅ๐ข๐ญ๐ฒ ๐ญ๐จ ๐ฌ๐๐ฉ๐๐ซ๐๐ญ๐ ๐ข๐ง๐ฌ๐ญ๐ซ๐ฎ๐๐ญ๐ข๐จ๐ง๐ฌ ๐๐ซ๐จ๐ฆ ๐๐๐ญ๐ ๐ข๐ง ๐ญ๐ก๐๐ข๐ซ ๐ข๐ง๐ฉ๐ฎ๐ญ.
โ
Definition of separation
๐ SEP Benchmark
๐ LLM evals on SEP
18.03.2025 14:46
๐ 18
๐ 4
๐ฌ 2
๐ 2