Trending

#Robustness

Latest posts tagged with #Robustness on Bluesky

Latest Top
Trending

Posts tagged #Robustness

Preview
2 PhD Positions on Learning Causally Grounded Concepts for Safe AI Are you interested in improving the interpretability, robustness and safety of AI by integrating causal reasoning? The Causality team in the AMLab group at the University of Amsterdam is looking for 2...

🚨2 PhD positions with me @amlab.bsky.social on learning causally grounded concepts 🚨

Are you interested in improving the #interpretability #robustness and #safety of AI by integrating #causal reasoning? Join us in beautiful Amsterdam 🇳🇱🌷🚲

Deadline: 20 April

www.academictransfer.com/en/jobs/3593...

9 3 0 0
Course au Large 2030 - Accueil

6/6 Tomorrow, teams like Banque Populaire or SVR will be judged on their ability to sail under constraints. Sponsors are becoming transition partners. The 2030 Gold Fleet will be those who mastered their footprint before their final layline. 🏆 #OceanRacing #SailingBusiness #Robustness #Sponsorship

0 0 0 0
Post image

#Resolve #Resillience #Robustness #ReadItSomewhere

0 0 0 0

The Geometry of Algorithmic Stability: A Hodge Theoretic View on Structural vs. Statistical Insta...

Karen Sargsyan

Action editor: Alberto Bietti

https://openreview.net/forum?id=rFqsgVXZYO

#robustness #stability #instability

1 0 0 0

Adversarial Vulnerability from On-Manifold Inseparability and Poor Off-Manifold Convergence

Rajdeep Haldar, Yue Xing, Qifan Song, Guang Lin

Action editor: Olivier Cappé

https://openreview.net/forum?id=pa90uRZATF

#adversarial #robustness #classification

0 0 0 0

End-to-End Conformal Calibration for Optimization Under Uncertainty

Christopher Yeh, Nicolas Christianson, Alan Wu, Adam Wierman, Yisong Yue

Action editor: Jake C. Snell

https://openreview.net/forum?id=yM8qkT0f9H

#optimization #robustness #optimize

1 0 0 0
Preview
BIRD: Behavior Induction via Representation-structure Distillation Human-aligned deep learning models exhibit behaviors consistent with human values, such as robustness, fairness, and honesty. Transferring these behavioral properties to models trained on different ta...

Most transfer learning assumes shared data, tasks, or domains.

BIRD shows you can transfer behavior itself even when those assumptions break.

All details here:
arxiv.org/abs/2505.23933

#KnowledgeDistillation #Robustness #MachineLearning #AIResearch #ResponsibleAI

0 0 0 0
Two-panel schematic illustrating the BIRD framework. Left panel shows independent pre-training of a teacher and a student network on different datasets, each optimized with its own task loss. Right panel shows representation-structure distillation: selected intermediate layers from teacher and student are compared via a representation loss, which aligns the geometry of their internal activations while the student is still trained on its own task loss. A snowflake icon indicates the teacher is frozen. The diagram emphasizes that behavior is transferred by aligning internal representation structure rather than outputs or shared data.

Two-panel schematic illustrating the BIRD framework. Left panel shows independent pre-training of a teacher and a student network on different datasets, each optimized with its own task loss. Right panel shows representation-structure distillation: selected intermediate layers from teacher and student are compared via a representation loss, which aligns the geometry of their internal activations while the student is still trained on its own task loss. A snowflake icon indicates the teacher is frozen. The diagram emphasizes that behavior is transferred by aligning internal representation structure rather than outputs or shared data.

We introduce BIRD: Behavior Induction via Representation-structure Distillation.

Instead of transferring outputs, BIRD aligns the geometry of internal representations between teacher and student, enabling weak → strong generalization.

#KnowledgeDistillation #TransferLearning #Robustness

0 0 1 0

Robust Reinforcement Learning in a Sample-Efficient Setting

Siemen Herremans, Ali Anwar, Siegfried Mercelis

Action editor: Marcello Restelli

https://openreview.net/forum?id=iij6nLYLjF

#reinforcement #robustness #robust

0 0 0 0

Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes

David Mark Bossens, Atsushi Nitanda

Action editor: Alberto Maria Metelli

https://openreview.net/forum?id=tmfdqtFUqO

#adversarial #robustness #optimise

0 0 0 0

Consistency Aware Robust Learning under Noisy Labels

Fahad Sarfraz, Bahram Zonooz, Elahe Arani

Action editor: Yu Yao

https://openreview.net/forum?id=pZulfLkARr

#robust #consistency #robustness

0 0 0 0
Digital Red Queen: Adversarial Program Evolution in Core War with LLMs A self-play algorithm that uses LLMs to evolve adversarially competing programs in Core War

When LLMs masturbate in #DRQ safe spaces, they maintain fitness pub.sakana.ai/drq/ #Sandbox #CoreWar #DigitalRedQueen #Generalists #ObjectiveShifting #Adaptation #Evolution #Robustness

0 0 0 0

Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

PANKAJ KUMAR, Subhankar Mishra

Action editor: Aditya Menon

https://openreview.net/forum?id=Bchvaaod6g

#robustness #nlp #adversarial

0 0 0 0
Preview
Stories About Control: Why Optimisation So Often Produces Fragility – Traders Outpost “What looked like control was only stability borrowed from the past.” The feeling of control is not the same thing as control. The traders most vulnerable to catastrophic failure are rarely naive. …

Stories About Control: Why Optimisation So Often Produces Fragility

👉 Read here:
atstradingsolutions.com/stories-abou...

#StoriesAboutControl #Robustness #Fragility #ProcessOverPrediction

0 0 0 0

Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified?

Olawale Elijah Salaudeen, Nicole Chiou, Shiny Weng, Sanmi Koyejo

Action editor: Ozan Sener

https://openreview.net/forum?id=fNywRyqPQo

#robustness #generalization #benchmarks

0 0 0 0

Rethinking Robustness in Machine Learning: A Posterior Agreement Approach

João B. S. Carvalho, Víctor Jiménez Rodríguez, Alessandro Torcinovich et al.

Action editor: Mohammad Emtiyaz Khan

https://openreview.net/forum?id=Bpc9uZ6kcg

#robustness #adversarial #generalization

0 0 0 0

New #Featured Certification, #Reproducibility Certification, #J2C Certification:

Robust Reinforcement Learning in a Sample-Efficient Setting

Siemen Herremans, Ali Anwar, Siegfried Mercelis

https://openreview.net/forum?id=iij6nLYLjF

#reinforcement #robustness #robust

0 0 0 0

New #J2C Certification:

Mirror Descent Policy Optimisation for Robust Constrained Markov Decision Processes

David Mark Bossens, Atsushi Nitanda

https://openreview.net/forum?id=tmfdqtFUqO

#adversarial #robustness #optimise

0 0 0 0
Post image

The most impressive feature of our microbial #consortium is its #robustness. It can efficiently process #mixed #plastic #waste with #fluctuating plastic #compositions, maintaining its capabilities and population balance for 21 days. It's crucial for applications, where compositions of waste vary.

0 0 0 0
Carbon Performance for Banks: methodology note v1.0, December 2026

Photo of Manhattan skyline

Carbon Performance for Banks: methodology note v1.0, December 2026 Photo of Manhattan skyline

Our 𝗖𝗮𝗿𝗯𝗼𝗻 𝗣𝗲𝗿𝗳𝗼𝗿𝗺𝗮𝗻𝗰𝗲 𝗳𝗼𝗿 𝗕𝗮𝗻𝗸𝘀 assessments are guided by key design principles of #transparency, #accountability and #robustness, essential for ensuring the #credibility of the Centre’s assessment process.

Check out the methodology note: www.transitionpathwayinitiative.org/publications...

3 0 0 0
Post image

I’ve been testing a prompt-level operator that acts like a soft control layer for #LLMs.

It produces a 7.4× contraction in behavioural manifolds and suppresses adversarial drift in repeated generations.

Methods + metrics👉 zenodo.org/records/1771...

#AI #PromptEngineering #Robustness #AIEvaluation

2 0 0 0
Post image

“The #robustness of people is really staggering.” - Ilya #Sutskever - Safe #Superintelligence

Understanding what Sutskever means by robustness requires examining not just human capabilities but the specific ways in which #AI systems are fragile by comparison... - https://with.ga/fjaz5
#quote

1 0 0 0
Post image

“The #robustness of people is really staggering.” - Ilya #Sutskever - Safe #Superintelligence

Understanding what Sutskever means by robustness requires examining not just human capabilities but the specific ways in which #AI systems are fragile by comparison... - https://with.ga/fjaz5
#quote

1 0 0 0

Certified Robustness to Data Poisoning in Gradient-Based Training

Philip Sosnin, Mark Niklas Mueller, Maximilian Baader, Calvin Tsay, Matthew Robert Wicker

Action editor: Chuan Guo

https://openreview.net/forum?id=9WHifn9ZVX

#robustness #backdoor #attacks

0 0 0 0

Latent Adversarial Training Improves Robustness to Persistent Harmful Behaviors in LLMs

Abhay Sheshadri, Aidan Ewart, Phillip Huang Guo et al.

Action editor: Daphne Ippolito

https://openreview.net/forum?id=6LxMeRlkWl

#adversarial #adversary #robustness

0 0 0 0

AlignFix: Fixing Adversarial Perturbations by Agreement Checking for Adversarial Robustness again...

Ashutosh Kumar Nirala, Jin Tian, Olukorede Fakorede, Modeste Atsague

Action editor: Pin-Yu Chen

https://openreview.net/forum?id=XgK05fssnx

#adversarial #adversarially #robustness

0 0 0 0
Preview
Launching the AI Model Arena The Defence AI Centre has worked with industry to develop a new tool that will help redefine how Defence evaluates and procures AI technologies.

FYI - the Defence AI Centre is launching the AI Model Arena to help redefine how Defence evaluates and procures artificial intelligence technologies ... www.gov.uk/government/n...
#DAIC #Defence #MOD #AI #AIModelArena #JSP936 #performance #reliability #robustness #security

1 0 0 0

Set-Based Training for Neural Network Verification

Lukas Koller, Tobias Ladner, Matthias Althoff

Action editor: Kuldeep S. Meel

https://openreview.net/forum?id=n0lzHrAWIA

#adversarial #robustness #robust

0 0 0 0

New #J2C Certification:

Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified?

Olawale Elijah Salaudeen, Nicole Chiou, Shiny Weng, Sanmi Koyejo

https://openreview.net/forum?id=fNywRyqPQo

#robustness #generalization #benchmarks

1 0 0 0

Model Tampering Attacks Enable More Rigorous Evaluations of LLM Capabilities

Zora Che, Stephen Casper, Robert Kirk et al.

Action editor: Chuan Sheng Foo

https://openreview.net/forum?id=E60YbLnQd2

#tampering #robustness #attacks

0 0 0 0