#adversarialattacks

1 month ago

New research shows how to fool CLIP‑style vision‑language models with fresh adversarial tricks. Could this expose hidden AI security gaps? Dive into the latest evasion techniques and what they mean for multimodal ML. #AdversarialAttacks #VisionLanguage #AIsecurity

🔗 aidailypost.com/news/researc...

0 0 0 0

Chema Alonso

@chemaalonso.com

1 month ago

Prompt Injection con Advesarial Preprocesing Attacks en Imágenes usando Anamorpher Blog personal de Chema Alonso ( https://MyPublicInbox.com/ChemaAlonso ): Ciberseguridad, IA, Innovación, Tecnología, Cómics & Cosas Personasles.

El lado del mal - Prompt Injection con Advesarial Preprocesing Attacks en Imágenes usando Anamorpher elladodelmal.com/2026/01/prom... #PromptInjection #Gemini #AdversarialAttacks #ImageScaling #IA #AI #Hacking #Pentesting

0 0 0 0

HackerNoon

@hackernoon.com

3 months ago

Adversarial Attacks on Large Language Models and Defense Mechanisms

Comprehensive guide to LLM security threats and defenses. Learn how attackers exploit AI models and practical strategies to protect against adversarial attacks. #adversarialattacks

0 0 0 0

Annual Computer Security Applications Conference

@acsacconf.bsky.social

4 months ago

Zhang et al.'s "CIGA: Detecting Adversarial Samples via Critical Inference Graph Analysis"

Following that was Zhang et al.'s "CIGA: Detecting Adversarial Samples via Critical Inference Graph Analysis," which explores how different layer connections help identify adversarial samples effectively. (www.acsac.org/2024/p...) 4/6
#ML #AdversarialAttacks #CyberSecurity

1 0 1 0

CySecurity News

@cysecuritynews.bsky.social

4 months ago

The Hidden Risk Behind 250 Documents and AI Corruption #Adversarialattacks #AIgovernance #AIRiskManagement

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Universal Adversarial Attacks Threaten Robot Learning Algorithms

Researchers warn that universal adversarial attacks could compromise robot learning algorithms, potentially destabilizing autonomous systems, according to the researchers. getnews.me/universal-adversarial-at... #adversarialattacks #robotics

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Data‑Space Attacks Transfer While Representation‑Space Attacks Do Not

Data‑space attacks transfer across models; representation‑space attacks only do so when models have latent geometry, shown in image, language and vision‑language experiments. getnews.me/data-space-attacks-trans... #adversarialattacks #dataspace

0 0 0 0

GetNews.me

@getnews-me.bsky.social

5 months ago

Survey of Transferable Adversarial Image Attacks and Defenses

The survey of 23 transferable attacks against 11 defenses found DiffPure still vulnerable to black‑box attacks, while the older Diversity Input method matches newer variants. Read more: getnews.me/survey-of-transferable-a... #adversarialattacks #diffpure

0 0 0 0

CySecurity News

@cysecuritynews.bsky.social

6 months ago

How Image Resizing Could Expose AI Systems to Attacks Security experts have identified a new kind of cyber attack that hides instructions inside ordinary pictures. These commands do not appear in the full image but become visible only when the photo is automatically resized by artificial intelligence (AI) systems. The attack works by adjusting specific pixels in a large picture. To the human eye, the image looks normal. But once an AI platform scales it down, those tiny adjustments blend together into readable text. If the system interprets that text as a command, it may carry out harmful actions without the user’s consent. Researchers tested this method on several AI tools, including interfaces that connect with services like calendars and emails. In one demonstration, a seemingly harmless image was uploaded to an AI command-line tool. Because the tool automatically approved external requests, the hidden message forced it to send calendar data to an attacker’s email account. The root of the problem lies in how computers shrink images. When reducing a picture, algorithms merge many pixels into fewer ones. Popular methods include nearest neighbor, bilinear, and bicubic interpolation. Each creates different patterns when compressing images. Attackers can take advantage of these predictable patterns by designing images that reveal commands only after scaling. To prove this, the researchers released Anamorpher, an open-source tool that generates such images. The tool can tailor pictures for different scaling methods and software libraries like TensorFlow, OpenCV, PyTorch, or Pillow. By hiding adjustments in dark parts of an image, attackers can make subtle brightness shifts that only show up when downscaled, turning backgrounds into letters or symbols. Mobile phones and edge devices are at particular risk. These systems often force images into fixed sizes and rely on compression to save processing power. That makes them more likely to expose hidden content. The researchers also built a way to identify which scaling method a system uses. They uploaded test images with patterns like checkerboards, circles, and stripes. The artifacts such as blurring, ringing, or color shifts revealed which algorithm was at play. This discovery also connects to core ideas in signal processing, particularly the Nyquist-Shannon sampling theorem. When data is compressed below a certain threshold, distortions called aliasing appear. Attackers use this effect to create new patterns that were not visible in the original photo. According to the researchers, simply switching scaling methods is not a fix. Instead, they suggest avoiding automatic resizing altogether by setting strict upload limits. Where resizing is necessary, platforms should show users a preview of what the AI system will actually process. They also advise requiring explicit user confirmation before any text detected inside an image can trigger sensitive operations. This new attack builds on past research into adversarial images and prompt injection. While earlier studies focused on fooling image-recognition models, today’s risks are greater because modern AI systems are connected to real-world tools and services. Without stronger safeguards, even an innocent-looking photo could become a gateway for data theft.

How Image Resizing Could Expose AI Systems to Attacks #Adversarialattacks #AItools #algorithms

1 0 0 0

CySecurity News

@cysecuritynews.bsky.social

6 months ago

AI Agents and the Rise of the One-Person Unicorn #Accesscontrol #Adversarialattacks #agenticAI

0 0 0 0

Annual Computer Security Applications Conference

@acsacconf.bsky.social

7 months ago

Shin et al.'s "You Only Perturb Once: Bypassing (Robust) Ad-Blockers Using Universal Adversarial Perturbations"

Thereafter came Shin et al.'s "You Only Perturb Once: Bypassing (Robust) Ad-Blockers Using Universal Adversarial Perturbations", revealing vulnerabilities in ATS models to universal adversarial attacks. (www.acsac.org/2024/p...) 5/6
#Privacy #AdversarialAttacks #WebSecurity

0 0 1 0

Chema Alonso

@chemaalonso.com

7 months ago

LightShed versus NightShade & Glaze: La guerra del copyright que envenena imágenes contra la GenAI Blog personal de Chema Alonso ( https://MyPublicInbox.com/ChemaAlonso ): Ciberseguridad, IA, Innovación, Tecnología, Cómics & Cosas Personasles.

El lado del mal - LightShed versus NightShade & Glaze: La guerra del copyright que envenena imágenes contra la GenAI www.elladodelmal.com/2025/07/ligh... #IA #AI #GenAI #InteligenciaArtificial #MachineLearning #copyright #StableDiffusion #AdversarialAttacks

2 1 0 0

AZoAI

@azoai.bsky.social

1 year ago

Los Alamos AI Breakthrough Neutralizes Adversarial Attacks and Restores Trust in Neural Networks Los Alamos scientists unveil LoRID, a cutting-edge AI defense that wipes out adversarial threats without compromising data integrity—setting a new gold standard for secure and trustworthy neural...

Los Alamos AI Breakthrough Neutralizes Adversarial Attacks and Restores Trust in Neural Networks 🔐🤖⚙️ www.azoai.com/news/2025031... #AI #NeuralNetworks #Cybersecurity #MachineLearning #AdversarialAttacks #DiffusionModels #TensorDecomposition #Innovation #DataSecurity #Supercomputing

0 0 0 0

ThamesTech AI

@thametechai.bsky.social

1 year ago

Protect Your AI Systems from Input Manipulation Attacks Discover how to secure your AI systems against Input Manipulation Attacks. Learn about adversarial training, robust model design, and input validation with Thamestechai. Build resilient AI systems tha...

ChatGPT
🔒 Secure Your AI Systems from Input Manipulation Attacks 🔒

Attackers manipulate data to trick AI systems. Learn how to defend with strategies like adversarial training and input validation.

thamestech.ai/secure-ai-sy...

#AI #Cybersecurity #MachineLearning #AdversarialAttacks #Innovation

0 0 0 0

AI-C

@aicompetence.org

1 year ago

Adversarial Attacks: Can One Attack Fool Multiple Models? Adversarial attacks can transfer between AI models, raising security concerns as one attack might fool multiple models with different architectures.

Discover Transferability of Adversarial Attacks! #adversarialattacks #adversarialexamples #AIattacks #AIsecurity #deeplearning #foolingAImodels #MachineLearning #modelvulnerability #transferability
aicompetence.org/adversarial-...

0 0 0 0

Annual Computer Security Applications Conference

@acsacconf.bsky.social

1 year ago

Weeks et al.'s "A First Look at Toxicity Injection Attacks on Open-domain Chatbots"

Then followed Weeks et al.'s "A First Look at Toxicity Injection Attacks on Open-domain Chatbots", exploring the ease of injecting #toxicity post-deployment into #chatbots by malicious users. (www.acsac.org/2023/p...) 3/4
#LLM #CyberSecurity #AdversarialAttacks #AIrisks

0 0 1 0

Posts tagged #adversarialattacks