<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>machine-learning on The Trail of Bits Blog</title><link>https://miscreants.github.io/blog.trailofbits.com/categories/machine-learning/</link><description>Recent content in machine-learning on The Trail of Bits Blog</description><generator>Hugo</generator><language>en-us</language><lastBuildDate>Tue, 13 Jan 2026 00:00:00 -0500</lastBuildDate><atom:link href="https://miscreants.github.io/blog.trailofbits.com/categories/machine-learning/index.xml" rel="self" type="application/rss+xml"/><item><title>Lack of isolation in agentic browsers resurfaces old vulnerabilities</title><link>https://miscreants.github.io/blog.trailofbits.com/2026/01/13/lack-of-isolation-in-agentic-browsers-resurfaces-old-vulnerabilities/</link><pubDate>Tue, 13 Jan 2026 07:00:00 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2026/01/13/lack-of-isolation-in-agentic-browsers-resurfaces-old-vulnerabilities/</guid><description>We exploited a lack of isolation mechanisms in multiple agentic browsers to perform attacks ranging from the dissemination of false information to cross-site data leaks. These attacks resurface decades-old patterns of vulnerabilities that the web security community spent years building effective defenses against.</description></item><item><title>Can chatbots craft correct code?</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/12/19/can-chatbots-craft-correct-code/</link><pubDate>Fri, 19 Dec 2025 07:00:00 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/12/19/can-chatbots-craft-correct-code/</guid><description>LLMs fundamentally differ from compilers because they lack determinism and semantic guarantees, making them useful coding assistants but unreliable for autonomous code generation without human review and formal verification.</description></item><item><title>Prompt injection to RCE in AI agents</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/10/22/prompt-injection-to-rce-in-ai-agents/</link><pubDate>Wed, 22 Oct 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/10/22/prompt-injection-to-rce-in-ai-agents/</guid><description>We bypassed human approval protections for system command execution in AI agents, achieving RCE in three agent platforms.</description></item><item><title>Fickling’s new AI/ML pickle file scanner</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/09/16/ficklings-new-ai/ml-pickle-file-scanner/</link><pubDate>Tue, 16 Sep 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/09/16/ficklings-new-ai/ml-pickle-file-scanner/</guid><description>We&amp;rsquo;ve added a pickle file scanner to Fickling that uses an allowlist approach to protect AI/ML environments from malicious pickle files that could compromise models or infrastructure.</description></item><item><title>Intern projects that outlived the internship</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/28/intern-projects-that-outlived-the-internship/</link><pubDate>Thu, 28 Aug 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/28/intern-projects-that-outlived-the-internship/</guid><description>Our business operations intern at Trail of Bits built two AI-powered tools that became permanent company resources—a podcast workflow that saves 1,250 hours annually and a Slack exporter that enables efficient knowledge retrieval across the organization.</description></item><item><title>Weaponizing image scaling against production AI systems</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/21/weaponizing-image-scaling-against-production-ai-systems/</link><pubDate>Thu, 21 Aug 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/21/weaponizing-image-scaling-against-production-ai-systems/</guid><description>In this blog post, we’ll detail how attackers can exploit image scaling on Gemini CLI, Vertex AI Studio, Gemini’s web and API interfaces, Google Assistant, Genspark, and other production AI systems. We’ll also explain how to mitigate and defend against these attacks, and we’ll introduce Anamorpher, our open-source tool that lets you explore and generate these crafted images.</description></item><item><title>Trail of Bits' Buttercup wins 2nd place in AIxCC Challenge</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/09/trail-of-bits-buttercup-wins-2nd-place-in-aixcc-challenge/</link><pubDate>Sat, 09 Aug 2025 10:30:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/09/trail-of-bits-buttercup-wins-2nd-place-in-aixcc-challenge/</guid><description>Our team won the runner-up prize of $3M at DARPA&amp;rsquo;s AI Cyber Challenge, demonstrating Buttercup&amp;rsquo;s world-class automated vulnerability discovery and patching capabilities with remarkable cost efficiency.</description></item><item><title>Buttercup is now open-source!</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/08/buttercup-is-now-open-source/</link><pubDate>Fri, 08 Aug 2025 00:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/08/buttercup-is-now-open-source/</guid><description>Now that DARPA&amp;rsquo;s AI Cyber Challenge (AIxCC) has officially ended, we can finally make Buttercup, our CRS (Cyber Reasoning System), open source!</description></item><item><title>AIxCC finals: Tale of the tape</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/07/aixcc-finals-tale-of-the-tape/</link><pubDate>Thu, 07 Aug 2025 00:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/07/aixcc-finals-tale-of-the-tape/</guid><description>While the AIxCC winner has not yet been announced, differences in the finalists&amp;rsquo; approaches show that there are multiple viable paths forward to using AI for vulnerability detection.</description></item><item><title>Prompt injection engineering for attackers: Exploiting GitHub Copilot</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/06/prompt-injection-engineering-for-attackers-exploiting-github-copilot/</link><pubDate>Wed, 06 Aug 2025 00:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/06/prompt-injection-engineering-for-attackers-exploiting-github-copilot/</guid><description>Prompt injection pervades discussions about security for LLMs and AI agents. But there is little public information on how to write powerful, discreet, and reliable prompt injection exploits. In this post, we will design and implement a prompt injection exploit targeting GitHub’s Copilot Agent, with a focus on maximizing reliability and minimizing the odds of detection.</description></item><item><title>Uncovering memory corruption in NVIDIA Triton (as a new hire)</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/08/04/uncovering-memory-corruption-in-nvidia-triton-as-a-new-hire/</link><pubDate>Tue, 05 Aug 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/08/04/uncovering-memory-corruption-in-nvidia-triton-as-a-new-hire/</guid><description>In my first month at Trail of Bits as an AI/ML security engineer, I found two remotely accessible memory corruption bugs in NVIDIA’s Triton Inference Server during a routine onboarding practice.</description></item><item><title>Hijacking multi-agent systems in your PajaMAS</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/07/31/hijacking-multi-agent-systems-in-your-pajamas/</link><pubDate>Thu, 31 Jul 2025 09:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/07/31/hijacking-multi-agent-systems-in-your-pajamas/</guid><description>We’re releasing pajaMAS: a curated set of MAS hijacking demos that illustrate important principles of MAS security.</description></item><item><title>We built the security layer MCP always needed</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/07/28/we-built-the-security-layer-mcp-always-needed/</link><pubDate>Mon, 28 Jul 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/07/28/we-built-the-security-layer-mcp-always-needed/</guid><description>Today we’re announcing the beta release of mcp-context-protector, a security wrapper for LLM apps using the Model Context Protocol (MCP). It defends against the line jumping attacks documented earlier in this blog series, such as prompt injection via tool descriptions and ANSI terminal escape codes.</description></item><item><title>Datasig: Fingerprinting AI/ML datasets to stop data-borne attacks</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/05/02/datasig-fingerprinting-ai/ml-datasets-to-stop-data-borne-attacks/</link><pubDate>Fri, 02 May 2025 07:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/05/02/datasig-fingerprinting-ai/ml-datasets-to-stop-data-borne-attacks/</guid><description>Datasig generates compact, unique fingerprints for AI/ML datasets that let you compare training data with high accuracy—without needing access to the raw data itself.&lt;br&gt;
This critical capability helps AIBOM (AI bill of materials) tools detect data-borne vulnerabilities that traditional security tools completely miss.</description></item><item><title>Insecure credential storage plagues MCP</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/04/30/insecure-credential-storage-plagues-mcp/</link><pubDate>Wed, 30 Apr 2025 03:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/04/30/insecure-credential-storage-plagues-mcp/</guid><description>This post describes how many examples of MCP software store long-term API keys for third-party services in plaintext on the local filesystem, often with insecure, world-readable permissions.</description></item><item><title>Deceiving users with ANSI terminal codes in MCP</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/04/29/deceiving-users-with-ansi-terminal-codes-in-mcp/</link><pubDate>Tue, 29 Apr 2025 09:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/04/29/deceiving-users-with-ansi-terminal-codes-in-mcp/</guid><description>This post describes attacks using ANSI terminal code escape sequences to hide malicious instructions to the LLM, leveraging the line jumping vulnerability we discovered in MCP.</description></item><item><title>How MCP servers can steal your conversation history</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/04/23/how-mcp-servers-can-steal-your-conversation-history/</link><pubDate>Wed, 23 Apr 2025 10:30:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/04/23/how-mcp-servers-can-steal-your-conversation-history/</guid><description>Malicious MCP servers can inject trigger phrases into tool descriptions to exfiltrate entire conversation histories and steal sensitive credentials and IP.</description></item><item><title>Jumping the line: How MCP servers can attack you before you ever use them</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/04/21/jumping-the-line-how-mcp-servers-can-attack-you-before-you-ever-use-them/</link><pubDate>Mon, 21 Apr 2025 10:30:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/04/21/jumping-the-line-how-mcp-servers-can-attack-you-before-you-ever-use-them/</guid><description>MCP&amp;rsquo;s &amp;rsquo;line jumping&amp;rsquo; vulnerability lets malicious servers inject prompts through tool descriptions to manipulate AI behavior before tools are ever invoked.</description></item><item><title>Kicking off AIxCC’s Finals with Buttercup</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/04/21/kicking-off-aixccs-finals-with-buttercup/</link><pubDate>Mon, 21 Apr 2025 09:00:00 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/04/21/kicking-off-aixccs-finals-with-buttercup/</guid><description>Trail of Bits&amp;rsquo; Buttercup competes in DARPA&amp;rsquo;s AIxCC Finals with expanded resources, multiple rounds, new challenge types, and custom AI model capabilities.</description></item><item><title>Celebrating our 2024 open-source contributions</title><link>https://miscreants.github.io/blog.trailofbits.com/2025/01/23/celebrating-our-2024-open-source-contributions/</link><pubDate>Thu, 23 Jan 2025 09:00:30 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2025/01/23/celebrating-our-2024-open-source-contributions/</guid><description>While Trail of Bits is known for developing security tools like Slither, Medusa, and Fickling, our engineering efforts extend far beyond our own projects. Throughout 2024, our team has been deeply engaged with the broader security ecosystem, tackling challenges in open-source tools and infrastructure that security engineers rely on every day. This year, our engineers […]</description></item><item><title>Evaluating Solidity support in AI coding assistants</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/11/19/evaluating-solidity-support-in-ai-coding-assistants/</link><pubDate>Tue, 19 Nov 2024 09:00:37 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/11/19/evaluating-solidity-support-in-ai-coding-assistants/</guid><description>AI-enabled code assistants (like GitHub’s Copilot, Continue.dev, and Tabby) are making software development faster and more productive. Unfortunately, these tools are often bad at Solidity. So we decided to improve them! To make it easier to write, edit, and understand Solidity with AI-enabled tools, we have: Added support for Solidity into Tabby […]</description></item><item><title>Auditing Gradio 5, Hugging Face’s ML GUI framework</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/10/10/auditing-gradio-5-hugging-faces-ml-gui-framework/</link><pubDate>Thu, 10 Oct 2024 12:00:29 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/10/10/auditing-gradio-5-hugging-faces-ml-gui-framework/</guid><description>This is a joint post with the Hugging Face Gradio team; read their announcement here! You can find the full report with all of the detailed findings from our security audit of Gradio 5 here. Hugging Face hired Trail of Bits to audit Gradio 5, a popular open-source library that provides a web interface that […]</description></item><item><title>Inside DEF CON: Michael Brown on how AI/ML is revolutionizing cybersecurity</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/09/17/inside-def-con-michael-brown-on-how-ai-ml-is-revolutionizing-cybersecurity/</link><pubDate>Tue, 17 Sep 2024 09:00:08 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/09/17/inside-def-con-michael-brown-on-how-ai-ml-is-revolutionizing-cybersecurity/</guid><description>At DEF CON, Michael Brown, Principal Security Engineer at Trail of Bits, sat down with Michael Novinson from Information Security Media Group (ISMG) to discuss four critical areas where AI/ML is revolutionizing security. Here’s what they covered: AI/ML techniques surpass the limits of traditional software analysis As Moore’s law slows down after 20 years of […]</description></item><item><title>Provisioning cloud infrastructure the wrong way, but faster</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/08/27/provisioning-cloud-infrastructure-the-wrong-way-but-faster/</link><pubDate>Tue, 27 Aug 2024 09:00:06 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/08/27/provisioning-cloud-infrastructure-the-wrong-way-but-faster/</guid><description>Today we’re going to provision some cloud infrastructure the Max Power way: by combining automation with unchecked AI output. Unfortunately, this method produces cloud infrastructure code that 1) works and 2) has terrible security properties. In a nutshell, AI-based tools like Claude and ChatGPT readily provide extremely bad cloud infrastructure provisioning code, […]</description></item><item><title>Trail of Bits’ Buttercup heads to DARPA’s AIxCC</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/08/09/trail-of-bits-buttercup-heads-to-darpas-aixcc/</link><pubDate>Fri, 09 Aug 2024 09:10:29 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/08/09/trail-of-bits-buttercup-heads-to-darpas-aixcc/</guid><description>With DARPA’s AI Cyber Challenge (AIxCC) semifinal starting today at DEF CON 2024, we want to introduce Buttercup, our AIxCC submission. Buttercup is a Cyber Reasoning System (CRS) that combines conventional cybersecurity techniques like fuzzing and static analysis with AI and machine learning to find and fix software vulnerabilities. The system is designed to operate […]</description></item><item><title>Auditing the Ask Astro LLM Q&amp;A app</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/07/05/auditing-the-ask-astro-llm-qa-app/</link><pubDate>Fri, 05 Jul 2024 09:00:28 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/07/05/auditing-the-ask-astro-llm-qa-app/</guid><description>Today, we present the second of our open-source AI security audits: a look at security issues we found in an open-source retrieval augmented generation (RAG) application that could lead to chatbot output poisoning, inaccurate document ingestion, and potential denial of service. This audit follows up on our previous work that identified 11 security vulnerabilities in […]</description></item><item><title>Understanding Apple’s On-Device and Server Foundation Models release</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/06/14/understanding-apples-on-device-and-server-foundations-model-release/</link><pubDate>Fri, 14 Jun 2024 16:49:37 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/06/14/understanding-apples-on-device-and-server-foundations-model-release/</guid><description>Earlier this week, at Apple’s WWDC, we finally witnessed Apple’s AI strategy. The videos and live demos were accompanied by two long-form releases: Apple’s Private Cloud Compute and Apple’s On-Device and Server Foundation Models. This blog post is about the latter. So, what is Apple releasing, and how does it compare to […]</description></item><item><title>PCC: Bold step forward, not without flaws</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/06/14/pcc-bold-step-forward-not-without-flaws/</link><pubDate>Fri, 14 Jun 2024 15:46:48 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/06/14/pcc-bold-step-forward-not-without-flaws/</guid><description>Earlier this week, Apple announced Private Cloud Compute (or PCC for short). Without deep context on the state of the art of Artificial Intelligence (AI) and Machine Learning (ML) security, some sensible design choices may seem surprising. Conversely, some of the risks linked to this design are hidden in the fine print. […]</description></item><item><title>Exploiting ML models with pickle file attacks: Part 2</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/06/11/exploiting-ml-models-with-pickle-file-attacks-part-2/</link><pubDate>Tue, 11 Jun 2024 11:00:17 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/06/11/exploiting-ml-models-with-pickle-file-attacks-part-2/</guid><description>In part 1, we introduced Sleepy Pickle, an attack that uses malicious pickle files to stealthily compromise ML models and carry out sophisticated attacks against end users. Here we show how this technique can be adapted to enable long-lasting presence on compromised systems while remaining undetected. This variant technique, which we call […]</description></item><item><title>Exploiting ML models with pickle file attacks: Part 1</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/06/11/exploiting-ml-models-with-pickle-file-attacks-part-1/</link><pubDate>Tue, 11 Jun 2024 09:00:36 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/06/11/exploiting-ml-models-with-pickle-file-attacks-part-1/</guid><description>We’ve developed a new hybrid machine learning (ML) model exploitation technique called Sleepy Pickle that takes advantage of the pervasive and notoriously insecure Pickle file format used to package and distribute ML models. Sleepy pickle goes beyond previous exploit techniques that target an organization’s systems when they deploy ML models to instead […]</description></item><item><title>Announcing AI/ML safety and security trainings</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/06/07/announcing-ai-ml-safety-and-security-trainings/</link><pubDate>Fri, 07 Jun 2024 09:00:41 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/06/07/announcing-ai-ml-safety-and-security-trainings/</guid><description>We are offering AI/ML safety and security training this year! Recent advances in AI/ML technologies opened up a new world of possibilities for businesses to run more efficiently and offer better services and products. However, incorporating AI/ML into computing systems brings new and unique complexities, risks, and attack surfaces. In our experience […]</description></item><item><title>Relishing new Fickling features for securing ML systems</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/03/04/relishing-new-fickling-features-for-securing-ml-systems/</link><pubDate>Mon, 04 Mar 2024 09:00:44 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/03/04/relishing-new-fickling-features-for-securing-ml-systems/</guid><description>We’ve added new features to Fickling to offer enhanced threat detection and analysis across a broad spectrum of machine learning (ML) workflows. Fickling is a decompiler, static analyzer, and bytecode rewriter for the Python pickle module that can help you detect, analyze, or create malicious pickle files. While the ML community […]</description></item><item><title>Our response to the US Army’s RFI on developing AIBOM tools</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/02/28/our-response-to-the-us-armys-rfi-on-developing-aibom-tools-2/</link><pubDate>Wed, 28 Feb 2024 11:30:05 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/02/28/our-response-to-the-us-armys-rfi-on-developing-aibom-tools-2/</guid><description>The US Army’s Program Executive Office for Intelligence, Electronic Warfare and Sensors (PEO IEW&amp;amp;S) recently issued a request for information (RFI) on methods to implement and automate production of an artificial intelligence bill of materials (AIBOM) as part of Project Linchpin. The RFI describes the AIBOM as a detailed […]</description></item><item><title>Celebrating our 2023 open-source contributions</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/01/24/celebrating-our-2023-open-source-contributions/</link><pubDate>Wed, 24 Jan 2024 09:00:22 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/01/24/celebrating-our-2023-open-source-contributions/</guid><description>At Trail of Bits, we pride ourselves on making our best tools open source, such as Slither, PolyTracker, and RPC Investigator. But while this post is about open source, it’s not about our tools… In 2023, our employees submitted over 450 pull requests (PRs) that were merged into non-Trail of Bits repositories. This demonstrates our […]</description></item><item><title>Our thoughts on AIxCC’s competition format</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/01/18/our-thoughts-on-aixccs-competition-format/</link><pubDate>Thu, 18 Jan 2024 09:00:38 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/01/18/our-thoughts-on-aixccs-competition-format/</guid><description>Late last month, DARPA officially opened registration for their AI Cyber Challenge (AIxCC). As part of the festivities, DARPA also released some highly anticipated information about the competition: a request for comments (RFC) that contained a sample challenge problem and the scoring methodology. Prior rules documents and FAQs released by DARPA painted […]</description></item><item><title>LeftoverLocals: Listening to LLM responses through leaked GPU local memory</title><link>https://miscreants.github.io/blog.trailofbits.com/2024/01/16/leftoverlocals-listening-to-llm-responses-through-leaked-gpu-local-memory/</link><pubDate>Tue, 16 Jan 2024 12:00:39 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2024/01/16/leftoverlocals-listening-to-llm-responses-through-leaked-gpu-local-memory/</guid><description>We are disclosing LeftoverLocals: a vulnerability that allows recovery of data from GPU local memory created by another process on Apple, Qualcomm, AMD, and Imagination GPUs. LeftoverLocals impacts the security posture of GPU applications as a whole, with particular significance to LLMs and ML models run on impacted GPU […]</description></item><item><title>AI In Windows: Investigating Windows Copilot</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/12/27/ai-in-windows-investigating-windows-copilot/</link><pubDate>Wed, 27 Dec 2023 09:00:22 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/12/27/ai-in-windows-investigating-windows-copilot/</guid><description>AI is becoming ubiquitous, as developers of widely used tools like GitHub and Photoshop are quickly implementing and iterating on AI-enabled features. With Microsoft&amp;rsquo;s recent integration of Copilot into Windows, AI is even on the old stalwart of computing—the desktop. The integration of an AI assistant into an entire operating system is a significant development that warrants investigation.</description></item><item><title>Assessing the security posture of a widely used vision model: YOLOv7</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/11/15/assessing-the-security-posture-of-a-widely-used-vision-model-yolov7/</link><pubDate>Wed, 15 Nov 2023 10:15:05 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/11/15/assessing-the-security-posture-of-a-widely-used-vision-model-yolov7/</guid><description>TL;DR: We identified 11 security vulnerabilities in YOLOv7, a popular computer vision framework, that could enable attacks including remote code execution (RCE), denial of service, and model differentials (where an attacker can trigger a model to perform differently in different contexts). Open-source software […]</description></item><item><title>How AI will affect cybersecurity: What we told the CFTC</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/07/31/how-ai-will-affect-cybersecurity-what-we-told-the-cftc/</link><pubDate>Mon, 31 Jul 2023 07:00:32 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/07/31/how-ai-will-affect-cybersecurity-what-we-told-the-cftc/</guid><description>Dan Guido, CEO The second meeting of the Commodity Futures Trading Commission’s Technology Advisory Committee (TAC) on July 18 focused on the effects of AI on the financial sector. During the meeting, I explained that AI has the potential to fundamentally change the balance between cyber offense and defense, and that we need security-focused benchmarks […]</description></item><item><title>Trail of Bits’s Response to OSTP National Priorities for AI RFI</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/07/18/trail-of-bitss-response-to-ostp-national-priorities-for-ai-rfi/</link><pubDate>Tue, 18 Jul 2023 13:46:44 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/07/18/trail-of-bitss-response-to-ostp-national-priorities-for-ai-rfi/</guid><description>The Office of Science and Technology Policy (OSTP) has circulated a request for information (RFI) on how best to develop policies that support the responsible development of AI while minimizing risk to rights, safety, and national security. In our response, we highlight the following points: To ensure that AI […]</description></item><item><title>Trail of Bits’s Response to NTIA AI Accountability RFC</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/06/16/trail-of-bitss-response-to-ntia-ai-accountability-rfc/</link><pubDate>Fri, 16 Jun 2023 08:00:10 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/06/16/trail-of-bitss-response-to-ntia-ai-accountability-rfc/</guid><description>The National Telecommunications and Information Administration (NTIA) has circulated an Artificial Intelligence (AI) Accountability Policy Request for Comment on what policies can support the development of AI audits, assessments, certifications, and other mechanisms to create earned trust in AI systems. Trail of Bits has submitted a response to the […]</description></item><item><title>Codex (and GPT-4) can’t beat humans on smart contract audits</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/03/22/codex-and-gpt4-cant-beat-humans-on-smart-contract-audits/</link><pubDate>Wed, 22 Mar 2023 07:00:49 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/03/22/codex-and-gpt4-cant-beat-humans-on-smart-contract-audits/</guid><description>Is artificial intelligence (AI) capable of powering software security audits? Over the last four months, we piloted a project called Toucan to find out. Toucan was intended to integrate OpenAI’s Codex into our Solidity auditing workflow. This experiment went far […]</description></item><item><title>We need a new way to measure AI security</title><link>https://miscreants.github.io/blog.trailofbits.com/2023/03/14/ai-security-safety-audit-assurance-heidy-khlaaf-odd/</link><pubDate>Tue, 14 Mar 2023 08:00:47 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2023/03/14/ai-security-safety-audit-assurance-heidy-khlaaf-odd/</guid><description>Trail of Bits has launched a practice focused on machine learning and artificial intelligence, bringing together safety and security methodologies to create a new risk assessment and assurance program. This program evaluates potential bespoke risks and determines the necessary safety and security measures for AI-based systems.</description></item><item><title>Secure your machine learning with Semgrep</title><link>https://miscreants.github.io/blog.trailofbits.com/2022/10/03/semgrep-maching-learning-static-analysis/</link><pubDate>Mon, 03 Oct 2022 09:00:53 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2022/10/03/semgrep-maching-learning-static-analysis/</guid><description>tl;dr: Our publicly available Semgrep ruleset now has 11 rules dedicated to the misuse of machine learning libraries. Try it out now! Picture this: You’ve spent months curating images, trying out different architectures, downloading pretrained models, messing with Kubernetes, and you’re finally ready to ship your sparkling new machine learning (ML) product. […]</description></item><item><title>PrivacyRaven: Implementing a proof of concept for model inversion</title><link>https://miscreants.github.io/blog.trailofbits.com/2021/11/09/privacyraven-implementing-a-proof-of-concept-for-model-inversion/</link><pubDate>Tue, 09 Nov 2021 00:45:55 -0500</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2021/11/09/privacyraven-implementing-a-proof-of-concept-for-model-inversion/</guid><description>Originally published August 3, 2021 During my Trail of Bits winternship and springternship, I had the pleasure of working with Suha Hussain and Jim Miller on PrivacyRaven, a Python-based tool for testing deep-learning frameworks against a plethora of privacy attacks. I worked on improving PrivacyRaven’s versatility by adding compatibility for services […]</description></item><item><title>Never a dill moment: Exploiting machine learning pickle files</title><link>https://miscreants.github.io/blog.trailofbits.com/2021/03/15/never-a-dill-moment-exploiting-machine-learning-pickle-files/</link><pubDate>Mon, 15 Mar 2021 11:06:18 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2021/03/15/never-a-dill-moment-exploiting-machine-learning-pickle-files/</guid><description>Many machine learning (ML) models are Python pickle files under the hood, and it makes sense. The use of pickling conserves memory, enables start-and-stop model training, and makes trained models portable (and, thereby, shareable). Pickling is easy to implement, is built into Python without requiring additional dependencies, and supports serialization of custom […]</description></item><item><title>Efficient audits with machine learning and Slither-simil</title><link>https://miscreants.github.io/blog.trailofbits.com/2020/10/23/efficient-audits-with-machine-learning-and-slither-simil/</link><pubDate>Fri, 23 Oct 2020 07:00:51 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2020/10/23/efficient-audits-with-machine-learning-and-slither-simil/</guid><description>Trail of Bits has manually curated a wealth of data—years of security assessment reports—and now we’re exploring how to use this data to make the smart contract auditing process more efficient with Slither-simil. Based on accumulated knowledge embedded in previous audits, we set out to detect similar vulnerable code snippets […]</description></item><item><title>PrivacyRaven Has Left the Nest</title><link>https://miscreants.github.io/blog.trailofbits.com/2020/10/08/privacyraven-has-left-the-nest/</link><pubDate>Thu, 08 Oct 2020 08:00:36 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2020/10/08/privacyraven-has-left-the-nest/</guid><description>If you work on deep learning systems, check out our new tool, PrivacyRaven—it’s a Python library that equips engineers and researchers with a comprehensive testing suite for simulating privacy attacks on deep learning systems. Because deep learning enables software to perform tasks without explicit programming, it’s become ubiquitous in […]</description></item><item><title>Multi-Party Computation on Machine Learning</title><link>https://miscreants.github.io/blog.trailofbits.com/2019/10/04/multi-party-computation-on-machine-learning/</link><pubDate>Fri, 04 Oct 2019 10:13:15 -0400</pubDate><guid>https://miscreants.github.io/blog.trailofbits.com/2019/10/04/multi-party-computation-on-machine-learning/</guid><description>During my internship this summer, I built a multi-party computation (MPC) tool that implements a 3-party computation protocol for perceptron and support vector machine (SVM) algorithms. MPC enables multiple parties to perform analyses on private datasets without sharing them with each other. I defveloped a technique that lets three parties obtain the results of machine […]</description></item></channel></rss>