Evaluating Vulnerabilities of Coding Agents

Supervisors

Suitable for

MSc in Advanced Computer Science

Abstract

Giving LLM agents the ability to read, write, and execute scripts is a major step towards automating coding. However, allowing LLMs to interact with a user's filesystem also exposes a range of new security vulnerabilities. This project aims to explore the attack surface of LLM coding agents: identifying where and how vulnerabilities arise (e.g., through malicious documentation or unit tests) and proposing defense mechanisms to mitigate the risks. This project will likely be run jointly with collaborators from Softserve as an industry partner.