Microsoft challenges you to hack its LLM email service

Microsoft is offering $10k prize for hackers who can exploit vulnerabilities in its LLM
The challenge will focus on prompt injection defenses
Software developers and hackers often work together to discover and fix flaws

Are you an experienced hacker looking to make a little extra money this Christmas? Well you might be in luck, as Microsoft is sponsoring a competition, alongside the Institute of Science, and Technology Australia, and ETH Zurich, in which contestants will try to break a simulated Large Language Model (LLM) integrated email client.

Winning teams for the LLMail-Inject challenge will be awarded a share of the $10,000 prize pool.

Participants will need to sign into the challenge using a GitHub account, and create a team. The teams will then be asked to evade prompt injection defenses in a simulated LLM-integrated email client. The LLmail service includes an assistant which can answer questions and perform actions on behalf of the user, and crucially includes defenses against indirect prompt injection tasks.

A mutually beneficial relationship

By bypassing the injection defenses, the hackers will be looking to prompt the LLM to do or reveal things it is not trained to. Through this, Microsoft is aiming to identify weaknesses in its current prompt injection defenses, and encourage the development of robust security measures.

The relationship between security researchers and software developers is often used this way, with Google often offering a ‘bug bounty’ for anyone who discovers and is able to exploit vulnerabilities in its Google Cloud Platform.

Similarly, Microsoft recently announced it was hosting its own Black Hat-esque hacking event, in which competitors would look for vulnerabilities in Microsoft AI, Azure, Identity, Dynamics 365, and M365.

Taking a proactive approach to addressing potential vulnerabilities allows software companies to mitigate the risks before they can be exploited by threat actors in real world scenarios. Slack’s AI assistant was on the receiving end of malicious prompt injections, which was luckily discovered by security researchers – but could have led to real security concerns.

Via The Register

A mutually beneficial relationship

You might also like

Sandisk 3D matrix memory is eight times cheaper than DRAM and compatible with CXL

China carries big risks for investors, money manager suggests

NYT Strands hints and answers for Sunday, February 23 (game #357)

Why the chance that an asteroid might hit Earth in 2032 keeps changing

Will the Nintendo Switch 2 feature Hall effect thumbsticks? It seems likely thanks to reliable leaker

Spike in UK borrowing costs raises specter of public spending cuts

Companies switching from VMWare should expect high-cost high-risk journey

Spike in UK borrowing costs raises specter of public spending cuts

Sandisk 3D matrix memory is eight times cheaper than DRAM and compatible with CXL

China carries big risks for investors, money manager suggests

NYT Strands hints and answers for Sunday, February 23 (game #357)

Will the Nintendo Switch 2 feature Hall effect thumbsticks? It seems likely thanks to reliable leaker

Spike in UK borrowing costs raises specter of public spending cuts

Companies switching from VMWare should expect high-cost high-risk journey

Sandisk 3D matrix memory is eight times cheaper than DRAM and compatible with CXL

China carries big risks for investors, money manager suggests

NYT Strands hints and answers for Sunday, February 23 (game #357)

Microsoft challenges you to hack its LLM email service

A mutually beneficial relationship

You might also like

Related Articles