Estimating worst case frontier risks of open weight LLMs

Christopher Holloway

May 21, 2026 - 18:15

In this paper, we study the worst-case frontier risks of releasing gpt-oss. We introduce malicious fine-tuning (MFT), in which we fine-tune gpt-oss to elicit its maximum capabilities in two domains: biology and cybersecurity.

Christopher Holloway

Christopher Holloway is the founder and director of Progressive Robot, a UK-based technology company. A full-stack engineer with more than two decades of experience, he works across PHP development, ecommerce, Linux infrastructure, technical SEO and AI automation, and writes here on technology, AI, hardware and software.
