OpenAI News & Discussions

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »



firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »



weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

User avatar
wjfox
Site Admin
Posts: 13587
Joined: Sat May 15, 2021 6:09 pm
Location: Essex, UK
Contact:

Re: OpenAI News & Discussions

Post by wjfox »

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

1 May 2026 16:32

Last month, Anthropic made a big deal about the supposedly outsize cybersecurity threat represented by its Mythos Preview model, leading the company to restrict the initial release to “critical industry partners.” But new research from the UK’s AI Security Institute (AISI) suggests that OpenAI’s GPT-5.5, which launched publicly last week, reached “a similar level of performance on our cyber evaluations” as Mythos Preview, which the group evaluated last month.

Since 2023, the AISI has run a variety of frontier AI models through 95 different Capture the Flag challenges designed to test capabilities on cybersecurity tasks, such as reverse engineering, web exploitation, and cryptography. On the highest-level “Expert” tasks, GPT-5.5 passed an average of 71.4 percent, slightly higher than the 68.6 percent achieved by Mythos Preview (though within the margin of error). In one particularly difficult task that involved building a disassembler to decode a Rust binary, AISI notes that “GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73” in API calls.

GPT-5.5 also matched Mythos Preview in its progress on “The Last Ones” (TLO), an AISI test range set up to simulate a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts on TLO, compared to 2 of 10 for Mythos Preview—no previous model had ever succeeded at the test even once. But GPT-5.5 still fails at AISI’s more difficult “Cooling Tower” simulation of an attempted disruption of the control software for a power plant, as every previously tested AI model also has.

The new results for GPT-5.5 suggest that, when it comes to cybersecurity risk, Mythos Preview was likely not “a breakthrough specific to one model” but rather “a byproduct of more general improvements in long-horizon autonomy, reasoning, and coding,” AISI writes.

In a recent interview with the Core Memory podcast, OpenAI CEO Sam Altman criticized what he calls “fear-based marketing” in promoting limited releases for certain AI models. While he said he’s “sure Mythos is a great model for cybersecurity,” he added that “it is clearly incredible marketing to say, ‘We have built a bomb. We are about to drop it on your head. We will sell you a bomb shelter for $100 million.’”

https://arstechnica.com/ai/2026/05/amid ... t-as-good/
firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

Image

Holy shit, did anyone else get this? My ChatGPT seemed to have glitched out and I got access to a list of internal models
weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »


OpenAI launches new Codex tools for white-collar work

Russell Brandom
9:00 AM PDT · June 2, 2026
OpenAI is getting serious about courting enterprise users. On Tuesday, the AI lab released a new set of capabilities for Codex, meant to expand the agentic tool’s uses in the workplace.

Together with the new tools, the company released an internal report on how Codex is being used for knowledge work, finding its uses go far beyond software engineering.

“Codex now has more than 5 million weekly active users, up more than 6x since the launch of the desktop app in February,” reads a blog post introducing the report. “While developers remain the largest user group, knowledge workers now represent about 20 percent of users and are growing more than three times as fast.”

To further court those users, OpenAI released a set of six plug-ins aimed at specific jobs: data analytics, creative production, sales, product design, equity investing, and investment banking. Available from within the Codex app, each of the new tools bundles integrations, instructions, and context to allow Codex to approximate a specific job.
https://techcrunch.com/2026/06/02/opena ... llar-work/
weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

firestar464
Posts: 7206
Joined: Wed Oct 12, 2022 7:45 am

Re: OpenAI News & Discussions

Post by firestar464 »

weatheriscool
Posts: 24494
Joined: Sun May 16, 2021 6:16 pm
Contact:

Re: OpenAI News & Discussions

Post by weatheriscool »

Post Reply