OpenAI News & Discussions

YouTube · Post by **wjfox** » Fri May 01, 2026 10:19 pm

GPT-5.5 matches heavily hyped Mythos Preview in new cybersecurity tests

1 May 2026 16:32

Last month, Anthropic made a big deal about the supposedly outsize cybersecurity threat represented by its Mythos Preview model, leading the company to restrict the initial release to “critical industry partners.” But new research from the UK’s AI Security Institute (AISI) suggests that OpenAI’s GPT-5.5, which launched publicly last week, reached “a similar level of performance on our cyber evaluations” as Mythos Preview, which the group evaluated last month.

Since 2023, the AISI has run a variety of frontier AI models through 95 different Capture the Flag challenges designed to test capabilities on cybersecurity tasks, such as reverse engineering, web exploitation, and cryptography. On the highest-level “Expert” tasks, GPT-5.5 passed an average of 71.4 percent, slightly higher than the 68.6 percent achieved by Mythos Preview (though within the margin of error). In one particularly difficult task that involved building a disassembler to decode a Rust binary, AISI notes that “GPT-5.5 solved the challenge in 10 minutes and 22 seconds with no human assistance at a cost of $1.73” in API calls.

GPT-5.5 also matched Mythos Preview in its progress on “The Last Ones” (TLO), an AISI test range set up to simulate a 32-step data extraction attack on a corporate network. GPT-5.5 succeeded in 3 of 10 attempts on TLO, compared to 2 of 10 for Mythos Preview—no previous model had ever succeeded at the test even once. But GPT-5.5 still fails at AISI’s more difficult “Cooling Tower” simulation of an attempted disruption of the control software for a power plant, as every previously tested AI model also has.

The new results for GPT-5.5 suggest that, when it comes to cybersecurity risk, Mythos Preview was likely not “a breakthrough specific to one model” but rather “a byproduct of more general improvements in long-horizon autonomy, reasoning, and coding,” AISI writes.

In a recent interview with the Core Memory podcast, OpenAI CEO Sam Altman criticized what he calls “fear-based marketing” in promoting limited releases for certain AI models. While he said he’s “sure Mythos is a great model for cybersecurity,” he added that “it is clearly incredible marketing to say, ‘We have built a bomb. We are about to drop it on your head. We will sell you a bomb shelter for $100 million.’”

https://arstechnica.com/ai/2026/05/amid ... t-as-good/

firestar464 · Post by **firestar464** » Tue May 12, 2026 4:13 pm

weatheriscool · Post by **weatheriscool** » Sat May 16, 2026 6:54 pm

firestar464 · Post by **firestar464** » Wed May 27, 2026 8:10 pm

firestar464 · Post by **firestar464** » Fri May 29, 2026 3:16 am

Holy shit, did anyone else get this? My ChatGPT seemed to have glitched out and I got access to a list of internal models

weatheriscool · Post by **weatheriscool** » Sun May 31, 2026 6:04 pm

weatheriscool · Post by **weatheriscool** » Tue Jun 02, 2026 4:12 pm

OpenAI launches new Codex tools for white-collar work
Russell Brandom
9:00 AM PDT · June 2, 2026

OpenAI is getting serious about courting enterprise users. On Tuesday, the AI lab released a new set of capabilities for Codex, meant to expand the agentic tool’s uses in the workplace.

Together with the new tools, the company released an internal report on how Codex is being used for knowledge work, finding its uses go far beyond software engineering.

“Codex now has more than 5 million weekly active users, up more than 6x since the launch of the desktop app in February,” reads a blog post introducing the report. “While developers remain the largest user group, knowledge workers now represent about 20 percent of users and are growing more than three times as fast.”

To further court those users, OpenAI released a set of six plug-ins aimed at specific jobs: data analytics, creative production, sales, product design, equity investing, and investment banking. Available from within the Codex app, each of the new tools bundles integrations, instructions, and context to allow Codex to approximate a specific job.

https://techcrunch.com/2026/06/02/opena ... llar-work/

weatheriscool · Post by **weatheriscool** » Wed Jun 03, 2026 3:35 am

firestar464 · Post by **firestar464** » Thu Jun 04, 2026 3:08 pm

weatheriscool · Post by **weatheriscool** » Thu Jun 04, 2026 6:54 pm

weatheriscool · Post by **weatheriscool** » Sun Jun 07, 2026 4:35 am

weatheriscool · Post by **weatheriscool** » Mon Jun 08, 2026 9:11 pm

firestar464 · Post by **firestar464** » Mon Jun 08, 2026 10:08 pm

weatheriscool · Post by **weatheriscool** » Tue Jun 16, 2026 4:16 pm

Future Timeline

OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions

Re: OpenAI News & Discussions