
OpenAI Unveils EVMbench Benchmark to Evaluate AI in Smart Contracts

2026/02/19 20:24

TLDR

  • OpenAI and Paradigm have launched EVMbench to evaluate AI’s performance in smart contract security.
  • The benchmark tests AI systems in detecting vulnerabilities, patching code, and executing fund-draining exploits.
  • EVMbench uses 120 high-risk vulnerabilities sourced from 40 professional audits to simulate real-world scenarios.
  • GPT-5.3-Codex achieved a 72.2% success rate in exploit tasks, a notable improvement over GPT-5’s 31.9% performance.
  • OpenAI has invested $10 million in API credits to support open-source security initiatives and strengthen smart contract defenses.

OpenAI and Paradigm have unveiled a new smart contract security evaluation system called EVMbench. This benchmark aims to assess AI systems in detecting vulnerabilities and executing exploits in Ethereum Virtual Machine (EVM) environments. With smart contracts securing over $100 billion in crypto assets, testing the security of these contracts has become crucial.

Testing AI in Smart Contract Security

OpenAI, in collaboration with Paradigm, launched EVMbench to evaluate how AI handles security in smart contracts. The benchmark leverages 120 curated vulnerabilities from 40 professional audits, including scenarios from the Tempo blockchain. The system evaluates AI models in three distinct tasks: detecting vulnerabilities, patching code, and executing fund-draining exploits in a sandboxed EVM environment.

EVMbench focuses on Ethereum-based contracts and incorporates scenarios that reflect real financial applications. The use of 120 high-risk issues, along with data from public auditing competitions, helps to simulate actual challenges faced in the crypto space. OpenAI developed this system to address the growing concern over AI’s role in identifying and mitigating risks in smart contract security.

EVMbench’s Capabilities and Performance

The benchmark evaluates AI agents on three separate security tasks. In detection mode, the agent reviews contract code to identify known vulnerabilities. In patch mode, the agent must fix these vulnerabilities without compromising the contract’s functionality. In exploit mode, the agent must execute a fund-draining exploit in a sandboxed EVM environment.
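To make the three task modes concrete, here is a minimal Python sketch of how per-mode success rates could be tallied. It is not taken from EVMbench's published tooling: the names `Mode`, `TaskResult`, and `success_rates` are hypothetical, and the sample results are invented purely for illustration.

```python
from collections import defaultdict
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    """The three task types described in the article."""
    DETECT = "detect"
    PATCH = "patch"
    EXPLOIT = "exploit"


@dataclass(frozen=True)
class TaskResult:
    vulnerability_id: str  # hypothetical identifier, not an EVMbench field
    mode: Mode
    success: bool


def success_rates(results):
    """Return the fraction of successful tasks for each mode that appears."""
    totals = defaultdict(int)
    wins = defaultdict(int)
    for result in results:
        totals[result.mode] += 1
        if result.success:
            wins[result.mode] += 1
    return {mode: wins[mode] / totals[mode] for mode in totals}


# Invented sample results (assumption: not real benchmark data).
sample = [
    TaskResult("vuln-001", Mode.DETECT, True),
    TaskResult("vuln-002", Mode.DETECT, False),
    TaskResult("vuln-001", Mode.PATCH, False),
    TaskResult("vuln-002", Mode.PATCH, False),
    TaskResult("vuln-001", Mode.EXPLOIT, True),
    TaskResult("vuln-002", Mode.EXPLOIT, True),
]

rates = success_rates(sample)
for mode, rate in rates.items():
    print(f"{mode.value}: {rate:.1%}")
```

Scoring each mode separately matters because, as the reported results indicate, a model can do well at exploitation while lagging at detection and patching.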

In recent testing, the GPT-5.3-Codex model achieved a 72.2% success rate in exploit tasks, up from 31.9% with the GPT-5 model. Detection and patching performance remained lower than exploit performance. OpenAI noted that while the benchmark gives a glimpse into AI’s potential, it does not fully replicate real-world conditions, as some complex multi-chain and timing-based attacks are excluded from the testing framework.
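For a sense of scale, the two exploit success rates quoted above can be compared directly. The short Python sketch below uses only the 72.2% and 31.9% figures from the article; the helper name `compare_rates` is hypothetical.

```python
def compare_rates(old_pct, new_pct):
    """Return (percentage-point gain, new/old ratio), both rounded."""
    point_gain = round(new_pct - old_pct, 1)
    ratio = round(new_pct / old_pct, 2)
    return point_gain, ratio


# Figures as reported in the article: GPT-5 at 31.9%, GPT-5.3-Codex at 72.2%.
gain, ratio = compare_rates(31.9, 72.2)
print(f"Gain: {gain} percentage points, ratio: {ratio}x")
```

This works out to a gain of 40.3 percentage points, or roughly 2.26 times the earlier success rate.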

OpenAI Expands Security Efforts

OpenAI’s announcement also highlighted its broader commitment to security. As part of the release, the company invested $10 million in API credits to support open-source security projects. The company also emphasized that all EVMbench tools and datasets have been made publicly available for further research and development.

The launch of EVMbench is seen as a step toward strengthening the cybersecurity of smart contracts and blockchain systems. With the increasing reliance on smart contracts, OpenAI aims to help the industry address emerging risks by testing AI systems in critical financial settings. As AI continues to evolve, its role in both defending and attacking smart contracts will be crucial for maintaining the integrity of the crypto ecosystem.

The post OpenAI Unveils EVMbench Benchmark to Evaluate AI in Smart Contracts appeared first on CoinCentral.
