
{"id":136578,"date":"2026-02-19T08:33:35","date_gmt":"2026-02-19T08:33:35","guid":{"rendered":"https:\/\/mycryptomania.com\/?p=136578"},"modified":"2026-02-19T08:33:35","modified_gmt":"2026-02-19T08:33:35","slug":"sam-altmans-openai-partners-with-paradigm-to-launch-ai-tool-evmbench-to-test-ethereum-security","status":"publish","type":"post","link":"https:\/\/mycryptomania.com\/?p=136578","title":{"rendered":"Sam Altman\u2019s OpenAI Partners With Paradigm To Launch AI Tool EVMbench To Test Ethereum Security\u00a0"},"content":{"rendered":"<p>ChatGPT creator OpenAI and crypto investment firm Paradigm have partnered for a new AI tool that will assess if it can fix Ethereum\u2019s security woes.<\/p>\n<p>On 18 February 2026, EVMbench was <a href=\"https:\/\/openai.com\/index\/introducing-evmbench\/\" target=\"_blank\" rel=\"noopener\">released,<\/a> and it now tests ground to see how well AI agents can find and patch bugs in smart contracts.\u00a0With billions previously lost to hacks, this initiative could be a massive step forward for DeFi safety.<\/p>\n<p>So, what is EVMbench? It is an open-source benchmarking framework designed to rigorously test how well AI agents can analyze and interact with smart contracts on the Ethereum Virtual Machine (EVM). <span class=\"citation-52 citation-end-52\">As AI models become more capable of reading and writing code, this tool measures their ability to act as both security auditors and potential attackers.\u00a0<\/span><\/p>\n<p>In a blogpost today, OpenAI said, \u201cTogether with\u00a0Paradigm\u2060, we\u2019re introducing EVMbench, a benchmark evaluating the ability of AI agents to detect, patch, and exploit high-severity smart contract vulnerabilities. EVMbench draws on 120 curated vulnerabilities from 40 audits, with most sourced from open code audit competitions.\u201d<\/p>\n<p>new collab from <a href=\"https:\/\/twitter.com\/paradigm?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">@paradigm<\/a> and <a href=\"https:\/\/twitter.com\/OpenAI?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">@OpenAI<\/a>:<\/p>\n<p>evmbench is a benchmark and agent harness for exploiting smart contract bugs<\/p>\n<p>a few months ago, the best models found &lt;20% of critical, fund-draining <a href=\"https:\/\/twitter.com\/code4rena?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">@Code4rena<\/a> bugs in our benchmark. today they find &gt; 70% <a href=\"https:\/\/t.co\/soOrCR38eO\" target=\"_blank\" rel=\"noopener\">https:\/\/t.co\/soOrCR38eO<\/a> <a href=\"https:\/\/t.co\/2lr0WUVo2Q\" target=\"_blank\" rel=\"noopener\">pic.twitter.com\/2lr0WUVo2Q<\/a><\/p>\n<p>\u2014 Alpin Yukseloglu (@0xalpo) <a href=\"https:\/\/twitter.com\/0xalpo\/status\/2024196291354624141?ref_src=twsrc%5Etfw\" target=\"_blank\" rel=\"noopener\">February 18, 2026<\/a><\/p>\n\n<p><strong>DISCOVER:\u00a0<a class=\"general-link\" href=\"https:\/\/99bitcoins.com\/cryptocurrency\/new-cryptocurrency\/\" target=\"_blank\" rel=\"noopener\">Best New Cryptocurrencies to Invest in 2026<\/a>\u00a0<\/strong><\/p>\n<h2>As Giants Like BlackRock Expand Into Ethereum\u2019s Staking Ecosystem, EVMbench Becomes Critical<\/h2>\n<p>Currently, humans review smart contracts to catch mistakes. EVMbench is critical because smart contracts secure over $100 billion in assets. As major financial players enter the space and <a href=\"https:\/\/99bitcoins.com\/news\/altcoins\/blackrock-ethereum-staking-etf-supply-shock\/\">institutions like BlackRock interact with Ethereum staking security<\/a>, the stakes are higher than ever.<\/p>\n<p>With EVMbench, the goal is to see if AI can act as a hyper-fast security guard that never sleeps, spotting \u201cvending machine\u201d flaws before thieves do.<\/p>\n<p>EVMbench puts AI models through a rigorous boot camp designed to mimic real-world dangers.<\/p>\n<p>According to OpenAI blog, newer models like GPT-5.3-Codex are getting surprisingly good at the \u201cexploit\u201d part, solving over 70% of critical bugs during testing. While fixing the code remains a challenge, this \u201cgym\u201d for AI helps developers improve defensive tools. It creates a foundation for stronger infrastructure, similar to the engineering efforts seen in the recent <a href=\"https:\/\/99bitcoins.com\/news\/altcoins\/megaeth-mainnet-launch-ethereum-l2-debate\/\">MegaETH mainnet launch<\/a>.<\/p>\n<p><strong>DISCOVER:\u00a0<a class=\"general-link\" href=\"https:\/\/99bitcoins.com\/cryptocurrency\/best-crypto-to-buy\/\" target=\"_blank\" rel=\"noopener\">Top 20 Crypto to Buy in 2026<\/a><\/strong><\/p>\n<h2>\u201cSmart contracts routinely secure $100B+ in open-source crypto assets\u201d<\/h2>\n<div class=\"@md:col-span-6 @md:col-start-4 col-span-12 max-w-none [&amp;:not(:first-child)]:mt-sm\">\n<p class=\"mb-sm last:mb-0\">\u201cAs AI agents improve at reading, writing, and executing code, it becomes increasingly important to measure their capabilities in economically meaningful environments, and to encourage the use of AI systems defensively to audit and strengthen deployed contracts,\u201d said OPenAI.<\/p>\n<p>However, OpenAI added, \u201cOur grading system is robust but imperfect.\u201d<\/p>\n<p><strong>DISCOVER:\u00a0<a class=\"general-link\" href=\"https:\/\/99bitcoins.com\/cryptocurrency\/best-solana-meme-coins\/\" target=\"_blank\" rel=\"noopener\">Top Solana Meme Coins to Buy in 2026\u00a0<\/a><\/strong><\/p>\n<p><span>    <\/span><\/p>\n<div class=\"nnbtc-key-takeaways\">\n<h2 class=\"nnbtc-key-takeaways__title\">Key Takeaways<\/h2>\n<p><span><br \/>\n        <\/span><\/p>\n<p><span><br \/>\n         EVMbench now tests ground to see how well AI agents can find and patch bugs in Ethereum smart contracts.\u00a0<\/span><\/p>\n<p><span><br \/>\n    <\/span><\/p>\n<p><span><br \/>\n        <\/span><span> EVMbench is critical because smart contracts secure over $100 billion in assets.<\/span><span><br \/>\n<\/span><span><br \/>\n    <\/span><\/p>\n<p><span><br \/>\n    <\/span><\/p>\n<p><span>    <\/span><\/p>\n<\/div>\n<p>    <\/p>\n<\/div>\n<div class=\"@md:col-span-6 @md:col-start-4 col-span-12 max-w-none [&amp;:not(:first-child)]:mt-sm\"><\/div>\n<p>\u00a0<\/p>\n<h2><\/h2>\n<p>The post <a href=\"https:\/\/99bitcoins.com\/news\/altcoins\/openai-paradigm-test-ai-agents-ethereum-security\/\">Sam Altman\u2019s OpenAI Partners With Paradigm To Launch AI Tool EVMbench To Test Ethereum Security\u00a0<\/a> appeared first on <a href=\"https:\/\/99bitcoins.com\/\">99Bitcoins<\/a>.<\/p>","protected":false},"excerpt":{"rendered":"<p>ChatGPT creator OpenAI and crypto investment firm Paradigm have partnered for a new AI tool that will assess if it can fix Ethereum\u2019s security woes. On 18 February 2026, EVMbench was released, and it now tests ground to see how well AI agents can find and patch bugs in smart contracts.\u00a0With billions previously lost to [&hellip;]<\/p>\n","protected":false},"author":0,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":["post-136578","post","type-post","status-publish","format-standard","hentry","category-discovery"],"_links":{"self":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts\/136578"}],"collection":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"replies":[{"embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=136578"}],"version-history":[{"count":0,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=\/wp\/v2\/posts\/136578\/revisions"}],"wp:attachment":[{"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=136578"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=136578"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mycryptomania.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=136578"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}