CryptoDispatchDaily.com
    Technology

    OpenAI launches smart contract security evaluation system

    February 19, 2026

    OpenAI has introduced EVMbench, a benchmark that measures how well artificial intelligence agents can find, fix, and exploit security flaws in crypto smart contracts.

    Summary

    • OpenAI has introduced EVMbench, a new framework designed to measure how well AI agents can detect, fix, and exploit smart contract vulnerabilities.
    • Developed with Paradigm, the benchmark is built on real audit data and focuses on practical, high-risk security scenarios.
    • Early results show strong progress in exploit tasks, while detection and patching are still challenging.

    The company announced on Feb. 18 that it has developed EVMbench in partnership with Paradigm. The benchmark focuses on contracts built for the Ethereum Virtual Machine and is meant to test how AI systems perform in real financial settings.

    OpenAI said smart contracts currently secure more than $100 billion in open-source crypto assets, making security testing increasingly important as AI tools become more capable.

    Testing how AI handles real security risks

    EVMbench evaluates AI agents across three main tasks: detecting vulnerabilities, fixing flawed code, and carrying out simulated attacks. The system is built using 120 high-risk issues drawn from 40 past security audits, many of them from public auditing competitions.

    Additional scenarios were taken from reviews of the Tempo blockchain, a payments-focused network designed for stablecoin use. These cases were added to reflect how smart contracts are used in financial applications.

    To build the test environment, OpenAI adapted existing exploit scripts and created new ones where needed. All exploit tests run in isolated systems rather than on live networks, and only previously disclosed vulnerabilities are included.

    In detection mode, agents review contract code and try to identify known security flaws. In patch mode, they must fix those flaws without breaking the software. In exploit mode, agents attempt to drain funds from vulnerable contracts in a controlled setting.
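The three modes can be pictured as three different success checks applied to an agent's attempt. The following is a minimal sketch under stated assumptions, not EVMbench's actual grading code: the `Mode`, `Outcome`, `succeeded`, and `score` names and the pass criteria are hypothetical and chosen only to mirror the descriptions above.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    DETECT = "detect"
    PATCH = "patch"
    EXPLOIT = "exploit"


@dataclass
class Outcome:
    """Hypothetical record of one agent attempt on one known vulnerability."""
    mode: Mode
    flaw_found: bool = False       # detect: the agent reported the known flaw
    tests_pass: bool = False       # patch: existing functionality still works
    exploit_blocked: bool = False  # patch: the known exploit no longer succeeds
    funds_drained: bool = False    # exploit: the sandboxed attack succeeded


def succeeded(outcome: Outcome) -> bool:
    """Apply the (assumed) pass criterion for the attempt's mode."""
    if outcome.mode is Mode.DETECT:
        return outcome.flaw_found
    if outcome.mode is Mode.PATCH:
        # A patch only counts if it fixes the flaw without breaking the contract.
        return outcome.tests_pass and outcome.exploit_blocked
    return outcome.funds_drained


def score(outcomes: list[Outcome], mode: Mode) -> float:
    """Fraction of attempts in the given mode that succeeded."""
    relevant = [o for o in outcomes if o.mode is mode]
    if not relevant:
        return 0.0
    return sum(succeeded(o) for o in relevant) / len(relevant)
```

Under these assumptions, a patch that stops the exploit but breaks the contract's existing tests counts as a failure, which matches the article's point that fixes must not break the software.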

    Early results and industry impact

    OpenAI said it developed a custom testing framework so that results can be reproduced and verified.

    The company tested several advanced models using EVMbench. In exploit mode, GPT-5.3-Codex scored 72.2%, compared with 31.9% for GPT-5, which was released about six months earlier. Detection and patching scores were lower, showing that many vulnerabilities are still difficult for AI systems to handle.
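To put the two exploit-mode figures side by side, the short calculation below works out the gap in percentage points and the ratio between them. It uses only the two scores quoted above; the variable names are illustrative.

```python
# Exploit-mode scores as reported in the article (percent).
codex_score = 72.2   # GPT-5.3-Codex
gpt5_score = 31.9    # GPT-5

# Absolute gap in percentage points and relative improvement factor.
gap_points = round(codex_score - gpt5_score, 1)
ratio = round(codex_score / gpt5_score, 2)

print(f"Gap: {gap_points} percentage points")  # Gap: 40.3 percentage points
print(f"Ratio: {ratio}x")                      # Ratio: 2.26x
```

In other words, the newer model's reported exploit score is a little more than twice that of GPT-5.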

    Researchers observed that agents performed best when goals were clear, such as draining funds. Performance dropped when tasks required deeper analysis, such as reviewing large codebases or fixing subtle bugs.

    OpenAI acknowledged that EVMbench does not fully reflect real-world conditions. Many major crypto projects undergo more extensive reviews than those included in the dataset. Some timing-based and multi-chain attacks are also outside the system’s scope.

    The company said the benchmark is intended to support defensive use of AI in cybersecurity. As AI tools become more powerful, they could be used by both attackers and auditors. Measuring their capabilities is seen as a way to reduce risk and encourage responsible deployment.

    Alongside the release, OpenAI said it is expanding its security programs and committing $10 million in API credits to support the protection of open-source software and critical infrastructure. All EVMbench tools and datasets have been made public to support further research.


