Thursday, January 15, 2026
No Result
View All Result
Blockchain Broadcast
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoin
    • Ethereum
    • Crypto Exchanges
  • NFT
  • Blockchain
  • Metaverse
  • DeFi
  • Web3
  • Analysis
  • Regulations
  • Scam Alert
Crypto Marketcap
Blockchain Broadcast
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoin
    • Ethereum
    • Crypto Exchanges
  • NFT
  • Blockchain
  • Metaverse
  • DeFi
  • Web3
  • Analysis
  • Regulations
  • Scam Alert
No Result
View All Result
Blockchain Broadcast
No Result
View All Result

OpenAI GPT 4o ranked as best AI model for writing Solidity smart contract code by IQ

October 21, 2024
in Web3
Reading Time: 3 mins read
0 0
A A
0
Home Web3
Share on FacebookShare on Twitter


Receive, Manage & Grow Your Crypto Investments With Brighty

SolidityBench by IQ has launched as the primary leaderboard to guage LLMs in Solidity code technology. Obtainable on Hugging Face, it introduces two modern benchmarks, NaïveJudge and HumanEval for Solidity, designed to evaluate and rank the proficiency of AI fashions in producing good contract code.

Developed by IQ’s BrainDAO as a part of its forthcoming IQ Code suite, SolidityBench serves to refine their very own EVMind LLMs and evaluate them towards generalist and community-created fashions. IQ Code goals to supply AI fashions tailor-made for producing and auditing good contract code, addressing the rising want for safe and environment friendly blockchain purposes.

As IQ informed CryptoSlate, NaïveJudge gives a novel strategy by tasking LLMs with implementing good contracts primarily based on detailed specs derived from audited OpenZeppelin contracts. These contracts present a gold commonplace for correctness and effectivity. The generated code is evaluated towards a reference implementation utilizing standards comparable to purposeful completeness, adherence to Solidity finest practices and safety requirements, and optimization effectivity.

The analysis course of leverages superior LLMs, together with totally different variations of OpenAI’s GPT-4 and Claude 3.5 Sonnet as neutral code reviewers. They assess the code primarily based on rigorous standards, together with implementing all key functionalities, dealing with edge instances, error administration, correct syntax utilization, and total code construction and maintainability.

Optimization issues comparable to fuel effectivity and storage administration are additionally evaluated. Scores vary from 0 to 100, offering a complete evaluation throughout performance, safety, and effectivity, mirroring the complexities {of professional} good contract growth.

Which AI fashions are finest for solidity good contract growth?

Benchmarking outcomes confirmed that OpenAI’s GPT-4o mannequin achieved the best total rating of 80.05, with a NaïveJudge rating of 72.18 and HumanEval for Solidity move charges of 80% at move@1 and 92% at move@3.

Apparently, newer reasoning fashions like OpenAI’s o1-preview and o1-mini had been overwhelmed to the highest spot, scoring 77.61 and 75.08, respectively. Fashions from Anthropic and XAI, together with Claude 3.5 Sonnet and grok-2, demonstrated aggressive efficiency with total scores hovering round 74. Nvidia’s Llama-3.1-Nemotron-70B scored lowest within the high 10 at 52.54.

SolidityBench scores for LLMs (Hugging Face)
SolidityBench scores for LLMs (Hugging Face)

Per IQ, HumanEval for Solidity adapts OpenAI’s unique HumanEval benchmark from Python to Solidity, encompassing 25 duties of various problem. Every job contains corresponding assessments appropriate with Hardhat, a well-liked Ethereum growth setting, facilitating correct compilation and testing of generated code. The analysis metrics, move@1 and move@3, measure the mannequin’s success on preliminary makes an attempt and over a number of tries, providing insights into each precision and problem-solving capabilities.

Targets of using AI fashions in good contract growth

By introducing these benchmarks, SolidityBench seeks to advance AI-assisted good contract growth. It encourages the creation of extra subtle and dependable AI fashions whereas offering builders and researchers with helpful insights into AI’s present capabilities and limitations in Solidity growth.

The benchmarking toolkit goals to advance IQ Code’s EVMind LLMs and in addition units new requirements for AI-assisted good contract growth throughout the blockchain ecosystem. The initiative hopes to deal with a vital want within the business, the place the demand for safe and environment friendly good contracts continues to develop.

Builders, researchers, and AI fans are invited to discover and contribute to SolidityBench, which goals to drive the continual refinement of AI fashions, promote finest practices, and advance decentralized purposes.

Go to the SolidityBench leaderboard on Hugging Face to be taught extra and start benchmarking Solidity technology fashions.

🤖 Prime AI Crypto Belongings

View AllMentioned on this article



Source link

Tags: CodeContractGPTModelOpenAIrankedSmartSoliditywriting
Previous Post

Land a Six-Figure Salary Job as a Blockchain Developer

Next Post

PayPal’s Move to Zero Fees for International Crypto Transfers

Related Posts

AI, Impersonations Drove Crypto Scam Losses to Record  Billion in 2025: Chainalysis
Web3

AI, Impersonations Drove Crypto Scam Losses to Record $17 Billion in 2025: Chainalysis

January 15, 2026
What Is Venice AI? The Privacy-Focused Chatbot
Web3

What Is Venice AI? The Privacy-Focused Chatbot

January 13, 2026
Two major crypto events canceled after city hit by 18 violent physical attacks on crypto holders amid market downturn
Web3

Two major crypto events canceled after city hit by 18 violent physical attacks on crypto holders amid market downturn

January 12, 2026
Bitcoin Shrugs Off Powell Probe as DOJ Targets Fed Chair
Web3

Bitcoin Shrugs Off Powell Probe as DOJ Targets Fed Chair

January 12, 2026
Should Politicians Be Able to Use Prediction Markets? House Bill Proposes Ban
Web3

Should Politicians Be Able to Use Prediction Markets? House Bill Proposes Ban

January 10, 2026
Altcoins Defy Bitcoin Slump as XRP, Solana Notch Double-Digit Gains
Web3

Altcoins Defy Bitcoin Slump as XRP, Solana Notch Double-Digit Gains

January 9, 2026
Next Post
PayPal’s Move to Zero Fees for International Crypto Transfers

PayPal's Move to Zero Fees for International Crypto Transfers

Bitcoin Hashrate Hits All-Time High as Publicly-Listed Miners’ Share of the Network Peaks

Bitcoin Hashrate Hits All-Time High as Publicly-Listed Miners' Share of the Network Peaks

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Facebook Twitter Instagram Youtube RSS
Blockchain Broadcast

Blockchain Broadcast delivers the latest cryptocurrency news, expert analysis, and in-depth articles. Stay updated on blockchain trends, market insights, and industry innovations with us.

CATEGORIES

  • Altcoin
  • Analysis
  • Bitcoin
  • Blockchain
  • Crypto Exchanges
  • Crypto Updates
  • DeFi
  • Ethereum
  • Metaverse
  • NFT
  • Regulations
  • Scam Alert
  • Uncategorized
  • Web3
No Result
View All Result

SITEMAP

  • About Us
  • Advertise With Us
  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact Us

Copyright © 2024 Blockchain Broadcast.
Blockchain Broadcast is not responsible for the content of external sites.

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In
  • bitcoinBitcoin(BTC)$95,665.00-1.78%
  • ethereumEthereum(ETH)$3,305.61-1.80%
  • tetherTether(USDT)$1.00-0.02%
  • binancecoinBNB(BNB)$931.79-1.63%
  • rippleXRP(XRP)$2.07-3.27%
  • solanaSolana(SOL)$142.23-3.11%
  • usd-coinUSDC(USDC)$1.000.02%
  • tronTRON(TRX)$0.3121793.01%
  • staked-etherLido Staked Ether(STETH)$3,302.89-1.89%
  • dogecoinDogecoin(DOGE)$0.140028-4.94%
No Result
View All Result
  • Home
  • Bitcoin
  • Crypto Updates
    • General
    • Altcoin
    • Ethereum
    • Crypto Exchanges
  • NFT
  • Blockchain
  • Metaverse
  • DeFi
  • Web3
  • Analysis
  • Regulations
  • Scam Alert

Copyright © 2024 Blockchain Broadcast.
Blockchain Broadcast is not responsible for the content of external sites.