GitHub's new Rubber Duck feature pairs Claude models with GPT-5.4 for independent code review, closing 74.7% of the performance gap between Sonnet and Opus. (ReadGitHub's new Rubber Duck feature pairs Claude models with GPT-5.4 for independent code review, closing 74.7% of the performance gap between Sonnet and Opus. (Read

GitHub Copilot CLI Adds Rubber Duck Feature for Cross-Model AI Code Review

2026/04/09 01:06
3 min di lettura
Per feedback o dubbi su questo contenuto, contattateci all'indirizzo crypto.news@mexc.com.

GitHub Copilot CLI Adds Rubber Duck Feature for Cross-Model AI Code Review

Jessie A Ellis Apr 08, 2026 17:06

GitHub's new Rubber Duck feature pairs Claude models with GPT-5.4 for independent code review, closing 74.7% of the performance gap between Sonnet and Opus.

GitHub Copilot CLI Adds Rubber Duck Feature for Cross-Model AI Code Review

GitHub just shipped a feature that addresses one of the most frustrating problems with AI coding assistants: they make confident mistakes that snowball into bigger messes. The new Rubber Duck capability, now available in experimental mode for Copilot CLI, brings in a second AI model from a completely different family to critique the primary agent's work.

Here's the setup: when you're running a Claude model as your main orchestrator, Rubber Duck deploys GPT-5.4 as an independent reviewer. The goal isn't just catching typos—it's questioning architectural decisions before they become expensive technical debt.

The Numbers Worth Knowing

GitHub tested this on SWE-Bench Pro, a benchmark of gnarly real-world coding problems from open-source repos. Claude Sonnet 4.6 paired with Rubber Duck closed 74.7% of the performance gap between Sonnet and the more expensive Opus model running solo.

The gains weren't uniform. Rubber Duck showed the strongest results on complex problems spanning 3+ files that typically require 70+ steps to resolve. On these harder tasks, the Sonnet + Rubber Duck combo scored 3.8% higher than baseline Sonnet, jumping to 4.8% higher on the most difficult problems identified across three trials.

What It Actually Catches

GitHub shared specific examples from their testing. In one OpenLibrary case, Rubber Duck flagged that a proposed scheduler would start and immediately exit without running any jobs—and spotted that even if fixed, one scheduled task contained an infinite loop.

Another catch: a single-line bug in a Solr integration where a loop silently overwrote the same dictionary key on every iteration. Three of four facet categories were being dropped from search queries with zero errors thrown. That's the kind of bug that passes code review and then haunts you in production for months.

A third example involved a NodeBB email confirmation flow where three files all read from a Redis key that new code stopped writing to. The confirmation UI and cleanup paths would have broken silently on deploy.

When It Kicks In

Rubber Duck activates at three checkpoints: after drafting a plan (where GitHub expects the biggest wins), after complex implementations, and after writing tests but before running them. The agent can also call for a critique when it gets stuck in a loop.

Users can trigger a review manually at any point. Copilot queries Rubber Duck, processes the feedback, and shows what changed and why.

The feature works with all Claude family models—Opus, Sonnet, and Haiku—as orchestrators. GitHub says they're already exploring other model family pairings, including options for when GPT-5.4 serves as the primary orchestrator.

To access Rubber Duck, install GitHub Copilot CLI and run the /experimental slash command. You'll need access to GPT-5.4 enabled and a Claude model selected from the picker. Feedback goes to GitHub's community discussion board.

Image source: Shutterstock
  • github
  • ai coding
  • copilot cli
  • developer tools
  • machine learning
Opportunità di mercato
Logo DuckChain
Valore DuckChain (DUCK)
$0.00007197
$0.00007197$0.00007197
-6.78%
USD
Grafico dei prezzi in tempo reale di DuckChain (DUCK)
Disclaimer: gli articoli ripubblicati su questo sito provengono da piattaforme pubbliche e sono forniti esclusivamente a scopo informativo. Non riflettono necessariamente le opinioni di MEXC. Tutti i diritti rimangono agli autori originali. Se ritieni che un contenuto violi i diritti di terze parti, contatta crypto.news@mexc.com per la rimozione. MEXC non fornisce alcuna garanzia in merito all'accuratezza, completezza o tempestività del contenuto e non è responsabile per eventuali azioni intraprese sulla base delle informazioni fornite. Il contenuto non costituisce consulenza finanziaria, legale o professionale di altro tipo, né deve essere considerato una raccomandazione o un'approvazione da parte di MEXC.

Potrebbe anche piacerti

CME Group to launch Solana and XRP futures options in October

CME Group to launch Solana and XRP futures options in October

The post CME Group to launch Solana and XRP futures options in October appeared on BitcoinEthereumNews.com. CME Group is preparing to launch options on SOL and XRP futures next month, giving traders new ways to manage exposure to the two assets.  The contracts are set to go live on October 13, pending regulatory approval, and will come in both standard and micro sizes with expiries offered daily, monthly and quarterly. The new listings mark a major step for CME, which first brought bitcoin futures to market in 2017 and added ether contracts in 2021. Solana and XRP futures have quickly gained traction since their debut earlier this year. CME says more than 540,000 Solana contracts (worth about $22.3 billion), and 370,000 XRP contracts (worth $16.2 billion), have already been traded. Both products hit record trading activity and open interest in August. Market makers including Cumberland and FalconX plan to support the new contracts, arguing that institutional investors want hedging tools beyond bitcoin and ether. CME’s move also highlights the growing demand for regulated ways to access a broader set of digital assets. The launch, which still needs the green light from regulators, follows the end of XRP’s years-long legal fight with the US Securities and Exchange Commission. A federal court ruling in 2023 found that institutional sales of XRP violated securities laws, but programmatic exchange sales did not. The case officially closed in August 2025 after Ripple agreed to pay a $125 million fine, removing one of the biggest uncertainties hanging over the token. This is a developing story. This article was generated with the assistance of AI and reviewed by editor Jeffrey Albus before publication. Get the news in your inbox. Explore Blockworks newsletters: Source: https://blockworks.co/news/cme-group-solana-xrp-futures
Condividi
BitcoinEthereumNews2025/09/17 23:55
Zelenskyy warns Russia aims to involve Belarus in Ukraine conflict

Zelenskyy warns Russia aims to involve Belarus in Ukraine conflict

The post Zelenskyy warns Russia aims to involve Belarus in Ukraine conflict appeared on BitcoinEthereumNews.com. Zelenskyy said Russia is trying to draw Belarus
Condividi
BitcoinEthereumNews2026/04/18 11:12
Bitcoin, Gold, and U.S. Stocks Dive as Trump Pledges to Hit Iran ‘Extremely Hard’

Bitcoin, Gold, and U.S. Stocks Dive as Trump Pledges to Hit Iran ‘Extremely Hard’

The post Bitcoin, Gold, and U.S. Stocks Dive as Trump Pledges to Hit Iran ‘Extremely Hard’ appeared on BitcoinEthereumNews.com. In brief Bitcoin dropped Thursday
Condividi
BitcoinEthereumNews2026/04/02 17:57

USD1 Genesis: 0 Fees + 12% APR

USD1 Genesis: 0 Fees + 12% APRUSD1 Genesis: 0 Fees + 12% APR

New users: stake for up to 600% APR. Limited time!