Trade: Will any AI model reach ___ Coding Arena Score by December 31?

Resolution criteria on PolyGram: This market will resolve to "Yes" if any model on the Arena.AI Leaderboard (arena.ai/leaderboard/text) reaches at least the specified Arena Score on the "Leaderboard" tab for "Coding" by December 31, 2026, 11:59 PM ET. Otherwise, this market will resolve to "No". Results from the "Score" column under the "Text Arena | Coding" Leaderboard tab at https://arena.ai/leaderboard/text/coding-no-style-control with style control off will be used to resolve this market. The resolution source for this market is the Chatbot Arena LLM Leaderboard found at arena.ai/leaderboard/text.

PolyGram is an on-chain prediction market where you trade YES or NO outcome shares with real USDC on Polygon. For this market, buy YES if you believe the event will happen, or NO if you think it won't. Your maximum loss is your stake — winning shares pay $1.00 each at resolution. Unlike sportsbooks, there is no house edge: prices are set by supply and demand from other traders and reflect the crowd's real-time probability.

Liquidity: $6K
Total Volume: $3K
24h Volume: (no value shown)
Open Interest: $629
Market outcomes

1560: 83% YES, 17% NO
1580: 54% YES, 46% NO
1600: 32% YES, 68% NO
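
Because the three thresholds are nested (a score of at least 1600 also clears 1580 and 1560), the YES prices can be differenced to read off an implied probability for each score band. The following is a minimal Python sketch that treats the displayed YES percentages as probabilities and ignores spreads and fees, both of which are simplifying assumptions. `implied_bands` is an illustrative helper, not part of any PolyGram API.

```python
# YES prices (as probabilities) copied from the outcomes listed above.
yes_price = {1560: 0.83, 1580: 0.54, 1600: 0.32}

def implied_bands(yes_by_threshold):
    """Difference nested 'at least X' prices into per-band probabilities."""
    thresholds = sorted(yes_by_threshold)
    bands = {}
    # Probability that no model reaches even the lowest threshold.
    bands[f"below {thresholds[0]}"] = 1.0 - yes_by_threshold[thresholds[0]]
    for lo, hi in zip(thresholds, thresholds[1:]):
        # P(lo <= best score < hi) = P(>= lo) - P(>= hi)
        bands[f"[{lo}, {hi})"] = yes_by_threshold[lo] - yes_by_threshold[hi]
    # Probability that the highest threshold is reached.
    bands[f"{thresholds[-1]} or higher"] = yes_by_threshold[thresholds[-1]]
    return {band: round(p, 4) for band, p in bands.items()}

for band, p in implied_bands(yes_price).items():
    print(f"{band}: {p:.0%}")
```

A negative value for any band would mean the nested prices are inconsistent with each other, which can happen in thin books where each threshold trades separately.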

Market context

The question centres on whether any large language model will reach a specified coding score on the Chatbot Arena leaderboard by year-end 2026. The Arena's coding evaluation measures model capability across practical programming tasks, with scores aggregated from comparative human judgements. The current order book prices the 1580 threshold at 54% YES, suggesting traders view that target as moderately challenging but achievable before the deadline.

Historical progression on the Chatbot Arena coding track shows consistent capability gains across model releases. GPT-4o, Claude 3.5 Sonnet, and other frontier models have demonstrated measurable score improvements with each major iteration. The window to year-end 2026 should be read against typical development cycles for leading labs, most of which release significant model updates annually or biannually. Comparable benchmarks on coding-specific leaderboards (HumanEval, LiveCodeBench) have seen steady score inflation, though the rate of improvement occasionally plateaus as models approach task saturation on certain problem classes.

Traders should monitor announcements from Anthropic, OpenAI, Google DeepMind, and other major labs regarding model releases and capability claims. The specific score threshold will determine probability calibration; higher targets require either breakthrough architectural advances or substantial scaling investments. Changes to the Arena's evaluation methodology or scoring system could also shift market dynamics, though the resolution criteria reference the current leaderboard structure. Broader AI funding trends and compute availability will influence development velocity across the sector through 2026.

How this market resolves

Resolution is handled by the UMA optimistic oracle on Polygon. A proposer submits the outcome, a two-hour dispute window opens, and if no one stakes a counter-claim the payout is final. Contested outcomes escalate to UMA token-holder voting. Payouts clear in USDC to the winning side.
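
The flow described above (propose, wait out the dispute window, then finalise or escalate) can be sketched as a small state function. This is a toy model for illustration only: the `Status` names and `resolution_status` helper are hypothetical and do not correspond to the UMA contracts' actual interface.

```python
from enum import Enum

class Status(Enum):
    PROPOSED = "proposed, dispute window still open"
    FINAL = "final, payout can settle"
    ESCALATED = "escalated to UMA token-holder vote"

# The two-hour dispute window stated in the text above.
DISPUTE_WINDOW_SECONDS = 2 * 60 * 60

def resolution_status(seconds_since_proposal, disputed):
    """Return the state of a proposed outcome.

    disputed: True if a counter-claim was staked inside the window.
    """
    if disputed:
        return Status.ESCALATED
    if seconds_since_proposal >= DISPUTE_WINDOW_SECONDS:
        return Status.FINAL
    return Status.PROPOSED

print(resolution_status(30 * 60, disputed=False).value)
print(resolution_status(3 * 60 * 60, disputed=False).value)
```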

How to trade this market step by step

The mechanics for trading "Will any AI model reach ___ Coding Arena Score by December 31?" are the same as any other PolyGram event contract. Each YES share resolves to $1 if the event happens, or $0 if it doesn't. The current price between 0¢ and 100¢ is the market's probability estimate, set live by the order book.

  1. Sign in on polygram.ink with your email — no full KYC under $1,500 lifetime trading volume.
  2. Deposit USDC on Polygon (lowest fees, ~$0.01 per transaction) or Ethereum. Funds credit after 12 confirmations.
  3. Pick a side. Buy YES if you believe the event will happen; buy NO if you think it won't. The current YES price reflects the market's collective probability.
  4. Size your position. If you stake 100 USDC at 50% YES, you'll receive shares that pay $200 if YES resolves true — a 100% gross return. If NO resolves, your shares are worth $0.
  5. Set risk controls (optional). Stop-loss, take-profit, and limit-order types all supported. Use the trade ticket's slippage box to cap your maximum entry price.
  6. Wait for resolution. When the event resolves on-chain via the UMA optimistic oracle, the winning side settles to 100¢ automatically and USDC hits your balance within seconds. Withdrawable to any wallet you control.
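
The sizing arithmetic in step 4 can be checked with a short calculation. The sketch below assumes no fees and no slippage, so the whole stake fills at the quoted price; `position_outcome` is an illustrative helper name, not a PolyGram function.

```python
def position_outcome(stake_usdc, price):
    """Shares bought, payout if the side wins, and return on stake.

    price is the per-share cost in dollars (0.50 for a 50% quote).
    Fees and slippage are ignored, which is a simplifying assumption.
    """
    if not 0 < price < 1:
        raise ValueError("price must be strictly between 0 and 1")
    shares = stake_usdc / price
    payout_if_win = shares * 1.00  # each winning share settles at $1.00
    profit_if_win = payout_if_win - stake_usdc
    return {
        "shares": shares,
        "payout_if_win": payout_if_win,
        "profit_if_win": profit_if_win,
        "return_if_win": profit_if_win / stake_usdc,
        "loss_if_lose": stake_usdc,  # losing shares are worth $0
    }

# The example from step 4: 100 USDC staked at a 50% YES price.
print(position_outcome(100, 0.50))
```

At a 50% price the stake buys 200 shares, which matches the $200 payout and 100% gross return quoted in step 4. The same function can be called with any other quoted price to compare the thresholds listed for this market.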

How active is this market?

$3K in lifetime turnover and $6K of resting liquidity puts this market below the median by volume for tech contracts on PolyGram. Order-book depth is thin, so large orders may need to be split across the book or executed as limit orders.

The market has been open for around a month — fresh enough that information asymmetry remains a real factor.

Higher-volume markets tend to have tighter spreads and faster price discovery — meaning the displayed YES/NO percentages are more likely to reflect the true crowd-implied probability rather than a single trader's directional view.

Key terms

YES / NO share
A binary outcome token. A YES share pays $1.00 if the underlying claim resolves true, a NO share pays $1.00 if it resolves false, and either pays $0 otherwise. The market price between 0¢ and 100¢ is the implied probability.
CLOB
Central limit order book. The matching engine that pairs YES buyers with NO buyers (effectively the same trade). Polymarket's CLOB on Polygon executes trades on-chain via the conditional-tokens framework.
Liquidity
USDC capital sitting in resting limit orders inside the order book. Deeper liquidity means smaller slippage on large trades and a tighter bid-ask spread.
UMA optimistic oracle
The on-chain dispute system that settles each Polymarket market. A proposer submits the outcome, a two-hour challenge window opens, and unchallenged proposals finalise the resolution.
Slippage
The difference between the displayed mid-price and your fill price. Affects market orders most; limit orders avoid slippage but may take time to fill.
Conditional token
ERC-1155 outcome share issued by Gnosis Conditional Tokens on Polygon. The token type that resolves to $1.00 or $0.00 at settlement.
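
The liquidity and slippage definitions above can be made concrete by walking a market order through a small order book. This is a sketch under stated assumptions: the book levels are made-up numbers, the `(price, shares)` tuple layout is hypothetical, and fees are ignored.

```python
def fill_market_buy(asks, budget_usdc):
    """Walk resting asks, cheapest first, until the budget is spent.

    asks: list of (price, shares_available) tuples (hypothetical layout).
    Returns (shares_bought, average_fill_price).
    """
    shares_bought = 0.0
    spent = 0.0
    for price, available in sorted(asks):
        if spent >= budget_usdc:
            break
        affordable = (budget_usdc - spent) / price
        take = min(available, affordable)
        shares_bought += take
        spent += take * price
    if shares_bought == 0:
        raise ValueError("no liquidity available for this order")
    return shares_bought, spent / shares_bought

def slippage(mid_price, fill_price):
    """Difference between the average fill price and the displayed mid-price."""
    return fill_price - mid_price

# Made-up thin book: little size near the top, more size further away.
asks = [(0.55, 100), (0.57, 100), (0.60, 500)]
shares, avg_price = fill_market_buy(asks, budget_usdc=200)
print(f"bought {shares:.2f} shares at an average of {avg_price:.4f}")
print(f"slippage vs a 0.54 mid: {slippage(0.54, avg_price):.4f}")
```

A limit order avoids this effect by refusing any level above its stated price, at the cost of possibly remaining unfilled.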

Frequently asked questions

How does this market resolve?

Resolution is handled by the UMA optimistic oracle on Polygon. A proposer submits the outcome, a 2-hour dispute window opens, and if uncontested the payout is final. Contested outcomes escalate to UMA token holders.

When does this market close?

This prediction market is scheduled to close on 31 December 2026. After the resolving event occurs, settlement typically clears within 24 hours once the UMA optimistic oracle confirms the outcome. All payouts are in USDC on the Polygon network.

How can I trade on "Will any AI model reach ___ Coding Arena Score by December 31?"?

To trade on this prediction market, create a free PolyGram account at polygram.ink, deposit USDC via Polygon, and place a YES or NO order on the outcome you believe in. You can learn more on our how-it-works page. Your maximum loss is limited to your stake — there is no leverage or margin.

What happens when the market resolves?

When the outcome is determined, shares on the winning side pay out $1.00 each in USDC, while losing shares pay $0. Settlement is handled by the UMA optimistic oracle on Polygon: a proposer submits the result, a two-hour dispute window opens, and if uncontested, payouts are distributed automatically. You can withdraw your winnings to any Polygon wallet.

Risk and regulatory note

Prediction-market positions can lose 100% of staked capital. Outcomes are uncertain by definition — historical accuracy of crowd-implied probabilities is high in aggregate but not for any single market. PolyGram does not provide investment advice. Trade only with capital you can afford to lose.

Regulatory status varies by jurisdiction. The United States, Germany, and most other EU countries each treat Polymarket-style event contracts under one of three frameworks: financial derivative, gambling product, or unregulated novel asset. Consult local counsel before trading.
