Kwegg

Forecasting Future with AI

๐Ÿ“ง Email me top thoughts like this
Exploring๐ŸŒ Public
paragยท6 hours ago

Question / Claim

What kinds of real-world events can language models meaningfully forecast?

Key Assumptions

  • Forecastability depends more on signal structure than reasoning difficulty(high confidence)
  • News coverage provides sufficient leading indicators for institutional events(medium confidence)
  • Calibration is more important than raw accuracy in forecasting(high confidence)

Evidence & Observations

  • The OpenForesight paper shows strongest performance on institutional and process-driven events like elections and appointments(citation)
  • OpenForesight evaluation shows highest accuracy and calibration on institutional political events such as elections, confirmations, and government appointments.(citation)
  • Prior forecasting research shows calibrated probabilistic models outperform raw accuracy metrics in political and geopolitical forecasting.(citation)
  • Prediction markets and superforecaster studies demonstrate that process-driven events with public signals are more forecastable than shock-based events.(citation)
  • There is no methodological or empirical connection between LLM-based forecasting and astrological or religious prediction systems; modern forecasting relies on probabilistic calibration, falsifiability, and empirical signals, whereas astrology and religion operate as belief-based, non-calibrated systems.(citation)
  • The main triggering work for this discussion is the OpenForesight paper, which empirically evaluates which classes of future events are forecastable by LLMs using calibrated, open-ended predictions.(citation)

Open Uncertainties

  • How well does this generalize beyond news-driven domains?
  • Can models forecast longer time horizons without severe degradation?

Current Position

LLMs can meaningfully forecast structured, institutional, and process-driven events with clear timelines and leading indicators (e.g., elections, political appointments, regulatory approvals, corporate mergers), but perform poorly on chaotic shocks (wars, disasters), reflexive domains (financial markets), or purely individual, private human decisions.

This is work-in-progress thinking, not a final conclusion.

References(4)

  1. 1.^
    "The Logic of Political Forecasting"โ†—gjopen.comโ€” Academic analysis of calibration and forecastability in political events.
  2. 2.^
    "Superforecasting Study (IARPA)"โ†—rand.orgโ€” Empirical evidence on which classes of events humans and models can forecast reliably.
  3. 3.^
    "Scientific Prediction"โ†—plato.stanford.eduโ€” Philosophical overview of what distinguishes scientific forecasting from non-scientific predictive belief systems.
  4. 4.^
    "Scaling Open-Ended Reasoning to Predict the Future"โ†—arxiv.orgโ€” Primary research paper introducing the OpenForesight dataset and demonstrating open-ended probabilistic forecasting with LLMs.
0
3A6E2U
โ€ข
Login to vote

Engage with this Thought

Comments

No comments yet. Be the first to share your thoughts!

Related Thoughts