Methodology · Sources

The sources behind every Preuve report

Preuve AI doesn't rate your idea from training data. Ten agents pull live evidence from 100+ recurring publications across 10 source categories. Every claim in your report links back to one.

10source categories
100+recurring pubs
6,000+citations · 246 reports

Last updated: May 15, 2026 · View as Markdown

How we use these sources

Validation runs as a multi-agent pipeline. Each agent owns a domain (competitors, market, sentiment) and is allowed to call only the source families relevant to its job. The verdict is the convergence point - not a single LLM's opinion.

  1. Step 1

    10 agents query in parallel

    Competitors, market sizing, demand signals, community sentiment. Each agent owns its slice.

  2. Step 2

    Cross-validated across models

    When runs disagree, the verdict reruns until two models agree.

  3. Step 3

    Every claim links back

    No number in a report without a source URL behind it. You can fact-check every line.

Source families

Audit of 246 recent paid Preuve reports, May 2026. Source URLs were extracted and filtered to remove infrastructure and internal domains. Each row shows how often a family appeared as a cited source.

Heavy use

≥ 100 of 246 · 3 families
01

Regulatory filings

144/246

Audited revenue, risk factors, and direct competitor mentions from public filings.

SEC EDGAREUR-LexCompanies House
02

Professional & hiring

135/246

Org charts, hiring velocity, and team composition signal growth and product direction.

LinkedInGlassdoorWellfound
03

Community & social

130/246

Unfiltered pain points, demand language, and founder threads pulled from public posts.

RedditXYouTubeHacker News

Common

50-99 of 246 · 4 families
04

Market research firms

97/246

TAM, CAGR, segmentation, and forecast cited from published industry reports.

Grand ViewMordor IntelligenceStatistaIBISWorld
05

Tech & business press

78/246

Funding announcements, product launches, and pivot reporting from named outlets.

TechCrunchCNBCFortuneThe Information
06

Funding databases

76/246

Round size, lead investors, valuation, and competitor cap-table data.

CrunchbasePitchBookTracxn
07

Regional & international press

62/246

Local market dynamics and emerging-market signals beyond US/EU mainstream coverage.

Economic TimesEU-StartupsMaddynessNikkei Asia

Specialty

< 50 of 246 · 3 families
08

PR & funding wires

49/246

First-party press releases, syndicated funding alerts, and milestone announcements.

PR NewswireBusinessWireGlobeNewswire
09

Review sites & marketplaces

37/246

Verified user reviews, ratings distributions, and unmet-need clusters by product.

G2CapterraTrustpilotProduct Hunt
10

Developer & code

13/246

Open-source traction, dependent counts, and technical depth of competing teams.

GitHubdev.toStack Overflow

Common questions

What sources does Preuve AI use?
Ten families: regulatory filings (SEC EDGAR), professional and hiring data (LinkedIn), community signals (Reddit, Hacker News), market research firms (Grand View, Statista), tech and business press, funding databases, regional press, PR wires, review sites, and developer sources. Each agent calls the families relevant to its slice of the report.
How many sources does a typical Preuve report cite?
Reports cite from across ten source families. The May 2026 audit of 246 paid reports surfaced over 6,000 source URLs across those families, with regulatory filings (the most-cited family) appearing in 144 of the 246 reports.
Is Preuve AI's data live, or trained on a snapshot?
Live. Each scan queries the web at run time, not a training-data snapshot. Every report stamps the moment it ran.
How does Preuve AI verify what it cites?
Every number in a report links to a source URL. Independent models cross-check verdicts; runs rerun on disagreement.
What does "50+ live sources" mean across the site?
Per scan, the ten agents query 50+ live data sources (categories like Google Trends, Reddit, Crunchbase, news feeds, G2). The May 2026 audit of 246 paid reports found those queries surfaced 100+ recurring publications grouped into the 10 families above.