---
title: "Sources - How Preuve AI cites live evidence"
slug: sources
description: "The 10 source families behind every Preuve AI report. Audit of 246 paid reports."
canonical: https://preuve.ai/sources
audit_date: 2026-05-15
reports_audited: 246
source_families: 10
---

# The sources behind every Preuve report

> **TL;DR:** Preuve AI cites live evidence from 100+ recurring publications across 10 source families. A May 2026 audit of 246 paid reports surfaced over 6,000 source-URL occurrences, with regulatory filings appearing in 144 of the 246 reports. Every claim in every paid report links back to a source URL.

Preuve AI doesn't rate your idea from training data. Ten agents pull live evidence from 100+ recurring publications across 10 source families. Every claim in your report links back to a source URL.

## Audit summary

- 10 source families
- 100+ recurring publications
- 6,000+ source-URL occurrences
- 246 paid Preuve reports audited (May 2026)

## How we use these sources

Validation runs as a multi-agent pipeline. Each agent owns a domain (competitors, market, sentiment) and is allowed to call only the source families relevant to its job. The verdict is the convergence point, not a single LLM's opinion.
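
The per-agent restriction described above can be pictured as an allowlist. This is a minimal sketch, not Preuve's implementation: the agent names come from the paragraph above, but the family identifiers and the agent-to-family assignments are hypothetical.

```python
# Hypothetical allowlist: which source families each agent may call.
# The assignments below are illustrative assumptions, not Preuve's real config.
ALLOWED_FAMILIES = {
    "competitors": {"regulatory_filings", "funding_databases", "review_sites"},
    "market": {"market_research", "tech_press"},
    "sentiment": {"community_social", "review_sites"},
}


def can_query(agent: str, family: str) -> bool:
    """Return True only if the family is on the agent's allowlist."""
    return family in ALLOWED_FAMILIES.get(agent, set())


def filter_requests(agent: str, requested: list[str]) -> list[str]:
    """Drop any requested family the agent is not allowed to call."""
    return [family for family in requested if can_query(agent, family)]
```

An unknown agent gets an empty allowlist here, so the sketch denies by default rather than allowing by default.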

### Step 1: 10 agents query in parallel
Competitors, market sizing, demand signals, community sentiment. Each agent owns its slice.
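
Parallel querying of this kind is often written with `asyncio`. The sketch below assumes each agent is an async function; `run_agent` is a hypothetical placeholder that returns an empty result instead of calling real sources, and only the four slices named above are listed.

```python
import asyncio

# Slices named in the text above; the identifiers are illustrative.
AGENT_SLICES = ["competitors", "market_sizing", "demand_signals", "community_sentiment"]


async def run_agent(slice_name: str) -> dict:
    """Hypothetical agent: a real one would query its live sources here."""
    await asyncio.sleep(0)  # stand-in for network I/O
    return {"slice": slice_name, "findings": []}


async def run_all(slices: list[str]) -> list[dict]:
    # gather() runs the agents concurrently and keeps results in input order.
    return await asyncio.gather(*(run_agent(s) for s in slices))


def run_pipeline(slices: list[str] = AGENT_SLICES) -> list[dict]:
    """Synchronous entry point that runs every agent to completion."""
    return asyncio.run(run_all(slices))
```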

### Step 2: Cross-validated across models
When models disagree on the verdict, the run repeats until two models agree.
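
The rerun rule above can be sketched as a small loop. This assumes each model is a zero-argument callable returning a verdict string. The `max_rounds` cap is an added assumption so the loop always terminates; the text itself does not state a limit.

```python
from collections import Counter
from typing import Callable, Optional, Sequence


def converge(models: Sequence[Callable[[], str]], max_rounds: int = 3) -> Optional[str]:
    """Rerun every model until at least two return the same verdict.

    Returns the agreed verdict, or None if no two models agree within
    max_rounds (the cap is an assumption, not part of the source text).
    """
    if len(models) < 2:
        raise ValueError("agreement needs at least two models")
    for _ in range(max_rounds):
        verdicts = [model() for model in models]
        verdict, votes = Counter(verdicts).most_common(1)[0]
        if votes >= 2:
            return verdict
    return None
```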

### Step 3: Every claim links back
No number in a report without a source URL behind it. You can fact-check every line.
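
One way to enforce a rule like this is a check that flags any numeric claim without a source URL. The sketch below is illustrative only: the `Claim` type and the "contains a digit" heuristic are assumptions, not Preuve's data model.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class Claim:
    """Hypothetical claim record: a sentence plus its optional source URL."""
    text: str
    source_url: Optional[str] = None


HAS_NUMBER = re.compile(r"\d")


def unsourced_numeric_claims(claims: list[Claim]) -> list[Claim]:
    """Return claims that contain a digit but lack an http(s) source URL."""
    missing = []
    for claim in claims:
        has_source = bool(claim.source_url) and claim.source_url.startswith(
            ("http://", "https://")
        )
        if HAS_NUMBER.search(claim.text) and not has_source:
            missing.append(claim)
    return missing
```

A report would pass this check only when the returned list is empty.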

## Source families

Audit of 246 recent paid Preuve reports, May 2026. Source URLs were extracted and filtered to remove infrastructure and internal domains. Each entry shows the number of reports, out of 246, in which a family appeared as a cited source.
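
The extraction and filtering step described above could look roughly like this. It is a sketch under stated assumptions: the URL regex, the internal-domain list, and the three-entry domain-to-family map are all hypothetical stand-ins for whatever the audit actually used.

```python
import re
from collections import Counter
from urllib.parse import urlparse

URL_PATTERN = re.compile(r"https?://[^\s)\]>]+")

# Assumed filter list and a tiny illustrative domain-to-family map.
INTERNAL_DOMAINS = {"preuve.ai"}
FAMILY_BY_DOMAIN = {
    "sec.gov": "Regulatory filings",
    "reddit.com": "Community & social",
    "crunchbase.com": "Funding databases",
}


def domain_of(url: str) -> str:
    """Lowercased hostname with a leading 'www.' removed."""
    host = (urlparse(url).hostname or "").lower()
    return host[4:] if host.startswith("www.") else host


def families_in_report(text: str) -> set[str]:
    """Families cited at least once in one report, internal domains excluded."""
    families = set()
    for url in URL_PATTERN.findall(text):
        domain = domain_of(url)
        if domain in INTERNAL_DOMAINS:
            continue
        family = FAMILY_BY_DOMAIN.get(domain)
        if family:
            families.add(family)
    return families


def reports_per_family(reports: list[str]) -> Counter:
    """Count, per family, how many reports cite it at least once."""
    counts = Counter()
    for text in reports:
        counts.update(families_in_report(text))
    return counts
```

Counting a set per report, rather than every URL, is what yields "N of 246" figures: a report with five filings links still counts once for that family.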

### Heavy use (>= 100 of 246 - 3 families)

1. **Regulatory filings** - 144 of 246. Audited revenue, risk factors, and direct competitor mentions from public filings. Examples: SEC EDGAR, EUR-Lex, Companies House.
2. **Professional & hiring** - 135 of 246. Org charts, hiring velocity, and team composition signal growth and product direction. Examples: LinkedIn, Glassdoor, Wellfound.
3. **Community & social** - 130 of 246. Unfiltered pain points, demand language, and founder threads pulled from public posts. Examples: Reddit, X, YouTube, Hacker News.

### Common (50-99 of 246 - 4 families)

4. **Market research firms** - 97 of 246. TAM, CAGR, segmentation, and forecast cited from published industry reports. Examples: Grand View, Mordor Intelligence, Statista, IBISWorld.
5. **Tech & business press** - 78 of 246. Funding announcements, product launches, and pivot reporting from named outlets. Examples: TechCrunch, CNBC, Fortune, The Information.
6. **Funding databases** - 76 of 246. Round size, lead investors, valuation, and competitor cap-table data. Examples: Crunchbase, PitchBook, Tracxn.
7. **Regional & international press** - 62 of 246. Local market dynamics and emerging-market signals beyond US/EU mainstream coverage. Examples: Economic Times, EU-Startups, Maddyness, Nikkei Asia.

### Specialty (< 50 of 246 - 3 families)

8. **PR & funding wires** - 49 of 246. First-party press releases, syndicated funding alerts, and milestone announcements. Examples: PR Newswire, BusinessWire, GlobeNewswire.
9. **Review sites & marketplaces** - 37 of 246. Verified user reviews, ratings distributions, and unmet-need clusters by product. Examples: G2, Capterra, Trustpilot, Product Hunt.
10. **Developer & code** - 13 of 246. Open-source traction, dependent counts, and technical depth of competing teams. Examples: GitHub, dev.to, Stack Overflow.
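
The three tiers above follow directly from the per-family counts. As a quick check, this sketch buckets the ten counts listed in this section using the stated thresholds and computes each family's share of the 246 reports. The function names are illustrative.

```python
TOTAL_REPORTS = 246

# Counts copied from the ten entries in this section.
FAMILY_COUNTS = {
    "Regulatory filings": 144,
    "Professional & hiring": 135,
    "Community & social": 130,
    "Market research firms": 97,
    "Tech & business press": 78,
    "Funding databases": 76,
    "Regional & international press": 62,
    "PR & funding wires": 49,
    "Review sites & marketplaces": 37,
    "Developer & code": 13,
}


def tier(count: int) -> str:
    """Map a report count to the tier thresholds used in the headings."""
    if count >= 100:
        return "Heavy use"
    if count >= 50:
        return "Common"
    return "Specialty"


def group_by_tier(counts: dict[str, int]) -> dict[str, list[str]]:
    """Group family names under their tier, preserving input order."""
    tiers = {"Heavy use": [], "Common": [], "Specialty": []}
    for family, count in counts.items():
        tiers[tier(count)].append(family)
    return tiers


def share(count: int, total: int = TOTAL_REPORTS) -> float:
    """Percentage of audited reports citing the family, to one decimal."""
    return round(100 * count / total, 1)
```

For example, `share(144)` gives 58.5, so regulatory filings appear in a little under three in five audited reports.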

## Common questions

### What sources does Preuve AI use?
Ten families: regulatory filings (SEC EDGAR), professional and hiring data (LinkedIn), community signals (Reddit, Hacker News), market research firms (Grand View, Statista), tech and business press, funding databases, regional press, PR wires, review sites, and developer sources. Each agent calls the families relevant to its slice of the report.

### How many sources does a typical Preuve report cite?
Reports cite from across ten source families. The May 2026 audit of 246 paid reports surfaced over 6,000 source-URL occurrences across those families, an average of more than 24 per report (6,000 / 246 ≈ 24.4). Regulatory filings, the most-cited family, appeared in 144 of the 246 reports.

### Is Preuve AI's data live, or trained on a snapshot?
Live. Each scan queries the web at run time, not a training-data snapshot. Every report stamps the moment it ran.

### How does Preuve AI verify what it cites?
Every number in a report links to a source URL. Independent models cross-check each verdict, and the run repeats when they disagree.

### What does "50+ live sources" mean across the site?
Per scan, the ten agents query 50+ live data sources (categories like Google Trends, Reddit, Crunchbase, news feeds, G2). The May 2026 audit of 246 paid reports found those queries surfaced 100+ recurring publications grouped into the 10 families above.

## Canonical

- HTML: https://preuve.ai/sources
- Markdown: https://preuve.ai/sources.md
- Audit date: 2026-05-15