Key takeaways
- Most ideas are not bad, they are unfinished. 76.9% land in the caution zone (40-69), not the no-go pile, and the median score is 55, not 30.
- Distribution kills more startups than competition. No go-to-market plan is the top risk at 29.4%, roughly 2x the rate of a crowded market (14.5%).
- A crowded market is a green flag. Ideas flagged for competitive pressure average 65.4, the highest score of any risk group. An empty market usually means no demand.
- A 70+ score puts you in the top 18%. Only 18.3% of 4,000+ analyzed ideas earned a "go" verdict, and fewer than 0.2% broke 90.
- Preparation beats the idea. Founders who named real competitors, showed demand from outside their own network, and picked a specific buyer scored about 13 points higher.
These startup validation benchmarks come from more than 4,000 completed analyses, free scans and paid deep analyses combined. Each idea is scored against live data sources: competitor databases, Google Trends, Reddit, community forums, and more, with paid deep analyses cross-checking 50+ of them. The first edition covered 1,000 ideas; this second edition, run in April 2026, rebuilds the benchmark tables from comparable analyses so the score bands and risk splits stay consistent.
Every number here comes from real analyses and real follow-up runs. No synthetic data, no ChatGPT-generated ideas. Cite freely.
The Preuve AI Score
The Preuve AI Score is a 0-to-100 startup viability rating computed by 10 AI agents cross-validating claims against 50+ live data sources including competitors, demand signals, market size, community sentiment, hiring trends, and patent activity. Every point is source-linked. A score above 70 indicates launch-ready signals. Below 40 means fundamental validation work remains.

Overall Viability Distribution
The average viability score across the current benchmark set is 57.4 out of 100. The median is 55. The distribution is still centered in the 50-59 range.
| Score range | % of total |
|---|---|
| 0-9 | 0.1% |
| 10-19 | 0.1% |
| 20-29 | 1.0% |
| 30-39 | 3.6% |
| 40-49 | 25.1% |
| 50-59 (peak) | 30.3% |
| 60-69 | 21.5% |
| 70-79 | 15.5% |
| 80-89 | 2.7% |
| 90-100 | 0.1% |
76.9% of all ideas land between 40 and 69. This is the caution zone, where the idea has a kernel of something real but is missing at least one thing that matters: a distribution plan, a clear competitor picture, or enough demand evidence from outside the founder's head.
Fewer than 0.2% of ideas score above 90. A score above 90 means live market data shows strong demand, a clear gap in the competitive landscape, and multiple converging signals. It remains exceptionally rare.
Verdict Breakdown: Go, Caution, No-Go
Every analysis ends with one of three verdicts based on the composite viability score:
18.3%
Go
Avg score: 75.1
76.9%
Caution
Avg score: 54.8
4.8%
No-Go
Avg score: 31.9
The gap between a go idea and a caution idea is an average of 20.3 points. In most cases, that gap comes down to 2-3 fixable issues: unclear positioning, no competitor awareness, no distribution plan. The idea itself is rarely the problem.

Why Go-to-Market Risk Kills More Startups Than Competition
A primary risk label was captured on most analyses in the benchmark set. Here is the distribution across the tagged reports:
| Top risk | % of ideas | Avg score |
|---|---|---|
| Go-to-market | 29.4% | 57.6 |
| Early-stage risk | 25.6% | 50.8 |
| Competitive pressure | 14.5% | 65.4 |
| Regulatory risk | 12.7% | 56.6 |
| Funding requirements | 9.8% | 60.7 |
| Production challenges | 4.1% | 58.0 |
| Team execution | 2.2% | 61.6 |
Minor categories like technical complexity, legal risk, and brand risk account for the remaining 1.6% of tagged analyses. A small share of reports did not have a primary risk label stored in the backfilled column.
Go-to-market risk is the top failure signal for the second year running, flagged in 29.4% of tagged analyses. Nearly 1 in 3 startup ideas have no realistic path to their first customer. The founder has a product concept but no distribution plan, no channel strategy, and no evidence the target buyer is reachable.
The second-biggest risk is early-stage risk at 25.6%. These are ideas that are too vague to properly evaluate: no defined buyer, no pricing reference, no competitive landscape. The average score for this group is 50.8, the lowest of any major risk category.
The competition paradox
Competition ranks third as a risk factor (14.5%), but among the major risk categories it has the highest average score at 65.4. That is counterintuitive until you realize: competition means there is a real market. Ideas with no competitors often have no customers either. A crowded market is not automatically a bad sign. An empty market often is.
Do founders who pay for deeper analysis have better ideas?
Preuve AI offers two tiers: a free quick scan and a paid deep analysis that unlocks competitor research, community signals, and investor-grade sections.
Free Scan
56.5
Average score
Deep Analysis (paid)
69.5
Average score
Paid users average 69.5 vs 56.5 for free users, a gap of 13.0 points. That is not because paying makes your idea better. It is selection bias. Founders who invest $29 in validation tend to be more serious, have already done some homework, and are working from a clearer concept with a more defined market in mind.
The practical takeaway: if you are going to pay for validation, make sure you have done the basic homework first. Define your buyer. Name three competitors. Know what price range the market tolerates. That baseline preparation is worth more than any tool.
How to read this benchmark
This report is designed as a comparative benchmark, not as a census of every analysis ever run on the platform. The goal is to show the score bands, verdict mix, and risk patterns you should treat as normal inside the current scoring system.
In practical terms, read the percentages as directional benchmarks. A score in the 50s is normal. A score in the 70s is strong. A score above 90 is still extremely rare.
What separates a 50 from a 75?
After reviewing thousands of analyses, the gap between a mid-range score and a high score almost always traces back to the same pattern:
1. Named competitors with pricing
Ideas that can name 3+ real competitors and reference their pricing score significantly higher. Not because competition is good, but because it proves the market exists and the founder has done the work. "No competitors" almost always means "I have not looked."
2. Demand evidence from outside the founder's network
Reddit threads, review complaints, Google Trends upticks, community forum posts. External demand signals show that strangers already care about the problem. Without them, the analysis can only score what the founder claims.
3. A specific buyer, not "everyone"
"Small business owners" is not a target market. "Solo accountants in the US who still use spreadsheets for client onboarding" is. Narrow targeting makes every other validation signal sharper: competitors are more relevant, demand is more measurable, pricing is more grounded.
The difference between a 50 and a 75 is not a better idea. It is better homework. Most ideas in the 40-60 range have a real kernel of value - they just have not been pressure-tested against the market yet.

Methodology
This report starts from more than 4,000 completed analyses processed by Preuve AI as of April 6, 2026. The benchmark tables use a cleaned comparison set so the score bands, verdict mix, and risk breakdown stay consistent across the report.
The viability score (0-100) is a composite of market size signals, competitive density, demand evidence, timing, and risk factors. The verdict (go/caution/no-go) follows from the score, and every data point links to its original source within the report.
The primary-risk section uses the subset of analyses where a top-risk label was explicitly stored. This dataset includes both free scans and paid deep analyses, and it also includes follow-up improved iterations tied to founder ideas. It does not include agency scans or investor packages. All data is anonymized. No individual ideas, founder names, or company names are disclosed.
For the full validation methodology, read how Preuve AI validates startup ideas. For the scoring formula, see how viability scores work.
FAQ
What percentage of startup ideas pass validation?
In the current benchmark set, 18.3% of startup ideas scored 70 or above. Another 76.9% landed in the caution zone between 40 and 69, while 4.8% scored below 40.
What is the most common reason startup ideas fail validation?
Go-to-market risk is still the top failure signal. In the analyses with an explicit primary risk tag, it appears in 29.4% of cases. Early-stage risk ranks second at 25.6%, and competitive pressure ranks third at 14.5%.
What is a good startup viability score?
The current benchmark set has a median viability score of 55 and an average score of 57.4. Ideas scoring 70 or above are high-viability, and fewer than 0.2% score above 90.
What is the startup failure rate in 2026?
In the current 2026 benchmark set, 4.8% of startup ideas receive a no-go verdict with a score below 40. Most ideas, 76.9%, land in the caution zone, and 18.3% pass validation with a score of 70 or higher.
How does Preuve AI calculate startup benchmarks?
Preuve AI scores startup ideas from 0 to 100 against live data sources including competitor databases, Google Trends, Reddit, Crunchbase, and community forums, with paid deep analyses cross-checking 50+ of them. For this report, Preuve started from more than 4,000 completed analyses, free scans and paid deep analyses combined, and rebuilt the benchmark tables from a like-for-like comparison set so the score bands stay internally consistent.
Vincent
5 years in B2B growth, building Preuve AI in public. 82% of ideas it scores aren't ready, the point is finding out in 5 minutes, not 3 months.
Follow on X →Building is expensive. Validation is free.
Run your idea through 10 AI agents before you write a line of code. Every claim source-linked.





