TL;DR
4,000+ startup ideas analyzed against 40+ live data sources. Median score: 55/100. Only 18.3% earned a "go" verdict. The #1 killer is not competition (14.5%). It is having no go-to-market plan (29.4%). Fewer than 0.2% scored above 90.
This is the second edition of my startup validation benchmarks report. The first edition covered 1,000 ideas. This edition starts from more than 4,000 completed analyses run through Preuve AI, where each idea was scored against 40+ live data sources including competitor databases, Google Trends, Reddit, and community forums.
For the benchmark tables below, I rebuilt the benchmark set from comparable completed analyses so the score bands and risk splits stay internally consistent across the report.
Every number in this report comes from real analyses and real follow-up runs inside Preuve AI. No synthetic data. No ChatGPT-generated ideas. These are real people testing business concepts they planned to build or were already building.
I am publishing this as a reference for anyone researching startup failure rates, validation benchmarks, or idea viability data. Cite freely.
The Preuve AI Score
The Preuve AI Score is a 0-to-100 startup viability rating computed by 10 AI agents cross-validating claims against 40+ live data sources including competitors, demand signals, market size, community sentiment, hiring trends, and patent activity. Every point is source-linked. A score above 70 indicates launch-ready signals. Below 40 means fundamental validation work remains.

Overall Viability Distribution
The average viability score across the current benchmark set is 57.4 out of 100. The median is 55. The distribution is still centered in the 50-59 range.
| Score range | % of total |
|---|---|
| 0-9 | 0.1% |
| 10-19 | 0.1% |
| 20-29 | 1.0% |
| 30-39 | 3.6% |
| 40-49 | 25.1% |
| 50-59 (peak) | 30.3% |
| 60-69 | 21.5% |
| 70-79 | 15.5% |
| 80-89 | 2.7% |
| 90-100 | 0.1% |
76.9% of all ideas land between 40 and 69. This is the caution zone, where the idea has a kernel of something real but is missing at least one critical ingredient: a distribution plan, a clear competitor picture, or enough demand evidence.
Fewer than 0.2% of ideas score above 90. A score above 90 means live market data shows strong demand, a clear gap in the competitive landscape, and multiple converging signals. It remains exceptionally rare.
Verdict Breakdown: Go, Caution, No-Go
Every analysis ends with one of three verdicts based on the composite viability score:
18.3%
Go
Avg score: 75.1
76.9%
Caution
Avg score: 54.8
4.8%
No-Go
Avg score: 31.9
The gap between a go idea and a caution idea is an average of 20.3 points. In most cases, that gap comes from 2-3 fixable issues: unclear positioning, no competitor awareness, or a missing distribution plan. The idea itself is rarely the problem.

Why Go-to-Market Risk Kills More Startups Than Competition
A primary risk label was captured on most analyses in the benchmark set. Here is the distribution across the tagged reports:
| Top risk | % of ideas | Avg score |
|---|---|---|
| Go-to-market | 29.4% | 57.6 |
| Early-stage risk | 25.6% | 50.8 |
| Competitive pressure | 14.5% | 65.4 |
| Regulatory risk | 12.7% | 56.6 |
| Funding requirements | 9.8% | 60.7 |
| Production challenges | 4.1% | 58.0 |
| Team execution | 2.2% | 61.6 |
Minor categories like technical complexity, legal risk, and brand risk account for the remaining 1.6% of tagged analyses. A small share of reports did not have a primary risk label stored in the backfilled column.
Go-to-market risk is the #1 failure signal for the second year running, flagged in 29.4% of tagged analyses. This means nearly 1 in 3 startup ideas have no realistic path to their first customer. The founder has a product concept but no distribution plan, no channel strategy, and no evidence that the target buyer is reachable.
The second-biggest risk is early-stage risk at 25.6%. These are ideas that are too vague to properly evaluate: no defined buyer, no pricing reference, no competitive landscape. The average score for this group is 50.8, the lowest of any major risk category.
The competition paradox
Competition ranks third as a risk factor (14.5%), but among the major risk categories it has the highest average score at 65.4. That is counterintuitive until you realize: competition means there is a real market. Ideas with no competitors often have no customers either. A crowded market is not automatically a bad sign. An empty market often is.
Do founders who pay for deeper analysis have better ideas?
Preuve AI offers two tiers: a free quick scan and a paid deep analysis that unlocks competitor research, community signals, and investor-grade sections.
Free Scan
56.5
Average score
Deep Analysis (paid)
69.5
Average score
Paid users average 69.5 vs 56.5 for free users, a gap of 13.0 points. This is not because paying makes your idea better. It is selection bias: founders who invest $29 in validation tend to be more serious. They have already done some homework. They have a clearer concept and a more defined market.
The practical takeaway: if you are going to pay for validation, make sure you have done the basic homework first. Define your buyer. Name three competitors. Know what price range the market tolerates. That baseline preparation is worth more than any tool.
How to read this benchmark
This report is designed as a comparative benchmark, not as a census of every analysis ever run on the platform. The goal is to show the score bands, verdict mix, and risk patterns you should treat as normal inside the current scoring system.
In practical terms, read the percentages as directional benchmarks. A score in the 50s is normal. A score in the 70s is strong. A score above 90 is still extremely rare.
What separates a 50 from a 75?
After reviewing thousands of analyses, the gap between a mid-range score and a high score almost always comes down to three things:
1. Named competitors with pricing
Ideas that can name 3+ real competitors and reference their pricing score significantly higher. Not because competition is good, but because it proves the market exists and the founder has done the work. "No competitors" almost always means "I have not looked."
2. Demand evidence from outside the founder's network
Reddit threads, review complaints, Google Trends upticks, community forum posts. External demand signals show that strangers already care about the problem. Without them, the analysis can only score what the founder claims.
3. A specific buyer, not "everyone"
"Small business owners" is not a target market. "Solo accountants in the US who still use spreadsheets for client onboarding" is. Narrow targeting makes every other validation signal sharper: competitors are more relevant, demand is more measurable, pricing is more grounded.
The difference between a 50 and a 75 is not a better idea. It is better homework. Most ideas that score in the 40-60 range have a real kernel of value. They just have not been pressure-tested against the market yet.

Key takeaways for founders
- 1.Most ideas are not terrible, they are unfinished. 76.9% land in the caution zone, not the no-go zone. The median score is 55, not 30.
- 2.Distribution kills more ideas than competition. Go-to-market risk (29.4%) is about 2x more common than competitive pressure (14.5%) in the tagged cohort. Stop worrying about who else is building this. Start worrying about how your first customer finds you.
- 3.Competition is a positive signal. Among the major risk categories, ideas flagged for competitive pressure average 65.4, the highest score. A crowded market means proven demand.
- 4.Scoring above 90 is extremely rare. Fewer than 0.2% of ideas in this benchmark set score above 90. If your idea scores 70+, you are already in the top 18%.
- 5.Preparation matters more than the idea. Paid users score 13.0 points higher on average, not because paying helps, but because serious founders do their homework first.
Methodology
This report starts from more than 4,000 completed analyses processed by Preuve AI as of April 6, 2026. The benchmark tables use a cleaned comparison set so the score bands, verdict mix, and risk breakdown stay consistent across the report.
The viability score (0-100) is a composite of market size signals, competitive density, demand evidence, timing indicators, and risk factors. The verdict (go/caution/no-go) is derived from the score. Every data point is linked to its original source within each report.
The primary-risk section uses the subset of analyses where a top-risk label was explicitly stored. This dataset includes both free scans and paid deep analyses, and it also includes follow-up improved iterations tied to founder ideas. It does not include agency scans or investor packages. All data is anonymized. No individual ideas, founder names, or company names are disclosed.
For the full validation methodology, read how Preuve AI validates startup ideas. For the scoring formula, see how viability scores work.
Frequently Asked Questions
What percentage of startup ideas pass validation?
In the current benchmark set, 18.3% received a go verdict (score 70+). Another 76.9% received a caution verdict, meaning the idea has potential but needs significant work. Only 4.8% received a no-go verdict with a score below 40.
What is the most common reason startup ideas fail validation?
Go-to-market risk is the number one failure signal, flagged in 29.4% of the analyses with a stored primary-risk label. Founders build products without a distribution plan. The second most common risk is early-stage risk at 25.6%. Competition, which founders fear most, ranks third at 14.5%.
What is a good startup viability score?
The median viability score across the current benchmark set is 55 out of 100. Ideas scoring 70 or above, 18.3% of that set, are considered high-viability. The average score is 57.4. Fewer than 0.2% score above 90.
What is the startup failure rate in 2026?
In the current 2026 benchmark set, 4.8% of startup ideas receive a no-go verdict based on live market data. Another 76.9% land in the caution zone, meaning they need iteration before launch. Only 18.3% of ideas pass validation on that score threshold.
How does Preuve AI calculate startup benchmarks?
Preuve AI scores startup ideas from 0 to 100 using 40+ live data sources including competitor databases, Google Trends, Reddit, Crunchbase, and community forums. For this report, I started from more than 4,000 completed analyses and rebuilt the benchmark tables from a cleaned comparison set. The score reflects market size, competitive density, demand signals, timing, and risk factors. Every data point links to its original source.
60초 만에 이 검증 과정을 실행해 보세요
Preuve AI는 실제 시장 데이터와 투자자가 보는 검증 프레임워크로 스타트업 아이디어를 분석합니다.
내 아이디어 무료 분석하기무료 분석. 60초면 충분합니다.
