aibizhub

Bootstrapped Growth

Ship-or-Kill Decision Score

Get a brutally honest verdict on your side project — SHIP, ITERATE, or KILL based on traction, economics, and market signal.

Your Project

Answer honestly. The tool is only as good as your inputs.

Time Investment
Traction
Economics
Competition & Distribution

Verdict

KILL

Composite Score: 6/100

Dimension Breakdown

Traction0/100 (35%)

You have 0 paying customers after 200 hours of building — that's a distribution problem, not a product problem.

Economics0/100 (20%)

You're burning $30/mo with zero revenue. Every month without income is a month closer to quitting.

Momentum0/100 (15%)

No growth and minimal users. You're building in a vacuum. Get the product in front of people before adding features.

Efficiency40/100 (15%)

10h/week with 200 hours invested. Make sure each hour is moving a metric, not just shipping code.

Market0/100 (15%)

You have low differentiation but no distribution plan. Distribution beats product every time. Fix this first.

Brutally Honest Take

Zero revenue with $30/mo in costs. You're subsidizing a product nobody has validated with money. Either get a paying customer this week or shut it down and redirect your energy.

What To Do Next

  • 1.Get your first paying customer this week. Offer manual onboarding, a discount, anything — just validate willingness to pay.
  • 2.Add a paid tier or raise prices. Free-only is not a business model.
  • 3.Set a 30-day checkpoint with specific metrics. If the numbers don't improve, make a hard call.

This tool is opinionated by design. It uses weighted scoring across 5 dimensions to produce a directional verdict, not a prediction. Use it to pressure-test your assumptions, not as the final word.

How to use it

  1. Enter your time investment: total hours spent building so far and hours per week you can realistically dedicate going forward. Be honest with both numbers because the scoring methodology specifically penalizes sunk-cost patterns, a cognitive bias identified by Kahneman and Tversky where past investment irrationally influences future commitment. If you have invested 500+ hours over 6+ months without producing meaningful traction (paying customers or measurable user growth), the time investment dimension will score low regardless of how polished the product is. This is intentional: the research on startup failure from CB Insights consistently identifies 'building something nobody wants' as the number one cause of failure (42% of post-mortems), and extended building without market validation is the most common symptom. The score treats time-without-traction as a risk signal, not as progress.
  2. Enter your traction metrics: number of paying customers, current monthly recurring revenue (MRR), free monthly active users, and week-over-week growth rate. These metrics are weighted heavily in the scoring model because they are the strongest signals of product-market fit. Even a single paying customer is a meaningful signal because it proves that someone values the product enough to exchange money for it. Zero paying customers after months of building is an equally strong signal in the opposite direction. For growth rate, the benchmark from Y Combinator and the startup ecosystem is that 5-7% week-over-week growth is 'good' for early-stage products, 10%+ is 'great,' and flat or negative growth after launch is a warning sign requiring immediate investigation. Revenue growth compounds: 5% weekly growth on a $1,000 MRR base produces approximately $12,600 MRR in 12 months.
  3. Enter your economics: monthly cost to operate the product (hosting, APIs, tools, marketing spend) and monthly revenue. The economics dimension checks whether your path to profitability is realistic. Products that are already profitable or approaching break-even score highest. Products burning cash with a clear path to profitability through demonstrated unit economics score moderately. Products burning cash with no revenue, no paying users, and no clear path to revenue score lowest. For bootstrapped founders, the sustainability of cash burn matters more than the absolute amount: burning $100/month on a side project with growing free users is sustainable, while burning $2,000/month on a product with no users is not, regardless of how much savings you have. The calculator cross-references your economics with your traction to identify whether spending is producing results.
  4. Evaluate your market position across three dimensions: number of direct competitors (0-2 competitors is favorable, 10+ competitors requires strong differentiation), your primary distribution channel (organic search, product communities, direct sales, paid acquisition, referrals), and how differentiated your product actually is from alternatives. Distribution is weighted heavily at this stage because research from a16z and First Round Capital consistently shows that distribution advantage matters more than feature advantage for early-stage products. A mediocre product with great distribution outperforms a great product with no distribution channel. If your only distribution plan is 'build it and they will come,' the market dimension will score low because passive discovery is the weakest and slowest customer acquisition strategy, especially in crowded markets.
  5. Read your verdict (SHIP, ITERATE, or KILL), the composite score from 0-100, the per-dimension breakdown showing which areas are strong and which are weak, and the three recommended next steps tailored to your specific situation. SHIP (score 70-100) means the signals support scaling effort and investment. ITERATE (score 40-69) means there are some positive signals but focused experiments are needed before committing further. KILL (score 0-39) means the evidence does not support continued investment in the current form. Focus your energy on your weakest scoring dimension first, because that dimension is the constraint limiting overall viability. A product with excellent traction but terrible economics needs pricing work, not more features. A product with good economics but zero traction needs distribution, not optimization. The recommended next steps are prioritized based on which dimension drags the composite score down most.

AI Integrations

Contract, discovery endpoints, and developer notes for agent use.

Always available for agents

Tool contract JSON

https://aibizhub.io/contracts/ship-or-kill-score.json

Stable input and output contract for this exact tool.

Human review

People can use the browser page to sense-check outputs and charts, but agents should still execute against the contract and discovery endpoints.

{
  "tool": "ship_or_kill_score",
  "hours_spent": 200,
  "hours_per_week": 10,
  "paying_customers": 3,
  "mrr": 45,
  "free_users": 120,
  "wow_growth_percent": 5,
  "monthly_cost": 30,
  "monthly_revenue": 45,
  "competitors": 4,
  "distribution": "organic",
  "differentiation": "better_ux"
}
Expand developer notes

Agent playbook

  1. Resolve Ship-or-Kill Decision Score from /agent-tools.json and open its contract before execution.
  2. Validate inputs against the contract schema instead of scraping labels from the page UI.
  3. Open the browser page only when a person wants to review charts, assumptions, or related tools.

Agent FAQ

Should ChatGPT, Claude, or another agent click through the UI?

No. Start with /agent-tools.json, then follow the tool's contract URL. The page UI is for human review, not parameter discovery.

When do tools show Quick and Advanced?

Every tool opens in Quick Start first. Advanced Controls keeps the same scenario, reveals more assumptions or diagnostics, and every tool keeps AI integrations inline below the instructions.

When should an agent still open the browser page?

Open it when a human wants to sense-check the output, review the chart, or keep exploring related tools after the calculation finishes.

Questions people usually ask
How does the Ship-or-Kill scoring methodology work?

The tool evaluates your project across five weighted dimensions: Traction (35%) measures paying customers, revenue, active users, and growth rate. Economics (20%) assesses cost structure, revenue trajectory, and path to profitability. Momentum (15%) evaluates growth trend direction and consistency. Efficiency (15%) examines output relative to time invested, penalizing sunk-cost patterns. Market (15%) evaluates competitive landscape, differentiation, and distribution channel strength. Each dimension scores 0-100 independently, then a weighted composite score determines the verdict: SHIP (70-100) means signals support scaling, ITERATE (40-69) means focused experiments are needed, KILL (0-39) means evidence does not support continued investment.

Why is traction weighted at 35% while everything else is lower?

Traction is weighted highest because it is the most reliable indicator of product-market fit, which research from CB Insights, Y Combinator, and First Round Capital consistently identifies as the primary determinant of startup success. No amount of elegant architecture, clever marketing, or efficient development compensates for building something people do not want to pay for. A single paying customer provides more signal than months of building because it proves someone values your product enough to exchange money for it. The scoring reflects this reality: a project with strong traction but weak economics can fix pricing, but a project with great economics and zero traction may have a fundamental product-market fit problem.

What does an ITERATE verdict actually mean in practice?

ITERATE (score 40-69) means there are some positive signals but the evidence is not strong enough to justify going all-in with significant time, money, or career risk. Practically, this means: set a 30-day checkpoint with specific, measurable goals (e.g., reach 10 paying customers, achieve 5% weekly growth, validate a distribution channel). Run 2-3 focused experiments targeting your weakest scoring dimension. Limit time investment to 10-15 hours per week rather than full-time commitment. The most common ITERATE pattern is a product that has free users or interest but has not converted that interest into paying customers, which requires pricing experiments, direct user conversations, and distribution testing before a SHIP-or-KILL decision.

When should I KILL a project even if I believe in the idea?

The most common kill signals are: zero paying customers after 3+ months of availability (the market has spoken), negative or flat growth for 6+ consecutive weeks after launch (no organic momentum), cost to acquire customers exceeds their lifetime value with no clear path to improvement, and the project requires more hours per week than you can sustain. The hardest kills are projects where you have invested significant time and emotional energy. Research on sunk cost fallacy by Kahneman and Tversky shows that humans systematically overweight past investment when making future-oriented decisions. The KILL verdict is designed to surface this pattern objectively. If you find yourself arguing against the score, ask whether you would start this project today knowing what you know now.

Does this framework work for non-SaaS products?

Yes. The scoring framework applies to any product, side project, or business initiative where you can measure customers, revenue, costs, and growth. This includes digital products (courses, templates, tools), mobile apps, physical products, service businesses, and content businesses. The inputs are universal: how many people pay you, how much revenue you generate, how much it costs to operate, how fast you are growing, and how you acquire customers. For non-SaaS products, interpret MRR as average monthly revenue and adjust customer acquisition context accordingly. The framework is deliberately model-agnostic because the fundamentals of traction, economics, and market position apply to any commercial venture.

How should I interpret the per-dimension breakdown?

The dimension breakdown reveals where your project is strong and where it is constrained. Focus on your weakest dimension first because it is your bottleneck. Common patterns: high traction with low economics means you have demand but your pricing or cost structure needs work. High economics with low traction means your model works on paper but you have not found product-market fit or distribution. High efficiency with low market means you are building well but in a crowded space without differentiation. The recommended next steps are always prioritized based on which dimension drags the composite score down most, because improving a score from 20 to 50 in one dimension has more impact than improving from 70 to 80 in another.

What if I disagree with the verdict?

The score is directional, not prescriptive. If you have strong contextual information the tool cannot capture (insider knowledge, pending enterprise contracts, a distribution partnership about to launch), that context legitimately matters. However, the most common reason founders disagree with a low score is that the numbers are uncomfortable, which is exactly the pattern the tool is designed to surface. Ask yourself: if a friend showed you these exact numbers for their project, would you advise them to continue? External perspective often reveals what internal attachment obscures. If after honest reflection you still disagree, document the specific thesis for why the score is wrong and set a 30-day checkpoint to test it with measurable goals.

How often should I re-evaluate my ship-or-kill score?

Re-evaluate monthly during active building, after any significant milestone (first paying customer, first $1K MRR, launch on Product Hunt or Hacker News), and at the end of any focused experiment period. Tracking your score over time is more valuable than any single evaluation because the trend reveals trajectory: a score that improves from 35 to 55 over 3 months shows the right direction even if the current verdict is still ITERATE. A score that stays flat at 45 for 3 months despite active effort is a stronger kill signal than a single reading of 39.

Is this tool free and private?

Yes. All calculations run entirely in your browser. No data is sent anywhere. No signup or account required.

Is this professional business advice?

No. The Ship-or-Kill framework provides a structured, quantitative assessment based on the inputs you provide. It does not capture qualitative factors like team dynamics, market timing, or domain expertise that legitimately influence product decisions. Use the score as one input alongside your own judgment, customer conversations, and trusted advisor feedback.

Related Resources

Learn the decision before you act

Every link here is tied directly to Ship-or-Kill Decision Score. Use the explanation, formula, examples, and benchmarks to pressure-test the calculator output from first principles.

Browse all 1 resources

Continue With Related Tools

More in Bootstrapped Growth

See when cash runs out, what you need to break even, and how to grow without funding.

Read the full Bootstrapped Growth guide →

Browse by Use Case

Business planning estimates — not legal, tax, or accounting advice.