A/B Test Sample Size Calculator
Plan your A/B test by calculating the required sample size per variation. Enter your baseline conversion rate and minimum detectable effect to determine how many visitors you need for a statistically valid experiment.
Sample Size Quick Reference
Sample size per variation at 95% confidence, 80% power:
| Baseline Rate | 5% MDE | 10% MDE | 20% MDE |
|---|---|---|---|
| 1% | 637,008 | 163,092 | 42,691 |
| 3% | 207,936 | 53,208 | 13,911 |
| 5% | 122,121 | 31,231 | 8,155 |
| 10% | 57,760 | 14,749 | 3,839 |
| 20% | 25,580 | 6,507 | 1,680 |
* MDE = Minimum Detectable Effect (relative). Lower MDE or baseline rate requires larger samples.
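Every cell in the table follows from the standard two-sample proportion formula. A short Python sketch using only the standard library (the function name `sample_size` is my own) reproduces the grid:

```python
import math
from statistics import NormalDist

def sample_size(baseline: float, mde: float,
                confidence: float = 0.95, power: float = 0.80) -> int:
    """Visitors needed per variation for a two-sample proportion test."""
    p1, p2 = baseline, baseline * (1 + mde)  # mde is relative
    z_alpha = NormalDist().inv_cdf(1 - (1 - confidence) / 2)  # 1.96 at 95%
    z_beta = NormalDist().inv_cdf(power)                      # 0.842 at 80%
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)

for baseline in (0.01, 0.03, 0.05, 0.10, 0.20):
    row = [f"{sample_size(baseline, mde):,}" for mde in (0.05, 0.10, 0.20)]
    print(f"{baseline:.0%}: " + " | ".join(row))
```

Rounding up to the next whole visitor (`math.ceil`) matches the table values above.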
Frequently Asked Questions
How do you calculate A/B test sample size?
Sample size is calculated using the formula: n = (Zα/2 + Zβ)² × (p₁(1-p₁) + p₂(1-p₂)) / (p₂-p₁)², where p₁ is the baseline conversion rate, p₂ is the expected improved rate, Zα/2 is the z-value for your confidence level, and Zβ is the z-value for your desired power.
What is minimum detectable effect (MDE)?
MDE is the smallest relative improvement you want to be able to detect in your test. A 10% MDE on a 5% baseline means you want to detect if the variant achieves at least 5.5% (a 0.5 percentage point absolute improvement). Smaller MDEs require larger sample sizes.
What is statistical power?
Statistical power (1-β) is the probability of correctly detecting a real effect. 80% power means you have an 80% chance of detecting a true difference and a 20% chance of missing it (Type II error). Higher power requires more samples but reduces false negatives.
Why do I need so many visitors for my A/B test?
Sample size depends on your baseline rate, desired MDE, confidence level, and power. Lower baseline rates, smaller MDEs, higher confidence, and higher power all increase the required sample size. A 5% baseline with 5% relative MDE at 95% confidence and 80% power needs ~122,000 visitors per variation.
How long should I run my A/B test?
Divide your total required sample size by your daily traffic. For example, if you need 20,000 visitors total and get 2,000/day, run for at least 10 days. Also run for at least 1-2 full weeks to account for day-of-week variations in user behavior.
What confidence level and power should I use?
The standard is 95% confidence and 80% power. Use 90% confidence for faster iterations where false positives are less costly. Use 99% confidence for high-impact changes. Increase power to 90-95% when missing a real improvement would be very costly (e.g., pricing tests).
Can I reduce the required sample size?
Yes: (1) Accept a larger MDE — if you only care about big improvements, you need fewer samples. (2) Lower confidence to 90%. (3) Lower power to 70-80%. (4) Use one-tailed tests if you only care about improvements (not recommended for most cases). (5) Focus traffic on the test pages.
What happens if I stop my test early?
Stopping early when you see a significant result inflates false positive rates dramatically — a phenomenon called 'peeking.' You may conclude a variant is better when it isn't. Always commit to the pre-calculated sample size before analyzing results, or use sequential testing methods designed for continuous monitoring.
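A small Monte Carlo simulation makes the peeking effect concrete. Both arms below share the same true conversion rate, so every "significant" result is a false positive; checking at ten interim points inflates the error rate well beyond the nominal 5%. (All constants here are illustrative assumptions, and the binomial draws use a normal approximation.)

```python
import math
import random

def z_significant(c1, n1, c2, n2, z_crit=1.96):
    """Two-proportion z-test: is the observed difference significant?"""
    p_pool = (c1 + c2) / (n1 + n2)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n1 + 1 / n2))
    return se > 0 and abs((c1 / n1 - c2 / n2) / se) > z_crit

def draw(n, p):
    # Normal approximation to a binomial conversion count
    return max(0, round(random.gauss(n * p, math.sqrt(n * p * (1 - p)))))

random.seed(42)
P, BATCH, PEEKS, SIMS = 0.05, 1000, 10, 500  # no true difference between arms

peeking_fp = fixed_fp = 0
for _ in range(SIMS):
    ca = cb = na = nb = 0
    flagged = False
    for _ in range(PEEKS):
        ca += draw(BATCH, P); cb += draw(BATCH, P)
        na += BATCH; nb += BATCH
        if z_significant(ca, na, cb, nb):
            flagged = True                      # a peek would have stopped here
    peeking_fp += flagged
    fixed_fp += z_significant(ca, na, cb, nb)   # analysis only at the end

print(f"False positives with peeking: {peeking_fp / SIMS:.1%}")
print(f"False positives, fixed horizon: {fixed_fp / SIMS:.1%}")
```

The fixed-horizon rate lands near the nominal 5%, while stopping at the first significant peek typically triples or quadruples it.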
Why Sample Size Matters in A/B Testing
Running an A/B test without adequate sample size is like flipping a coin three times and concluding it's unfair. Sample size determines the reliability of your test results. Too few visitors and you'll either miss real improvements (false negatives) or declare winners that don't actually exist (false positives).
Calculating sample size before running your experiment is critical because:
- It tells you how long the test needs to run
- It prevents premature stopping (which inflates false positive rates)
- It ensures you have enough statistical power to detect meaningful differences
- It helps you decide if a test is feasible given your traffic levels
Sample Size Formula
The required sample size per variation for a two-sample proportion test is:
Sample Size Per Variation:
n = (Zα/2 + Zβ)² × (p₁(1-p₁) + p₂(1-p₂)) / (p₂ - p₁)²
Where:
- n = required sample size per variation
- Zα/2 = z-value for the confidence level (e.g., 1.96 for 95%)
- Zβ = z-value for the statistical power (e.g., 0.842 for 80%)
- p₁ = baseline conversion rate
- p₂ = expected conversion rate (p₁ × (1 + MDE))
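The formula translates directly into code. A minimal sketch with the z-values for 95% confidence and 80% power hardcoded (the function name is my own):

```python
import math

def sample_size(p1: float, mde: float) -> float:
    """Per-variation n at 95% confidence / 80% power."""
    p2 = p1 * (1 + mde)                   # expected variant rate
    z_alpha, z_beta = 1.95996, 0.84162    # z for 95% (two-sided) and 80% power
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return (z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2

# Example 1 below: 3% baseline, 10% relative MDE
print(math.ceil(sample_size(0.03, 0.10)))
```

Round up to the next whole visitor, since a fraction of a visitor cannot convert.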
Sample Size Calculation Examples
Example 1: Standard E-commerce Test
Baseline conversion rate: 3%. You want to detect a 10% relative improvement (3% → 3.3%) at 95% confidence, 80% power.
p₁ = 0.03, p₂ = 0.033
Zα/2 = 1.96, Zβ = 0.842
n = (1.96 + 0.842)² × (0.03 × 0.97 + 0.033 × 0.967) / (0.003)²
n ≈ 53,208 per variation (106,416 total)
Example 2: High-converting Landing Page
Baseline: 15% conversion. Detecting 5% relative improvement at 95% confidence, 80% power.
p₁ = 0.15, p₂ = 0.1575
Absolute difference = 0.75pp
n ≈ 36,307 per variation (72,614 total)
At 10,000 visitors/day: ~8 days to complete
Example 3: Bold Change, Low Traffic
Baseline: 2%. Detecting 50% relative improvement (2% → 3%) at 95% confidence, 80% power.
p₁ = 0.02, p₂ = 0.03
Absolute difference = 1pp
n ≈ 3,823 per variation (7,646 total)
At 500 visitors/day: ~16 days
Understanding Key Parameters
Baseline Conversion Rate
Your current conversion rate before the test. Lower baseline rates require more samples because conversions are rarer events. A 1% baseline needs roughly 5x more samples than a 5% baseline for the same relative MDE.
Minimum Detectable Effect (MDE)
The smallest relative improvement you want to detect. A 10% MDE on a 5% baseline means detecting an increase to 5.5%. Smaller MDEs require exponentially more samples — halving the MDE roughly quadruples the required sample size.
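The "halving the MDE roughly quadruples the sample" rule can be checked numerically; a self-contained sketch (function name is my own):

```python
import math
from statistics import NormalDist

def sample_size(p1: float, mde: float) -> int:
    """Per-variation n at 95% confidence / 80% power."""
    p2 = p1 * (1 + mde)
    z = NormalDist().inv_cdf(0.975) + NormalDist().inv_cdf(0.80)
    return math.ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p2 - p1) ** 2)

for mde in (0.20, 0.10, 0.05):
    print(f"{mde:.0%} MDE on a 5% baseline: {sample_size(0.05, mde):,}")
```

Each halving of the MDE multiplies the required sample by just under 4, because the MDE enters the formula squared in the denominator.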
Confidence Level (1 - α)
The probability of not making a Type I error (false positive). At 95% confidence, there's a 5% chance of declaring a winner when there's actually no difference.
Statistical Power (1 - β)
The probability of detecting a real effect. At 80% power, there's a 20% chance of missing a real improvement (Type II error / false negative). Higher power requires more samples.
| Error Type | Name | Controlled By | Consequence |
|---|---|---|---|
| Type I (α) | False Positive | Confidence Level | Ship a change that doesn't work |
| Type II (β) | False Negative | Statistical Power | Miss a real improvement |
How to Reduce Required Sample Size
- Accept a larger MDE — Only test changes you expect to have a meaningful impact. If you're only willing to ship a 20%+ improvement, use a 20% MDE.
- Lower your confidence level — Use 90% instead of 95% for non-critical experiments. This reduces sample size by ~20%.
- Accept lower power — 80% power is standard, but 70% is acceptable for screening tests. Dropping from 80% to 70% power reduces sample size by ~20%.
- Focus traffic — Run the test only on pages or segments with the highest traffic to accelerate data collection.
- Use composite metrics — Metrics with higher rates (like click-through rate vs. purchase rate) require fewer samples.
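The confidence and power levers above are easy to quantify; a sketch comparing the defaults against the relaxed settings (function name is my own):

```python
import math
from statistics import NormalDist

def sample_size(p1, mde, confidence=0.95, power=0.80):
    """Per-variation n for a two-sample proportion test."""
    p2 = p1 * (1 + mde)
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2) + NormalDist().inv_cdf(power)
    return math.ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p2 - p1) ** 2)

base = sample_size(0.05, 0.10)  # 95% confidence, 80% power
print(f"90% confidence: {sample_size(0.05, 0.10, confidence=0.90) / base:.0%} of baseline n")
print(f"70% power:      {sample_size(0.05, 0.10, power=0.70) / base:.0%} of baseline n")
```

Both relaxations cut the requirement by roughly a fifth, which is why they are worth considering for low-stakes screening tests.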
Common Pitfalls in Sample Size Planning
- Using unrealistic MDEs — A 50% improvement sounds great but is rarely achievable. Most real improvements are 5-15%. Plan accordingly.
- Forgetting about test duration — Even if you have enough total traffic, you need to run for at least 1-2 full weeks to capture day-of-week effects.
- Not accounting for multiple comparisons — Testing 5 variants against a control requires a Bonferroni correction or similar adjustment.
- Ignoring seasonality — Running a test during a seasonal peak may not generalize to other periods.
- Peeking at results — Checking significance before reaching the planned sample size dramatically increases false positive rates.
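The multiple-comparisons point can be folded into the planning step by splitting α across comparisons (Bonferroni). A sketch, assuming the simple α/k split and a `sample_size` helper of my own naming:

```python
import math
from statistics import NormalDist

def sample_size(p1, mde, alpha=0.05, power=0.80):
    """Per-variation n for a two-sample proportion test."""
    p2 = p1 * (1 + mde)
    z = NormalDist().inv_cdf(1 - alpha / 2) + NormalDist().inv_cdf(power)
    return math.ceil(z ** 2 * (p1 * (1 - p1) + p2 * (1 - p2)) / (p2 - p1) ** 2)

k = 5  # five variants, each compared against the control
single = sample_size(0.05, 0.10)
adjusted = sample_size(0.05, 0.10, alpha=0.05 / k)  # Bonferroni-adjusted alpha
print(f"single comparison: {single:,}; Bonferroni for {k} variants: {adjusted:,}")
```

The stricter per-comparison α raises the required sample per variation by roughly half in this example, which is why multi-variant tests are so much more expensive than simple A/B splits.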