A/B Test Calculator
Determine if your A/B test results are statistically significant. Enter visitors and conversions for both control and variant groups to get instant statistical analysis including p-value, z-score, uplift, and power.
Z = (p₂ - p₁) / √(p̂ × (1 - p̂) × (1/n₁ + 1/n₂))
Control Group (A)
Variant Group (B)
Frequently Asked Questions
What is an A/B test?
An A/B test (also called a split test) is a controlled experiment where you compare two versions of something (e.g., a webpage, email, or ad) to determine which performs better. Version A is the control (original), and Version B is the variant (modified). Users are randomly assigned to each group, and their behavior (conversions, clicks, etc.) is measured.
What is statistical significance in A/B testing?
Statistical significance means the difference between your control and variant is unlikely to be due to random chance. Typically, a result is considered significant at 95% confidence, meaning there's less than a 5% probability the observed difference happened by chance. The p-value quantifies this probability.
How do you calculate the p-value for an A/B test?
The p-value is calculated using a two-proportion z-test. First, compute the z-score: Z = (p₂ - p₁) / √(p̂ × (1 - p̂) × (1/n₁ + 1/n₂)), where p̂ is the pooled proportion. Then convert the z-score to a two-tailed p-value using the standard normal distribution.
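The z-to-p conversion described above can be sketched with Python's standard library (no third-party packages; the function name `two_tailed_p` is illustrative):

```python
from statistics import NormalDist

def two_tailed_p(z):
    """Convert a z-score to a two-tailed p-value: p = 2 * (1 - Phi(|z|))."""
    return 2 * (1 - NormalDist().cdf(abs(z)))

print(round(two_tailed_p(1.96), 4))  # the familiar 5% boundary: 0.05
```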
What confidence level should I use?
95% confidence is the industry standard for most A/B tests. Use 90% for directional decisions or fast-paced experiments where speed matters more than certainty. Use 99% for high-stakes decisions (pricing changes, major redesigns) where a false positive would be very costly.
What is statistical power?
Statistical power is the probability of detecting a true effect when one exists. A power of 80% means if there really is a difference between your variations, you have an 80% chance of detecting it. Low power means you might miss real improvements (false negatives). Most experiments should target at least 80% power.
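As a rough sketch, power for a two-sided, two-proportion z-test can be approximated with the normal distribution. The function name and the pooled-vs-unpooled standard-error choice below are illustrative assumptions, not a prescribed method:

```python
from statistics import NormalDist

def ab_test_power(p1, p2, n1, n2, alpha=0.05):
    """Approximate power of a two-sided, two-proportion z-test (normal approximation)."""
    nd = NormalDist()
    z_crit = nd.inv_cdf(1 - alpha / 2)
    # SE under the null hypothesis uses the pooled rate; SE under the alternative does not
    p_pool = (p1 * n1 + p2 * n2) / (n1 + n2)
    se0 = (p_pool * (1 - p_pool) * (1 / n1 + 1 / n2)) ** 0.5
    se1 = (p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2) ** 0.5
    return nd.cdf((abs(p2 - p1) - z_crit * se0) / se1)

# e.g. 3% vs 3.8% conversion with 10,000 visitors per arm
print(f"{ab_test_power(0.03, 0.038, 10_000, 10_000):.0%}")
```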
How long should I run an A/B test?
Run your test until you reach the required sample size (use our Sample Size Calculator to determine this). Never stop a test early just because it looks significant — this inflates false positive rates. Also run for at least 1-2 full business cycles (typically 1-2 weeks) to account for day-of-week effects.
What does the uplift percentage mean?
Uplift (or lift) is the relative improvement of the variant over the control. It's calculated as: Uplift = (Variant Rate - Control Rate) / Control Rate × 100. For example, if control converts at 5% and variant at 6%, the uplift is 20% — meaning the variant performs 20% better than the control.
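That arithmetic is a one-liner; a minimal sketch:

```python
def uplift_pct(control_rate, variant_rate):
    """Relative lift of the variant over the control, in percent."""
    return (variant_rate - control_rate) / control_rate * 100

print(round(uplift_pct(0.05, 0.06), 2))  # → 20.0
```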
Can I trust my A/B test results with a small sample size?
Small sample sizes lead to unreliable results with wide confidence intervals. Even if you see a 'significant' p-value with small samples, the observed effect size is likely exaggerated. Aim for adequate sample sizes before drawing conclusions. Use our A/B Test Sample Size Calculator to plan your experiment.
What is A/B Testing?
A/B testing (also known as split testing) is a method of comparing two versions of a webpage, email, ad, or any other content to determine which one performs better. Users are randomly divided into two groups: the control group (A) sees the original version, and the variant group (B) sees the modified version.
The key question A/B testing answers is: "Is the difference in performance between A and B real, or could it have happened by random chance?" This is where statistical significance comes in. Our calculator uses a two-proportion z-test to determine whether the observed difference is statistically significant.
A/B testing is fundamental to data-driven decision making in marketing, product development, UX design, and growth engineering. Companies like Google, Amazon, Netflix, and Booking.com run thousands of A/B tests annually to optimize their products.
Statistical Formula & How It Works
This calculator uses the two-proportion z-test to compare conversion rates between two independent groups:
Step 1: Calculate Conversion Rates
p₁ = Conversions₁ / Visitors₁
p₂ = Conversions₂ / Visitors₂
Step 2: Calculate Pooled Proportion
p̂ = (C₁ + C₂) / (n₁ + n₂)
Step 3: Calculate Z-Score
Z = (p₂ - p₁) / √(p̂ × (1 - p̂) × (1/n₁ + 1/n₂))
Step 4: Convert to P-Value (two-tailed)
p-value = 2 × (1 - Φ(|Z|))
If the p-value is less than alpha (where alpha = 1 - confidence level), the result is statistically significant. For 95% confidence, alpha = 0.05, so a p-value below 0.05 indicates significance.
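The four steps above can be sketched end to end in Python using only the standard library (the function and variable names are illustrative):

```python
from statistics import NormalDist

def ab_test(conversions_a, visitors_a, conversions_b, visitors_b, confidence=0.95):
    """Two-proportion z-test; returns (z, p_value, significant)."""
    p1 = conversions_a / visitors_a                          # Step 1: conversion rates
    p2 = conversions_b / visitors_b
    p_hat = (conversions_a + conversions_b) / (visitors_a + visitors_b)  # Step 2: pooled
    se = (p_hat * (1 - p_hat) * (1 / visitors_a + 1 / visitors_b)) ** 0.5
    z = (p2 - p1) / se                                       # Step 3: z-score
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))             # Step 4: two-tailed p-value
    return z, p_value, p_value < 1 - confidence

z, p, sig = ab_test(300, 10_000, 380, 10_000)
print(f"z = {z:.3f}, p = {p:.4f}, significant = {sig}")
```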
A/B Test Calculation Examples
Example 1: Significant Result
An e-commerce site tests a new checkout page. Control: 10,000 visitors, 300 purchases. Variant: 10,000 visitors, 380 purchases.
Control rate: 300/10,000 = 3.00%
Variant rate: 380/10,000 = 3.80%
Uplift: (3.80 - 3.00) / 3.00 = +26.67%
Pooled: 680/20,000 = 3.40%
SE = √(0.034 × 0.966 × 0.0002) = 0.00256
Z = 0.008 / 0.00256 = 3.125
P-value = 0.0018
Result: Statistically significant at 95% confidence
Example 2: Not Significant
A SaaS company tests a new pricing page. Control: 500 visitors, 25 signups. Variant: 500 visitors, 30 signups.
Control rate: 25/500 = 5.00%
Variant rate: 30/500 = 6.00%
Uplift: +20.00%
Z = 0.693
P-value = 0.488
Result: Not significant — need more data
Example 3: Variant Performs Worse
An email marketing test. Control: 5,000 recipients, 250 clicks. Variant: 5,000 recipients, 200 clicks.
Control rate: 250/5,000 = 5.00%
Variant rate: 200/5,000 = 4.00%
Uplift: -20.00%
Z = -2.412
P-value = 0.0159
Result: Statistically significant — variant is worse
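All three scenarios can be recomputed with a short script (standard library only; the labels are just for display):

```python
from statistics import NormalDist

def z_and_p(c1, n1, c2, n2):
    """Two-proportion z-test: z-score and two-tailed p-value."""
    p1, p2 = c1 / n1, c2 / n2
    p_hat = (c1 + c2) / (n1 + n2)
    se = (p_hat * (1 - p_hat) * (1 / n1 + 1 / n2)) ** 0.5
    z = (p2 - p1) / se
    return z, 2 * (1 - NormalDist().cdf(abs(z)))

for label, (c1, n1, c2, n2) in [("checkout", (300, 10_000, 380, 10_000)),
                                ("pricing", (25, 500, 30, 500)),
                                ("email", (250, 5_000, 200, 5_000))]:
    z, p = z_and_p(c1, n1, c2, n2)
    print(f"{label}: z = {z:+.3f}, p = {p:.4f}")
```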
Choosing Your Significance Level
| Confidence | Alpha (α) | Z Critical | Best For |
|---|---|---|---|
| 90% | 0.10 | 1.645 | Quick iterations, low-risk changes |
| 95% | 0.05 | 1.960 | Industry standard, most A/B tests |
| 99% | 0.01 | 2.576 | High-stakes decisions, pricing changes |
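The critical z values in the table come straight from the inverse of the standard normal CDF; a quick check in Python:

```python
from statistics import NormalDist

def z_critical(confidence):
    """Two-sided critical z-score for a given confidence level."""
    alpha = 1 - confidence
    return NormalDist().inv_cdf(1 - alpha / 2)

for conf in (0.90, 0.95, 0.99):
    print(f"{conf:.0%} -> {z_critical(conf):.3f}")
```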
Common A/B Testing Mistakes
- Stopping tests too early — Checking results before reaching the required sample size inflates false positive rates. Commit to your sample size before starting.
- Testing too many variations — Each additional variant requires a larger sample size and increases the chance of false positives (multiple comparisons problem).
- Ignoring statistical power — Low-powered tests frequently miss real effects. Aim for at least 80% power when planning your test.
- Not running full business cycles — User behavior varies by day of week, time of day, and season. Run tests for at least 1-2 full weeks.
- Testing tiny changes on small samples — Small effects need large samples to detect. Use the Sample Size Calculator to plan ahead.
- Cherry-picking metrics — Decide which metric to track before running the test. Looking at multiple metrics after the fact increases false discoveries.
When to Use A/B Testing
- Landing page optimization — Headlines, CTAs, images, form fields, layout
- Email marketing — Subject lines, send times, content, personalization
- Pricing pages — Pricing tiers, feature display, social proof
- Ad campaigns — Ad copy, creatives, targeting, bidding strategies
- Product features — Onboarding flows, UI changes, feature placement
- Checkout flows — Form design, payment options, trust signals
A/B Testing Best Practices
- Define your hypothesis before testing — Write down what you expect to happen and why. This prevents post-hoc rationalization.
- Calculate sample size upfront — Use our A/B Test Sample Size Calculator to determine how many visitors you need before starting.
- Test one variable at a time — Changing multiple elements makes it impossible to know which change caused the effect.
- Ensure random assignment — Users should be randomly assigned to control or variant with equal probability.
- Run the full duration — Don't stop early. Don't extend the test just because results aren't significant.
- Consider practical significance — A statistically significant 0.1% improvement may not be worth the development cost. Consider the business impact.
- Document everything — Record your hypothesis, sample size calculation, test duration, and results for institutional learning.