What Is a Confidence Interval - The Most Misunderstood Statistical Concept
The "95% confidence interval" is the most frequently used and most frequently misunderstood concept in statistics. The correct interpretation is: "If we repeated the same sampling procedure 100 times, approximately 95 of the resulting confidence intervals would contain the true value." It does not mean "there is a 95% probability that the true value lies within this interval."
This distinction may appear philosophical, but it carries practical consequences. The true value is a fixed constant; what varies probabilistically is the confidence interval itself. Once a particular confidence interval has been computed, whether it contains the true value is a binary fact - it either does or does not - not a matter of probability.
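The frequentist interpretation above can be made concrete with a short simulation. The sketch below (with an assumed true mean of 50 and known standard deviation, purely for illustration) repeatedly draws samples, builds a 95% interval around each sample mean, and counts how often the interval covers the fixed true value:

```python
import random
import statistics

random.seed(0)
TRUE_MEAN = 50.0   # the fixed constant; it never varies
SIGMA = 10.0       # assumed known, to keep the interval formula simple
N = 100            # observations per repeated experiment
Z = 1.96           # two-sided 95% normal critical value
TRIALS = 10_000

covered = 0
for _ in range(TRIALS):
    sample = [random.gauss(TRUE_MEAN, SIGMA) for _ in range(N)]
    mean = statistics.fmean(sample)
    half_width = Z * SIGMA / N ** 0.5   # interval varies; the true value does not
    if mean - half_width <= TRUE_MEAN <= mean + half_width:
        covered += 1

print(f"coverage: {covered / TRIALS:.3f}")  # close to 0.95
```

What is random here is the interval, not the truth: roughly 95% of the 10,000 intervals capture 50.0, but any single computed interval either contains it or does not.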
Sources of Uncertainty in Ranking Data
When MyRank displays "you are in the top X% globally," that figure carries implicit uncertainty arising from multiple sources: sampling error in the underlying data, measurement error, temporal lag in the data, and uncertainty in purchasing-power-parity conversions.
For instance, if "top 25%" is displayed, the true value might lie anywhere in the 20-30% range. Ignoring this uncertainty and treating "25%" as a precise figure leads to confusion when small changes in input produce seemingly large shifts in ranking. Ranking numbers are point estimates and should always be interpreted with an implicit margin.
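One way to attach a margin to a ranking like this is the bootstrap: resample the underlying survey data and see how much the computed rank moves. The sketch below uses entirely hypothetical data (a synthetic sample of 2,000 log-normal incomes standing in for a survey) and is not MyRank's actual method:

```python
import random

random.seed(1)
# Hypothetical survey: 2,000 log-normally distributed incomes.
incomes = [random.lognormvariate(10, 0.8) for _ in range(2000)]
my_income = sorted(incomes)[int(0.75 * len(incomes))]  # ~"top 25%" by construction

def top_percent(data, x):
    """Share of the sample earning at least x, i.e. 'top X%'."""
    return 100 * sum(1 for v in data if v >= x) / len(data)

point = top_percent(incomes, my_income)

# Bootstrap: resample the survey with replacement and recompute the rank.
reps = sorted(
    top_percent(random.choices(incomes, k=len(incomes)), my_income)
    for _ in range(1000)
)
low, high = reps[25], reps[975]  # empirical 2.5% and 97.5% quantiles
print(f"top {point:.1f}% (95% interval: roughly {low:.1f}%-{high:.1f}%)")
```

The point estimate is a single number, but the bootstrap replicates spread around it by a couple of percentage points, which is exactly the implicit margin the text describes.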
The Relationship Between Sample Size and Confidence Intervals
Confidence interval width is inversely proportional to the square root of sample size. Quadrupling the sample size halves the confidence interval width. This means "doubling precision requires quadrupling data" - a diminishing returns relationship for precision improvement.
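The square-root relationship is easy to verify numerically. Assuming a known standard deviation of 10 and the usual normal-based interval, the half-width for a few sample sizes works out as:

```python
# half-width = z * sigma / sqrt(n): quadrupling n halves the width
Z, SIGMA = 1.96, 10.0

def half_width(n):
    """Half-width of a 95% CI for a mean, with known sigma."""
    return Z * SIGMA / n ** 0.5

for n in (100, 400, 1600):
    print(n, round(half_width(n), 2))
# 100 1.96
# 400 0.98
# 1600 0.49
```

Each fourfold increase in n buys only a halving of the margin, which is the diminishing-returns pattern the text describes.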
World Bank income data are based on samples of several thousand to tens of thousands per country. A sample of 10,000 from China's 1.4 billion population may seem adequate, but when stratified by region, age, and occupation, cell sizes shrink rapidly. Estimates for fine-grained subgroups like "Chinese males aged 25-34 in rural areas" carry substantial uncertainty.
The Relationship Between p-values and Confidence Intervals
P-values and confidence intervals are two sides of the same coin. For a two-sided test of a null effect of zero, a 95% confidence interval that excludes zero is mathematically equivalent to a p-value below 0.05. However, while p-values do not indicate effect magnitude, confidence intervals convey the plausible range of effect sizes - making them more informative.
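The equivalence can be checked directly under a normal approximation. The sketch below uses hypothetical numbers (an estimated effect of 2.5 with standard error 1.0) and compares the two-sided p-value against whether the 95% interval excludes zero:

```python
import math

def two_sided_p(est, se):
    """Two-sided p-value for H0: effect = 0, normal approximation."""
    z = abs(est / se)
    return 2 * (1 - 0.5 * (1 + math.erf(z / math.sqrt(2))))

def ci95(est, se):
    """Normal-approximation 95% confidence interval."""
    return est - 1.96 * se, est + 1.96 * se

est, se = 2.5, 1.0  # hypothetical estimate and standard error
low, high = ci95(est, se)
p = two_sided_p(est, se)
print(f"95% CI: ({low:.2f}, {high:.2f}), p = {p:.4f}")
# The interval excludes zero exactly when p < 0.05.
```

Note that the p-value alone says nothing about whether the effect is near 0.5 or near 4.5; the interval does.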
In recent years, the statistical community has increasingly criticized binary use of p-values (significant versus non-significant), recommending instead the reporting of effect sizes and confidence intervals. The American Statistical Association issued a 2016 statement on p-value misuse, stating explicitly that statistical significance does not by itself measure the size of an effect or the importance of a result.
Developing Literacy for Uncertainty
Human cognition prefers certainty. The definitive statement "you are in the top 25%" is psychologically more satisfying than "you are probably somewhere in the top 20-30% range." Yet the latter is more honest and more accurate.
Mature data literacy does not mean eliminating discomfort with uncertainty; it means being able to assess its magnitude appropriately. Asking "how reliable is this number?" and "would my conclusion change within the range of uncertainty?" forms the foundation of thinking that is not driven by data but informed by it.