📊 統計・データ

選択バイアス

せんたくばいあす

標本の選び方に偏りがあることで生じる系統的な誤差。サンプルの代表性を損ない、結論を歪める。

1 分で読める

Definition and Mechanism

Selection bias is a systematic error that arises when the sample studied is not representative of the target population. It occurs during participant recruitment, data collection, or analysis when certain individuals are more likely to be included than others. The resulting conclusions may be valid for the sample but misleading when generalized.

Common Forms

Self-selection bias occurs when participants volunteer, attracting those with stronger opinions or motivation. Non-response bias emerges when certain groups systematically decline to participate. Survivorship bias focuses only on those who "survived" a process, ignoring dropouts.

In online surveys and ranking tools, users who engage tend to be younger, more tech-savvy, and more interested in self-improvement than the general population. This skews the reference group against which you compare yourself.

Impact on Ranking Accuracy

If a ranking's underlying data suffers from selection bias, your percentile may not reflect your true position in the broader population. For instance, a fitness app's user base skews toward active individuals, so being "average" among its users likely means you are above average in the general population. Recognizing who is and is not represented in the data is critical.

Mitigation Strategies

Researchers combat selection bias through random sampling, stratification, and weighting adjustments. As a consumer of ranking data, you can mitigate its effects by checking the data source, understanding who was surveyed, and comparing results across multiple datasets with different collection methods.

関連用語

関連記事

この用語解説は役に立ちましたか?