ARTICLE
standard deviation
Standard Deviation Standard deviation is one of the most fundamental measures of dispersion in probability theory and statistics. Formally, it is the square root of the variance an
Standard Deviation
Standard deviation is one of the most fundamental measures of dispersion in probability theory and statistics. Formally, it is the square root of the variance and quantifies the average distance of data points from the arithmetic mean (or expected value). A low standard deviation indicates that data points cluster tightly around the mean; a high standard deviation signals wide dispersion. The population standard deviation is denoted by , while the sample counterpart is denoted by or .
Unlike variance, which is expressed in squared units (e.g., dollars-squared), standard deviation restores the original units (e.g., dollars), making it far more interpretable for reporting and decision-making. This property alone explains why standard deviation, rather than variance, is the default measure of variability in virtually every applied discipline.
Computational Formulas
Standard deviation is defined as the non-negative square root of the variance. Two versions exist depending on whether the entire population or only a sample is available.
Population Standard Deviation
Here is the population mean, is the population size, and the numerator is the sum of squared deviations. Dividing by yields the population variance ; taking the square root gives .
Sample Standard Deviation
where is the sample mean and is the degrees of freedom. Dividing by rather than implements Bessel's correction. This adjustment is necessary because the sample mean is, by construction, the value that minimizes the sum of squared deviations for the sample—making the deviations systematically smaller than they would be around the unknown population mean . Dividing by compensates for this downward bias, ensuring that the sample variance is an unbiased estimator of : . Taking the square root of the unbiased variance estimator yields the (slightly biased but consistent) sample standard deviation.
The Random Variable Definition
For a random variable with mean , the standard deviation is:
Using the computational shortcut :
For a discrete random variable with probability mass function :
For a continuous random variable with probability density function :
Properties
Let and be random variables and constants.
- Non-negativity: . Equality holds if and only if is almost surely constant.
- Translation invariance: . Shifting all values leaves dispersion unchanged.
- Scaling: . Multiplying by a constant scales the standard deviation by the absolute value of that constant.
- Additivity under independence: If and are independent random variables, then . In general, .
- Relation to the mean: No universal relationship binds the mean and standard deviation; for a given mean, any non-negative standard deviation is possible depending on the distribution. However, for certain families the two are functionally linked: in a Poisson distribution, the mean and variance are equal (), while in a binomial distribution, is fully determined by and .
- Inequality: , with equality when . This follows directly from .
- Pooled standard deviation: When combining two independent samples of sizes with respective standard deviations , the pooled standard deviation (assuming equal population variances) is: \[ s_p = \sqrt{\frac{(n_1-1)s_1^2 + (n_2-1)s_2^2}{n_1 + n_2 - 2}} \] This is a weighted average of the two variances, used extensively in the two-sample t-test and ANOVA.
Interpretation: The Empirical Rule and Chebyshev's Inequality
The 68-95-99.7 Rule (Normal Distribution)
When data follow a normal distribution, the Empirical Rule provides precise coverage probabilities:
- Approximately 68\% of observations fall within ;
- Approximately 95\% fall within ;
- Approximately 99.7\% fall within .
Observations beyond are typically flagged as potential outliers. For example, if exam scores are normally distributed with mean 75 and standard deviation 8, roughly 95\% of students score between 59 and 91.
Chebyshev's Inequality
For any distribution (not necessarily normal), Chebyshev's inequality guarantees that for :
Equivalently, at least of the data lies within standard deviations of the mean. For , at least 75\% is covered; for , at least 88.9\%. This bound is distribution-free but far looser than the Empirical Rule; its virtue is absolute generality.
Worked Example
Consider an investment portfolio with annual returns over five years: , , , , . The sample mean is:
Squared deviations from the mean: , , , , . Their sum is .
The portfolio's average annual return is 10\%, and returns typically deviate from this mean by about 11.18 percentage points. If returns are approximately normal, roughly two-thirds of annual returns should fall between and . The relatively high standard deviation relative to the mean signals substantial year-to-year variability, which a risk-averse investor would weigh carefully.
Interpretation via Standardized Scores
The z-score (or standard score) expresses any observation in standard-deviation units:
A -score of means the observation is two standard deviations above the mean; a -score of means it is one and a half below. This transformation is foundational: it allows comparison across variables measured on entirely different scales—for instance, comparing a student's performance in mathematics and verbal reasoning when the two tests have different means and spreads. It is also the first step in constructing -tests and -tests in hypothesis testing. In finance, a -score of for a daily return signals an event so extreme that, under normality, it would be expected roughly once every 1,350 trading days; such tail events challenge the normality assumption itself and motivate the use of heavy-tailed distributions in risk management.
Key Applications
- Finance: Asset return standard deviation is the canonical measure of volatility and risk. It is a core input to Modern Portfolio Theory, the Capital Asset Pricing Model (CAPM), and the Sharpe ratio , which measures risk-adjusted return. Option pricing models like Black-Scholes-Merton also depend critically on volatility estimates.
- Economics: Standard deviations of GDP growth, inflation, and unemployment rates gauge macroeconomic stability. A country with low GDP-growth standard deviation is said to exhibit smoother business cycles; policymakers often target low output-gap variability.
- Quality Control: In Six Sigma methodology, the standard deviation of a manufacturing process determines defect rates. A process operating at Six Sigma quality produces only 3.4 defects per million opportunities, meaning the specification limits are six standard deviations from the process mean.
- Social Sciences: Survey results routinely report means accompanied by standard deviations to convey the spread of responses. In psychometrics, test-score standard deviations allow comparison across populations via z-scores: .
- Machine Learning: Standard deviation underpins feature scaling (standardization), where each feature is transformed to have mean 0 and standard deviation 1 before model training. This prevents features with larger scales from dominating algorithms sensitive to magnitude.
Comparison with Other Dispersion Measures
- Variance (): Mathematically cleaner for theoretical work (additive under independence), but its squared units hinder interpretation. Standard deviation is almost always preferred for reporting.
- Range (max min): Simple to compute but uses only two data points and is catastrophically sensitive to outliers. A single extreme value can inflate the range regardless of the remaining data.
- Interquartile Range (IQR): The difference between the 75th and 25th percentiles. Robust to outliers and preferred when distributions are heavily skewed. However, IQR discards information from the tails beyond the quartiles.
- Mean Absolute Deviation (MAD): The average of absolute (not squared) deviations. Less sensitive to outliers than standard deviation and has the same units as the data. Historically, Fisher argued for the standard deviation's greater algebraic tractability and efficiency under normality—reasons it became the dominant measure.
- Coefficient of Variation (): A dimensionless ratio useful for comparing dispersion across datasets with different units or vastly different means.
Limitations
Standard deviation has two principal limitations. First, by squaring deviations it amplifies the influence of outliers; a single extreme observation can dramatically inflate the estimate. In such cases, robust alternatives like IQR or MAD should be considered. Second, standard deviation is most informative when the underlying distribution is roughly symmetric and unimodal. For multi-modal or highly skewed data, a single may mislead—reporting the full distribution or using quantile-based measures is often more appropriate. Despite these caveats, standard deviation remains the indispensable workhorse of dispersion measurement, combining mathematical tractability with practical interpretability.