Measures of Dispersion
After reviewing the formulas a table shown below will show the numerical results for the measures of dispersion for the data set below. The measures of dispersion describe the width of the distribution.
{1, 3, 8, 3, 7, 11, 8, 3, 9, 10}
Range
The range, R, of the data is the difference of the highest and smallest values being analyzed.
R = 11 - 1 = 10
Deviation
The deviation is the difference of each value from the mean. This is used in the calculation of the standard deviation and variance. "x" is the point of interest.

The sum of deviations from the mean is always zero.

Standard Deviation
The standard deviation is shown by the following formulas. It also equals the square root of the variance. "x" is the point of interest.
"N" and "n" represent the sample size, although one is capitalized it is done for notation, they both represent the same value (the sample size). As the sample size approaches infinity, the denominators in both formulas become equal. The "n-1" is to remove bias from very small sample sizes such as less than 30 samples.

Variance
The variance is the standard deviation squared. The variance can not be calculated unless it is known that the data being analyzed is from a Population or Sample. This will change the denominator value. As the sample size approaches infinity, this difference in the values ("N" or "n-1") becomes neglible.
A term called "degrees of freedom" is commonly found in statistics. This referes to the number of independent values. For example, in the denomimator of the standard deviation and variance formula is a value of "n-1".
This is also known as the number of degrees of freedom. IF a particular sample size has 15 values then n = 15.
If 14 of the 15 values are known and the value of X-bar (sample mean) is known, then it is possible to calculate the final unknown value in the sample set.

Variance example
Data Set Analysis
The table shown below is a numerical summary of the measures of central tendency and dispersion for the data set shown at the top of the page.

Coefficient of Variation
The ratio of the standard deviation to the mean. This value is expressed as a percentage and helps to determine comparing variability when mean is also known.
For example:
Comparing variability is stocks, machines, operators, and shifts, commonly use the coefficient of variation.
If we only know the standard deviation of the performance on 5 machines it may not be satisfactory to make a good decision if stability is most important. Then you are interested in the mean and how much variability from the mean the machines exhibit.
If you know that all machines have a standard deviation of 2 pcs/hr, which one is worse assuming you want stability and predictability. The one with the highest mean pcs/hr has less risk and more predictable to get consistent performance.
This is commonly used to assess risks among stocks.
The formula is shown below:

EXAMPLE:
Stock 1: Mean = $36.50 and Standard Deviation = $2.00
Stock 2: Mean = $60.00 and Standard Deviation = $4.00
Stock 3: Mean = $45.00 and Standard Deviation = $2.25
THEN
CV 1: 5.48%
CV 2: 6.67%
CV 3: 5.00%
CONCLUSION:
Although the standard deviation is less for Stock 1, the stock is riskier and predicted to have more volatility in performance.
STOCK 3 has lowest risk of the three stocks.
The table below summarizes notation for describing samples and populations.
Samples are properly described as statistics.
Populations are properly described as parameters.

Return to MEASURE phase
Return to BASIC STATISTICS
Return to Six-Sigma-Material Home Page from Measures of Dispersion

|