Nonparametric Test Flowcharts

In general, the power of parametric tests are greater than the power of the alternative nonparametric test when assumptions are met. As the sample size increases and becomes larger, the power of the nonparametric test approaches it parametric alternative.

In other words, when using a nonparametric test more data is needed to detect the same size difference as a parametric equivalent.

  • The center for parametric tests is the mean
  • The center for nonparametric tests is the median

Nonparametric tests also assume that the underlying distributions are symmetric but not necessarily normal. The assumption is these test statistics are "distribution-free. In other words, there aren't any assumptions made about the population parameters. 

When the choice exist on whether to use the parametric or nonparametric test, if the distribution is fairly symmetric, the standard parametric test are better choices than the nonparametric alternatives.

For example, if you are not sure if two data sets are normally distributed it may be safer to substitute the Mann-Whitney test to reduce the risk of drawing a wrong conclusion when testing two means. 

Nonparametric tests are used when:

  • Parametric criteria are not met or if distribution is unknown
  • These test are used when analyzing nominal or ordinal data
  • Nonparametric test can also analyze interval or ratio data but parametric test should be used
  • Quantitative, ranked, and qualitative data. Parametric is for only quantitative data
  • Data has definite outliers (that can not be removed)
  • Measurements are known to be imprecise

Advantages of nonparametric tests:

  • Offer more power when assumptions for the parametric tests have been violated. And they can be almost as powerful when assumptions haven't been violated.
  • Fewer assumptions about the data
  • Work with smaller sample sizes
  • They can be used for all data types or data containing outliers

Disadvantages of nonparametric tests:

  • Less powerful than parametric tests if assumptions haven’t been violated.
  • Critical value tables for many tests aren’t included in many computer software packages or textbooks

When to use Nonparametric Tests

  • Data can not be assumed to meet normality assumptions and can not be practically transformed to approximate normality.
  • With qualitative data of nominal scale or ranked data of ordinal scale.
  • When comparing Median values for skewed data (such as right or left skewed data). See Histograms for more insight on skewed data.
  • When there are very few, if any, assumptions regarding the shape of the distribution. However, if there is a bi-modal distribution, the data should be reviewed and separated to correctly analyze the true process performance. 
  • Don't be forceful and rush into a nonparametric test when there is clearly two (or more) unique situations in within the data. Use the Runs Test to verify the nonnormality is related to random causes and not a results of special causes.  
  • When unsure whether to use parametric or nonparametric, you can easily run both tests. See what the difference is in the outcomes and it may not be a meaningful difference in your Six Sigma project. Therefore, don't sweat it. Make a decision and move on to the IMPROVEments  Often times you'll find the results very similar and the hypothesis decision is the same for both. 

Comparables to Parametric Tests

These are the most commonly used nonparametric equivalents. There are many more nonparametric tests but these are most likely to arise on a Six Sigma project and a certification exam.

A sample and a target (or given value) 

Parametric - One Sample t test (testing means)

Nonparametric - One Sample Wilcoxon or One Sample Sign (testing medians)

Two independent samples 

Parametric - Two Sample t test (testing means)

Nonparametric - Mann-Whitney (testing medians)

>2 independent samples

Parametric - ANOVA (testing means)

Nonparametric - Mood's median or Kruskal-Wallis test (testing medians)

Three or more matched (dependent) samples

Parametric - Two way ANOVA for matched samples

Nonparametric - Friedman Test


Use the Runs Test to examine the randomness of the data and whether a sequence of samples are randomly distributed. 

Use Spearman Rank Correlation to examine the correlation between two data sets.

Use Goodman Kruska's Gamma testing the association for ranked variables.

Testing Variances - Levene's test is alternate if F and t test assumptions are not met and is used to check the homogeneity of variances across samples.




Refresher

You may be able to "transform" the data if the raw data does not meet normality assumptions.

The word "transformation" implies that each data point has the same calculation applied to it (including any specification limits when analyzing capability). Data that meets the assumption of normality allows a wider variety of test to be used.

Follow these steps when you believe the data does not meet normality assumptions:

  • Attempt to transform the data such as through a Box-Cox transformation, which uses an optimal exponent, lambda, on all the positive (must be positive for Box-Cox) data points.
  • Use the Runs Test to verify the data the is random (void of special causes).
  • If the transformation doesn't work, then apply the nonparametric test.

There are other transformation methods such as the Yeo-Johnson which is similar to the Box-Cox but can be used with negative values and zero. 

Other transformations are called Squared, Reciprocal, Exponential, Log (x), Lambert Gaussian, and Tukey's Ladder of Powers.

These are more advanced topics of which you should consult a Master Black Belt to ensure the most appropriate transformation is used depending on the distribution of the data. 


 




Return to BASIC STATISTICS

Return to Hypothesis Testing

Templates and Calculators

Search Six Sigma related job openings

Return the Six-Sigma-Material.com Home Page

Custom Search


Site Membership
LEARN MORE


Six Sigma

Templates, Tables & Calculators


Six Sigma Certification

Six Sigma Black Belt Certification

Six Sigma Slides

CLICK HERE

Green Belt Program (1,000+ Slides)

Basic Statistics

Cost of Quality

SPC

Process Mapping

Capability Studies

MSA

SIPOC

Cause & Effect Matrix

FMEA

Multivariate Analysis

Central Limit Theorem

Confidence Intervals

Hypothesis Testing

T Tests

1-Way ANOVA

Chi-Square

Correlation and Regression

Control Plan

Kaizen

MTBF and MTTR

Project Pitfalls

Error Proofing

Z Scores

OEE

Takt Time

Line Balancing

Yield Metrics

Practice Exam

... and more



Statistics in Excel


Need a Gantt Chart?

Click here to get this template