Skip to main content

Normal distribution


Normal Distribution

Probability Density FunctionThe general formula for the probability density function of the normal distribution isf(x) = EXP[-(x-mu)**2/(2*sigma**2)]/(sigma*SQRT(2*PI))
where mu is the location parameter and sigma is the scale parameter. The case where mu = 0 and sigma = 1 is called the standard normal distribution. The equation for the standard normal distribution is
f(x) = EXP[-x**2/2]/SQRT(2*PI)
Since the general form of probability functions can be expressed in terms of the standard distribution, all subsequent formulas in this section are given for the standard form of the function.
The following is the plot of the standard normal probability density function.
plot of the standard normal probability density function
Cumulative Distribution FunctionThe formula for the cumulative distribution function of the normal distribution does not exist in a simple closed formula. It is computed numerically.The following is the plot of the normal cumulative distribution function.
plot of the normal cumulative distribution function
Percent Point FunctionThe formula for the percent point function of the normal distribution does not exist in a simple closed formula. It is computed numerically.The following is the plot of the normal percent point function.
plot of the normal percent point function
Hazard FunctionThe formula for the hazard function of the normal distribution isPHI(x)/phi(-x)
where PHI is the cumulative distribution function of the standardnormal distribution and phi is the probability density function of the standard normal distribution.
The following is the plot of the normal hazard function.
plot of the normal hazard function
Cumulative Hazard FunctionThe normal cumulative hazard function can be computed from the normal cumulative distribution function.The following is the plot of the normal cumulative hazard function.
plot of the normal cumulative hazard function
Survival FunctionThe normal survival function can be computed from the normal cumulative distribution function.The following is the plot of the normal survival function.
normal survival function
Inverse Survival FunctionThe normal inverse survival function can be computed from the normal percent point function.The following is the plot of the normal inverse survival function.
normal inverse survival function
Common Statistics
MeanThe location parameter mu.
MedianThe location parameter mu.
ModeThe location parameter mu.
RangeInfinity in both directions.
Standard DeviationThe scale parameter sigma.
Coefficient of Variationsigma/mu
Skewness0
Kurtosis3
Parameter EstimationThe location and scale parameters of the normal distribution can be estimated with the sample mean and sample standard deviation, respectively.
CommentsFor both theoretical and practical reasons, the normal distribution is probably the most important distribution in statistics. For example,
  • Many classical statistical tests are based on the assumption that the data follow a normal distribution. This assumption should be tested before applying these tests.
  • In modeling applications, such as linear and non-linear regression, the error term is often assumed to follow a normal distribution with fixed location and scale.
  • The normal distribution is used to find significance levels in many hypothesis tests and confidence intervals.
Theroretical Justification - Central Limit TheoremThe normal distribution is widely used. Part of the appeal is that it is well behaved and mathematically tractable. However, the central limit theorem provides a theoretical basis for why it has wide applicability.The central limit theorem basically states that as the sample size (N) becomes large, the following occur:
  1. The sampling distribution of the mean becomes approximately normal regardless of the distribution of the original variable.
  2. The sampling distribution of the mean is centered at the population mean, mu, of the original variable. In addition, the standard deviation of the sampling distribution of the mean approaches sigma/SQRT(N).
SoftwareMost general purpose statistical software programs support at least some of the probability functions for the normal distribution.

Comments

Popular posts from this blog

Runs Test for Detecting Non-randomness

Runs Test for Detecting Non-randomness Purpose: Detect Non-Randomness The runs test ( Bradley, 1968 ) can be used to decide if a data set is from a random process. A run is defined as a series of increasing values or a series of decreasing values. The number of increasing, or decreasing, values is the length of the run. In a random data set, the probability that the ( I +1)th value is larger or smaller than the I th value follows a binomial distribution , which forms the basis of the runs test. Typical Analysis and Test Statistics The first step in the runs test is to count the number of runs in the data sequence. There are several ways to define runs in the literature, however, in all cases the formulation must produce a dichotomous sequence of values. For example, a series of 20 coin tosses might produce the f...

The most femiliar statisticians

Gertrude Cox :   Gertrude Mary Cox (of Experimental Statistics at North Carolina State University. She was later appointed director of both the Institute of Statistics of 1900 - 1978) was an influential American statistician and founder of the department the Consolidated University of North Carolina and the Statistics Research Division of North Carolina State University. Her most important and influential research dealt with experimental design; she wrote an important book on the subject with W. G. Cochran. In 1949 Cox became the first female elected into the International Statistical Institute and in 1956 she was president of the American Statistical Association. From 1931 to 1933 Cox undertook graduate studies in statistics at the  University of California at Berkeley , then returned to Iowa State College as assistant in the Statistical Laboratory. Here she worked on the  design of experiments . In 1939 she was appointed assistant professor of statisti...

Lognormal distribution

Lognormal Distribution Probability Density Function A variable X is lognormally distributed if Y = LN(X) is normally distributed with "LN" denoting the natural logarithm. The general formula for the  probability density function  of the lognormal distribution is where   is the  shape parameter ,   is the  location parameter  and  m is the  scale parameter . The case where   = 0 and  m  = 1 is called the  standard lognormal distribution . The case where   equals zero is called the 2-parameter lognormal distribution. The equation for the standard lognormal distribution is Since the general form of probability functions can be  expressed in terms of the standard distribution , all subsequent formulas in this section are given for the standard form of the function. The following is the plot of the lognormal probability density function for four values of  . There are several commo...