Skip to main content

Beta distribution

Beta Distribution

Probability Density FunctionThe general formula for the probability density function of the beta distribution isf(x) = (x-a)^(p-1)*(b-x)^(q-1)/(B(p,q)*(b-a)^(p+q-1))
 for   a <= x <= b, p, q > 0
where p and q are the shape parametersa and b are the lower and upper bounds, respectively, of the distribution, and B(p,q) is the beta function. The beta function has the formula
B(alpha,beta) = INTEGRAL[0 to 1][t^(alpha-1)*(1-t)^(beta-1)dt]
The case where a = 0 and b = 1 is called the standard beta distribution. The equation for the standard beta distribution is
f(x) = x^(p-1)*(1-x)^(q-1)/B(p,q) for 0 <= x <= 1, p, q > 0
Typically we define the general form of a distribution in terms of location and scale parameters. The beta is different in that we define the general distribution in terms of the lower and upper bounds. However, the location and scale parameters can be defined in terms of the lower and upper limits as follows:
    location = a
    scale = b - a
Since the general form of probability functions can be expressed in terms of the standard distribution, all subsequent formulas in this section are given for the standard form of the function.The following is the plot of the beta probability density function for four different values of the shape parameters.
plot of the Beta probability density function for 4 different values
 of the shape parameters
Cumulative Distribution FunctionThe formula for the cumulative distribution function of the beta distribution is also called the incomplete beta function ratio (commonly denoted by Ix) and is defined as
    F(x) = I(x)(p,q) =
 (1/B(p,q))*INTEGRAL[0 to x][t^(p-1)*(1-t)^(q-1) dt
where B is the beta function defined above.The following is the plot of the beta cumulative distribution function with the same values of the shape parameters as the pdf plots above.
plot of the Beta cumulative distribution function with the same
 values of the shape parameters as the pdf plots above
Percent Point FunctionThe formula for the percent point function of the beta distribution does not exist in a simple closed form. It is computed numerically.The following is the plot of the beta percent point function with the same values of the shape parameters as the pdf plots above.
plot of the beta percent point function with the same values
 of the shape parameters as the pdf plots above
Other Probability FunctionsSince the beta distribution is not typically used for reliability applications, we omit the formulas and plots for the hazard, cumulative hazard, survival, and inverse survival probability functions.
Common StatisticsThe formulas below are for the case where the lower limit is zero and the upper limit is one.
Meanp/(p+q)
Mode(p-1)/(p+q-2)     p, q > 1
Range0 to 1
Standard DeviationSQRT[(p*q)/((p+q)^2*(p+q+1))]
Coefficient of VariationSQRT(q/(p*(p+q+1)))
SkewnessSQRT[2*(q-p)*(SQRT(p+q+1)]/[(p+q+2)*SQRT(p*q)]
Parameter EstimationFirst consider the case where a and b are assumed to be known. For this case, the method of moments estimates are
    p = xbar*[(xbar*(1-xbar)/s^2) - 1]p = xbar*[(xbar*(1-xbar)/s^2) - 1]
where xbar is the sample mean and s2 is the sample variance. If a and b are not 0 and 1, respectively, then replace xbar with (xbar-a)/(b-a) and s2 with s^2/((b-a)^2) in the above equations.For the case when a and b are known, the maximum likelihood estimates can be obtained by solving the following set of equations
    psi(phat)-psi(phat+qhat)=(1/n)*SUM[i=1 to n][LOG[(Y(i)-b)/(b-a)]]psi(phat)-psi(phat+qhat)=(1/n)*SUM[i=1 to n][LOG[(Y(i)-b)/(b-a)]]
The maximum likelihood equations for the case when a and b are not known are given in pages 221-235 of Volume II of Johnson, Kotz, and Balakrishan.
SoftwareMost general purpose statistical software programs support at least some of the probability functions for the beta distribution.

Comments

Popular posts from this blog

Double exponential distribution

Double Exponential Distribution Probability Density Function The general formula for the  probability density function  of the double exponential distribution is where   is the  location parameter  and   is the  scale parameter . The case where   = 0 and   = 1 is called the  standard double exponential distribution . The equation for the standard double exponential distribution is Since the general form of probability functions can be  expressed in terms of the standard distribution , all subsequent formulas in this section are given for the standard form of the function. The following is the plot of the double exponential probability density function. Cumulative Distribution Function The formula for the  cumulative distribution function  of the double exponential distribution is The following is the plot of the double exponential cumulative distribution function. Percent Point Function The formula for the  percent point function  of the double exponential distribution

Runs Test for Detecting Non-randomness

Runs Test for Detecting Non-randomness Purpose: Detect Non-Randomness The runs test ( Bradley, 1968 ) can be used to decide if a data set is from a random process. A run is defined as a series of increasing values or a series of decreasing values. The number of increasing, or decreasing, values is the length of the run. In a random data set, the probability that the ( I +1)th value is larger or smaller than the I th value follows a binomial distribution , which forms the basis of the runs test. Typical Analysis and Test Statistics The first step in the runs test is to count the number of runs in the data sequence. There are several ways to define runs in the literature, however, in all cases the formulation must produce a dichotomous sequence of values. For example, a series of 20 coin tosses might produce the f

Basics of Sampling Techniques

Population                A   population   is a group of individuals(or)aggregate of objects under study.It is also known as universe. The population is divided by (i)finite population  (ii)infinite population, (iii) hypothetical population,  subject to a statistical study . A population includes each element from the set of observations that can be made. (i) Finite population : A population is called finite if it is possible to count its individuals. It may also be called a countable population. The number of vehicles crossing a bridge every day, (ii) Infinite population : Sometimes it is not possible to count the units contained in the population. Such a population is called infinite or uncountable. ex, The number of germs in the body of a patient of malaria is perhaps something which is uncountable   (iii) Hypothetical population : Statistical population which has no real existence but is imagined to be generated by repetitions of events of a certain typ