Skip to main content

correlation and its types


  

Definition

Degree and type of relationship between any two or more quantities (variables) in which they vary together over a period; for example, variation in the level of expenditure or savings with variation in the level of income. A positive correlation exists where the high values of one variable are associated with the high values of the other variable(s). A 'negative correlation' means association of high values of one with the low values of the other(s). Correlation can vary from +1 to -1. Values close to +1 indicate a high-degree of positive correlation, and values close to -1 indicate a high degree of negative correlation. Values close to zero indicate poor correlation of either kind, and 0 indicates no correlation at all. While correlation is useful in discovering possible connections between variables, it does not prove or disprove any cause-and-effect (causal) relationships between them. See also regression 

Positive Correlation

Positive correlation occurs when an increase in one variable increases the value in another.
The line corresponding to the scatter plot is an increasing line.
Positive Correlation

Negative Correlation

Negative correlation occurs when an increase in one variable decreases the value of another.
The line corresponding to the scatter plot is a decreasing line.
Negative Correlation

No Correlation

No correlation occurs when there is no linear dependency between the variables.
No Correlation

Perfect Correlation

Perfect correlation occurs when there is a funcional dependency between the variables.
In this case all the points are in a straight line.
Perfect Correlation

Strong Correlation

A correlation is stronger the closer the points are located to one another on the line.
Strong Correlation

Weak Correlation

A correlation is weaker the farther apart the points are located to one another on the line.
Weak Correlation
.

Comments

Popular posts from this blog

Lognormal distribution

Lognormal Distribution Probability Density Function A variable X is lognormally distributed if Y = LN(X) is normally distributed with "LN" denoting the natural logarithm. The general formula for the  probability density function  of the lognormal distribution is where   is the  shape parameter ,   is the  location parameter  and  m is the  scale parameter . The case where   = 0 and  m  = 1 is called the  standard lognormal distribution . The case where   equals zero is called the 2-parameter lognormal distribution. The equation for the standard lognormal distribution is Since the general form of probability functions can be  expressed in terms of the standard distribution , all subsequent formulas in this section are given for the standard form of the function. The following is the plot of the lognormal probability density function for four values of  . There are several commo...

Poisson distribution

Poisson Distribution Probability Mass Function The Poisson distribution is used to model the number of events occurring within a given time interval.The formula for the Poisson probability mass function is  is the shape parameter which indicates the average number of events in the given time interval. The following is the plot of the Poisson probability density function for four values of  . Cumulative Distribution Function The formula for the Poisson cumulative probability function is The following is the plot of the Poisson cumulative distribution function with the same values of   as the pdf plots above. Percent Point Function The Poisson percent point function does not exist in simple closed form. It is computed numerically. Note that because this is a discrete distribution that is only defined for integer values of  x , the percent point function is not smooth in the way the percent point function typically is for a continuous distribution....

Runs Test for Detecting Non-randomness

Runs Test for Detecting Non-randomness Purpose: Detect Non-Randomness The runs test ( Bradley, 1968 ) can be used to decide if a data set is from a random process. A run is defined as a series of increasing values or a series of decreasing values. The number of increasing, or decreasing, values is the length of the run. In a random data set, the probability that the ( I +1)th value is larger or smaller than the I th value follows a binomial distribution , which forms the basis of the runs test. Typical Analysis and Test Statistics The first step in the runs test is to count the number of runs in the data sequence. There are several ways to define runs in the literature, however, in all cases the formulation must produce a dichotomous sequence of values. For example, a series of 20 coin tosses might produce the f...