Cumulative Distribution Function ( CDF) with Examples in Statistics
In statistics, a Cumulative Distribution Function (CDF) is a function that shows the cumulative probability of a random variable taking on a value less than or equal to a given value. The CDF is defined for both discrete and continuous random variables.
Definition and Properties:
For a random variable , the CDF is defined as: where denotes the probability that is less than or equal to .
Key properties of the CDF:
- is non-decreasing: As increases, either increases or stays constant.
- and : The CDF approaches 0 as goes to negative infinity and approaches 1 as goes to positive infinity.
- is right-continuous: does not jump; it changes smoothly or remains constant.
Examples:
Example 1: Discrete Random Variable
Let's consider a discrete random variable that represents the outcome of rolling a fair six-sided die.
- Possible outcomes:
- Probability mass function (PMF):
The CDF for this discrete random variable is:
For example,
Example 2: Continuous Random Variable
Consider a continuous random variable that follows a uniform distribution on the interval .
- Probability density function (PDF):
The CDF for this continuous random variable is:
For example,
Uses of CDF:
- Probability Calculation: can be used to find probabilities for both discrete and continuous distributions.
- Median and Percentiles: The median is the value of such that . Percentiles can be found similarly.
- Comparison of Distributions: CDFs are useful for comparing different distributions or assessing the fit of a distribution to data.
In summary, the Cumulative Distribution Function (CDF) in statistics is a fundamental concept that provides a comprehensive view of the probability distribution of a random variable, both for discrete and continuous cases. It allows us to calculate probabilities, find medians and percentiles, and compare distributions effectively.