Sampling and Repeated Trials
SAMPLING & REPEATED TRIALS
Chapter 2 of the “Stats 2 book” introduces the crucial concepts of repeated trials and sampling , providing the mathematical framework for analysing sequences of experiments where the outcomes influence each other or are drawn from a limited pool. This framework builds upon the basic concepts of probability space and events discussed in Chapter 1.
The chapter primarily focuses on three major statistical distributions that arise from repeating experiments: Bernoulli Trials (leading to Binomial and Geometric distributions), the Poisson Approximation, and the Hypergeometric Distribution (for sampling without replacement).
2.1 Bernoulli Trials: Success, Failure, and Repetition
Bernoulli trials form the foundation for modelling repetitive experiments that yield a binary outcome.
Concept Explanation: Bernoulli Trial
Bernoulli Trial
A single Bernoulli trial is an experiment where the outcome is classified strictly as either a “Success” or a “Failure”.
- Let be the probability of success on any given trial.
- The trials in a sequence are always assumed to be independent.
| Experiment | Success Event Example | Probability () |
|---|---|---|
| Toss a fair coin | Head appears | |
| Roll a die | Six appears |
If we denote a single trial with parameter as , we define the sample space where .
The Binomial Distribution (Counting Successes)
When you perform independent Bernoulli trials and you are interested in the total number of successes () observed, the resulting distribution is the Binomial Distribution ().
Binomial Probability
For
Interpretation
Probability of exactly k successes in n trials.
counts the arrangements.
is the probability of one specific sequence.
Example Question and Concept Application (Mode)
Free Throw Mode
Question: If a basketball player, Mark, is a free-throw shooter and attempts independent free throws, what is the most likely number of shots he will make?
📝 View Detailed Solution ▼
Concept Explanation (Mode): The “mode” is the value of that makes the probability largest. For a distribution, the most likely number of successes is the integer given by the formula: .
Solution: Using and : The most likely number of successful throws is 7.
The Geometric Distribution (Waiting for Success)
The Geometric Distribution () describes the probability that the first success occurs exactly on the trial.
Geometric Probability
For the first success to occur on the trial, the first trials must all be failures, and the trial must be a success.
Key Property: The Geometric distribution is memoryless. This means that if an event (like finding a head on a coin toss) hasn’t occurred by trial , the probability that it takes more trials is the same as the probability it would have taken trials from the start.
Example Question and Solution
Rolling a Six
Question: A fair die is rolled repeatedly. What is the probability that the first 6 appears exactly on the fifth roll?
📝 View Detailed Solution ▼
Concept: Success is rolling a 6 (). We are looking for .
Solution: The probability of failure is .
2.2 Poisson Approximation: When is Large and is Small
Calculating Binomial probabilities directly becomes computationally challenging when the number of trials () is very large. The Poisson Distribution serves as an effective approximation in cases where is large and the probability of success () is very small, provided the product remains constant.
Poisson Formula
where (Average Rate)
Limit Case
Approximation for Binomial when:
- constant
Example Question and Solution (Approximation)
Independence Day Births
Question: A college has students. Assuming a birthrate probability , what is the probability that five or more students were born on Independence Day?
📝 View Detailed Solution ▼
Concept: Since is large (1460) and is small (), we use the Poisson approximation.
- Calculate : .
- The problem asks for , where .
Solution: It is easier to calculate the complement . This calculation yields an approximate probability of 0.3711631.
2.3 Sampling: With and Without Replacement
The methods used for Bernoulli trials (Binomial distribution) assume independence, which implies “sampling with replacement” (i.e., selecting an item and making it available for future selection).
However, in many real-world scenarios, sampling is done without replacement, meaning that once an item is selected, it is removed from the population pool, and subsequent trials are therefore dependent on previous outcomes.
The Hypergeometric Distribution (Sampling Without Replacement)
The Hypergeometric Distribution () models the number of successes () found in a sample when sampling without replacement from a finite population.
- : Total population size.
- : Number of individuals with the characteristic (successes) in .
- : Sample size chosen.
- : Number of successes found in the sample.
Concept: Hypergeometric Probability
Hypergeometric Formula
The probability is calculated by counting the number of ways to select successes from available items, and failures from available items, divided by the total number of ways to choose items from .
Example Question and Solution (Town Residents)
Town Demographics
Question: In a town of residents, are under age 18. If residents are selected randomly without replacement, what is the probability that exactly of them will be under 18?
📝 View Detailed Solution ▼
Solution: We use the Hypergeometric formula with , , , and . The number of non-successes is .
This result is approximately 0.153592.
Approximation Note:
- Hypergeometric Result:
- Binomial Approximation Result: