Question
The mean and variance of 10 observations were calculated as 15 and 15 respectively by a student who took by mistake 25 instead of 15 for one observation. Then, the correct standard deviation is _____________.
Answer: 1
Solution
Key Concepts and Formulas
To accurately solve problems involving the correction of statistical data, we rely on the fundamental definitions of mean, variance, and standard deviation. For a set of observations :
-
Mean (): The average value, indicating the central tendency of the data. Here, represents the sum of all observations.
-
Variance (): A measure of how spread out the data points are from their mean. A widely used computational formula is: Here, represents the sum of the squares of all observations.
-
Standard Deviation (): The square root of the variance. It is expressed in the same units as the data, making it easier to interpret the typical deviation from the mean. By convention, standard deviation is always a non-negative value.
Problem-Solving Strategy
When an error in an observation is identified, the strategy to find the correct statistical measures involves a systematic correction process:
- Extract Incorrect Sums: Use the given incorrect mean and variance to determine the incorrect sum of observations () and the incorrect sum of squares of observations ().
- Adjust for Error: Correct these sums by subtracting the value (or its square) of the erroneous observation and adding the value (or its square) of the correct observation. This yields the correct sum of observations () and correct sum of squares of observations ().
- Calculate Correct Mean: Use the corrected sum of observations to find the correct mean ().
- Calculate Correct Variance: Employ the corrected sum of squares and the corrected mean to compute the correct variance ().
- Calculate Correct Standard Deviation: Finally, take the square root of the correct variance to find the correct standard deviation ().
Step-by-Step Solution
We are given information about 10 observations, where one observation was mistakenly recorded. Our goal is to find the correct standard deviation.
Given Information:
- Number of observations () = 10
- Incorrect Mean () = 15
- Incorrect Variance () = 15
- Observation mistakenly taken () = 25
- Correct observation () = 15
Step 1: Extracting Information from Incorrect Calculations
First, we use the given incorrect mean and variance to find the sums based on the erroneous data.
a. Calculate the Incorrect Sum of Observations (): The definition of the mean allows us to find the total sum of observations. We rearrange this to solve for the sum: Substituting the given values:
- Why this step? This sum (150) includes the incorrect observation (25). It's our starting point to adjust the total sum of the dataset correctly.
b. Calculate the Incorrect Sum of Squares of Observations (): We use the computational formula for variance to find the sum of squares. Substituting the given incorrect variance and mean: Now, we isolate the term : Finally, we find the sum of squares:
- Why this step? This sum of squares (2400) includes the square of the incorrect observation (). This value is crucial for correctly calculating the new variance after adjusting for the error.
Step 2: Correcting the Sums
Now we adjust the sums obtained in Step 1 to reflect the correct observation.
a. Calculate the Correct Sum of Observations (): To correct the sum, we remove the incorrect observation's value and add the correct observation's value. Substituting the values:
- Why this step? This gives us the true sum of all 10 observations, having replaced the erroneous value with the correct one.
b. Calculate the Correct Sum of Squares of Observations (): Similarly, to correct the sum of squares, we subtract the square of the incorrect observation and add the square of the correct observation. Substituting the values:
- Why this step? It's essential to adjust the sum of squares by subtracting the square of the incorrect value and adding the square of the correct value. This is a common point of error if one were to mistakenly subtract/add the values themselves rather than their squares.
Step 3: Calculating the Correct Mean ()
With the correct sum of observations, we can now find the correct mean. The number of observations () remains 10.
- Why this step? The correct mean is a necessary component for calculating the correct variance using its computational formula.
Step 4: Calculating the Correct Variance ()
Using the corrected sum of squares and the newly calculated correct mean, we can determine the correct variance. Substituting the corrected values:
- Why this step? This is the central calculation for the spread of the corrected data. It's vital to use both the correct sum of squares and the correct mean to ensure accuracy.
Step 5: Calculating the Correct Standard Deviation ()
Finally, we find the correct standard deviation by taking the positive square root of the correct variance.
- Why this step? The question specifically asks for the standard deviation, which is the final step in this correction process. Standard deviation is always non-negative, representing a magnitude of dispersion.
Common Mistakes & Tips
- Adjust Sums First: Always work with the sums ( and ) as intermediate steps. Do not attempt to directly adjust the mean or variance values, as this often leads to errors.
- Square the Values for Sum of Squares: When correcting the sum of squares (), remember to subtract the square of the incorrect observation and add the square of the correct observation, i.e., and . A common mistake is to use and directly.
- Use Correct Mean for Correct Variance: After calculating the correct sum of observations and sum of squares, ensure you calculate the correct mean before using it in the variance formula. Using the original (incorrect) mean with corrected sums of squares is a frequent error.
Summary
To correct the mean and variance after a data entry error, the most reliable method is to first reverse the calculations to find the incorrect sums of observations and their squares. Then, adjust these sums by removing the erroneous data point's contribution and adding the correct data point's contribution. Finally, recalculate the mean, variance, and standard deviation using these corrected sums. Following these steps systematically ensures an accurate result.
The final answer is .