Skip to main content
Back to Statistics & Probability
JEE Main 2021
Statistics & Probability
Statistics
Hard

Question

The mean and standard deviation of 20 observations were calculated as 10 and 2.5 respectively. It was found that by mistake one data value was taken as 25 instead of 35. if α\alpha and β\sqrt \beta are the mean and standard deviation respectively for correct data, then (α\alpha, β\beta) is :

Options

Solution

1. Key Concepts and Formulas

To accurately solve problems involving corrected statistical measures, a solid understanding of the fundamental definitions and computational formulas for mean and variance is essential.

  • Mean (x\overline{x}): The average of a dataset, calculated by summing all observations and dividing by the number of observations (NN). x=xiN\overline{x} = \frac{\sum x_i}{N}
  • Variance (σ2\sigma^2): A measure of data dispersion, indicating how spread out the numbers are from the mean. The computational formula is particularly useful for correction problems as it directly involves the sum of squares: σ2=xi2N(x)2\sigma^2 = \frac{\sum x_i^2}{N} - (\overline{x})^2
  • Standard Deviation (σ\sigma): The square root of the variance, providing a measure of spread in the same units as the original data. σ=σ2\sigma = \sqrt{\sigma^2}

2. Step-by-Step Solution

We are given N=20N=20, incorrect mean xinc=10\overline{x}_{\text{inc}} = 10, incorrect standard deviation σinc=2.5\sigma_{\text{inc}} = 2.5. A value of 2525 was mistakenly taken instead of 3535. We need to find the correct mean α\alpha and correct variance β\beta (since β\sqrt{\beta} is the correct standard deviation).

Step 1: Calculate the Incorrect Sum of Observations (xi,inc\sum x_{i, \text{inc}})

  • What we're doing: We first determine the total sum of all observations as it was originally calculated, including the erroneous value. This is a prerequisite for correcting the sum.
  • Why we're doing it: The mean formula directly relates the sum of observations to the mean and the number of observations. We use the given incorrect mean to find this initial sum.
  • Using the mean formula: xinc=xi,incN\overline{x}_{\text{inc}} = \frac{\sum x_{i, \text{inc}}}{N}
  • Substitute the given values: 10=xi,inc2010 = \frac{\sum x_{i, \text{inc}}}{20}
  • Solve for the incorrect sum: xi,inc=10×20=200\sum x_{i, \text{inc}} = 10 \times 20 = 200

Step 2: Calculate the Correct Sum of Observations (xi,corr\sum x_{i, \text{corr}})

  • What we're doing: We adjust the incorrect sum by removing the contribution of the wrong data point and adding the contribution of the correct data point.
  • Why we're doing it: This step directly corrects the cumulative sum of values, which will then allow us to calculate the true mean.
  • The correction formula is: xi,corr=xi,inc(wrong value)+(correct value)\sum x_{i, \text{corr}} = \sum x_{i, \text{inc}} - (\text{wrong value}) + (\text{correct value})
  • Substitute the values: xi,corr=20025+35\sum x_{i, \text{corr}} = 200 - 25 + 35 xi,corr=175+35=210\sum x_{i, \text{corr}} = 175 + 35 = 210

Step 3: Calculate the Correct Mean (α\alpha)

  • What we're doing: We compute the correct mean using the newly found correct sum of observations.
  • Why we're doing it: This is the primary goal for the mean calculation, using the fundamental definition of mean with the corrected data.
  • Using the mean formula with the correct sum: α=xcorr=xi,corrN\alpha = \overline{x}_{\text{corr}} = \frac{\sum x_{i, \text{corr}}}{N}
  • Substitute the values: α=21020\alpha = \frac{210}{20} α=10.5\alpha = 10.5 Thus, the correct mean α\alpha is 10.510.5.

Step 4: Calculate the Incorrect Variance (σinc2\sigma_{\text{inc}}^2)

  • What we're doing: We convert the given incorrect standard deviation into incorrect variance.
  • Why we're doing it: The variance formula is used to work with sums of squares, so having the variance directly simplifies subsequent calculations.
  • From the given incorrect standard deviation: σinc2=(2.5)2=6.25\sigma_{\text{inc}}^2 = (2.5)^2 = 6.25

Step 5: Calculate the Incorrect Sum of Squares of Observations (xi,inc2\sum x_{i, \text{inc}}^2)

  • What we're doing: We determine the total sum of the squares of all observations as it was originally calculated, including the square of the erroneous value.
  • Why we're doing it: To find the correct variance, we need the correct sum of squares. We use the incorrect variance and incorrect mean to work backward and find the initial sum of squares.
  • Rearranging the variance formula: σinc2=xi,inc2N(xinc)2\sigma_{\text{inc}}^2 = \frac{\sum x_{i, \text{inc}}^2}{N} - (\overline{x}_{\text{inc}})^2 xi,inc2=N(σinc2+(xinc)2)\sum x_{i, \text{inc}}^2 = N \left( \sigma_{\text{inc}}^2 + (\overline{x}_{\text{inc}})^2 \right)
  • Substitute the known incorrect values: xi,inc2=20(6.25+(10)2)\sum x_{i, \text{inc}}^2 = 20 \left( 6.25 + (10)^2 \right) xi,inc2=20(6.25+100)\sum x_{i, \text{inc}}^2 = 20 \left( 6.25 + 100 \right) xi,inc2=20×106.25=2125\sum x_{i, \text{inc}}^2 = 20 \times 106.25 = 2125

Step 6: Calculate the Correct Sum of Squares of Observations (xi,corr2\sum x_{i, \text{corr}}^2)

  • What we're doing: We adjust the incorrect sum of squares by removing the square of the wrong data point and adding the square of the correct data point.
  • Why we're doing it: Variance depends on the sum of squares. This correction is crucial to ensure the subsequent variance calculation is based on the true data.
  • The correction formula for sum of squares: xi,corr2=xi,inc2(wrong value)2+(correct value)2\sum x_{i, \text{corr}}^2 = \sum x_{i, \text{inc}}^2 - (\text{wrong value})^2 + (\text{correct value})^2
  • Substitute the values: xi,corr2=2125(25)2+(35)2\sum x_{i, \text{corr}}^2 = 2125 - (25)^2 + (35)^2 xi,corr2=2125625+1225\sum x_{i, \text{corr}}^2 = 2125 - 625 + 1225 xi,corr2=1500+1225=2725\sum x_{i, \text{corr}}^2 = 1500 + 1225 = 2725

Step 7: Calculate the Correct Variance (β\beta)

  • What we're doing: We compute the correct variance using the newly found correct sum of squares and the correct mean.
  • Why we're doing it: This is the primary goal for the variance calculation. It's critical to use the correct mean (α\alpha) calculated in Step 3 for this step.
  • Using the variance formula with the corrected values: β=σcorr2=xi,corr2N(xcorr)2\beta = \sigma_{\text{corr}}^2 = \frac{\sum x_{i, \text{corr}}^2}{N} - (\overline{x}_{\text{corr}})^2
  • Substitute the correct sum of squares, number of observations, and the correct mean (α=10.5\alpha = 10.5): β=272520(10.5)2\beta = \frac{2725}{20} - (10.5)^2 β=136.25110.25\beta = 136.25 - 110.25 β=26\beta = 26 Thus, the correct variance β\beta is 2626.

Step 8: State the Final Corrected Measures

  • We have found the correct mean α=10.5\alpha = 10.5 and the correct variance β=26\beta = 26.
  • Therefore, the pair (α,β)(\alpha, \beta) is (10.5,26)(10.5, 26).

3. Common Mistakes & Tips

  • Always Correct the Sums First: The most critical step in these problems is to first correct the sum of observations (xi\sum x_i) and the sum of squares of observations (xi2\sum x_i^2). Recalculate mean and variance only after these sums are accurate.
  • Use the Corrected Mean for Variance: A frequent error is using the original (incorrect) mean when calculating the corrected variance. Always use the newly calculated correct mean (α\alpha) in the variance formula for the corrected data.
  • Square Values for Sum of Squares: When correcting the sum of squares, remember to subtract the square of the wrong value and add the square of the correct value, i.e., (wrong)2+(correct)2-(\text{wrong})^2 + (\text{correct})^2. Do not simply subtract/add the values themselves.

4. Summary

This problem demonstrates a systematic approach to correcting statistical measures when an error in a single data point is identified. The process involves first determining the initial (incorrect) sums of observations and their squares. These sums are then adjusted by removing the contribution of the erroneous value and adding the contribution of the correct value. Finally, the corrected mean and variance are calculated using these adjusted sums, ensuring that the correct mean is used in the variance calculation. Following these steps leads to the correct mean α=10.5\alpha = 10.5 and the correct variance β=26\beta = 26.

5. Final Answer The final answer is (10.5,26)\boxed{(10.5, 26)} which corresponds to option (D).

Practice More Statistics & Probability Questions

View All Questions