Skip to main content
Back to Statistics & Probability
JEE Main 2021
Statistics & Probability
Statistics
Hard

Question

If the data x 1 , x 2 ,......., x 10 is such that the mean of first four of these is 11, the mean of the remaining six is 16 and the sum of squares of all of these is 2,000 ; then the standard deviation of this data is :

Options

Solution

Here is a clear, educational, and well-structured solution to the problem:


1. Key Concepts and Formulas

To determine the standard deviation of a dataset, we primarily rely on the following statistical measures and formulas:

  • Mean (xˉ\bar{x}): The average of all observations in a dataset. For a dataset x1,x2,,xnx_1, x_2, \ldots, x_n, the mean is calculated as: xˉ=i=1nxin\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}
  • Variance (σ2\sigma^2): A measure of how much the data points deviate from the mean. While there's a definitional formula involving squared differences from the mean, for computational efficiency, especially when the sum of squares of observations (xi2\sum x_i^2) is given, we use the following formula: σ2=i=1nxi2n(xˉ)2\sigma^2 = \frac{\sum_{i=1}^{n} x_i^2}{n} - (\bar{x})^2
  • Standard Deviation (σ\sigma): The positive square root of the variance. It's preferred for interpretation as it is in the same units as the original data. σ=σ2\sigma = \sqrt{\sigma^2}

Our goal is to find the standard deviation (σ\sigma) of the entire dataset. This requires us to first calculate the overall mean (xˉ\bar{x}) and then use the given sum of squares (xi2\sum x_i^2) to find the variance (σ2\sigma^2), before taking its square root.

2. Step-by-Step Solution

We are given a dataset of 10 observations: x1,x2,,x10x_1, x_2, \ldots, x_{10}. Therefore, the total number of observations, n=10n = 10.

The problem provides the following information:

  • Mean of the first four observations (n1=4n_1=4) is xˉ1=11\bar{x}_1 = 11.
  • Mean of the remaining six observations (n2=6n_2=6) is xˉ2=16\bar{x}_2 = 16.
  • Sum of squares of all 10 observations is i=110xi2=2000\sum_{i=1}^{10} x_i^2 = 2000.

Let's proceed with the calculations:

Step 1: Calculate the Total Sum of All Observations (i=110xi\sum_{i=1}^{10} x_i)

  • Why this step? To find the overall mean of the entire dataset, we first need the sum of all its observations. The problem provides means for two subsets, which allows us to find the sum of observations for each subset and then combine them. Recall that Sum of observations=Mean×Number of observations\text{Sum of observations} = \text{Mean} \times \text{Number of observations}.

    • For the first four observations (x1,x2,x3,x4x_1, x_2, x_3, x_4): Number of observations = 4 Mean (xˉ1\bar{x}_1) = 11 Sum of these observations (S1S_1) = 11×4=4411 \times 4 = 44.

    • For the remaining six observations (x5,x6,,x10x_5, x_6, \ldots, x_{10}): Number of observations = 6 Mean (xˉ2\bar{x}_2) = 16 Sum of these observations (S2S_2) = 16×6=9616 \times 6 = 96.

  • Total Sum of all 10 observations: The total sum is the sum of the sums of the two subsets: i=110xi=S1+S2=44+96=140\sum_{i=1}^{10} x_i = S_1 + S_2 = 44 + 96 = 140

Step 2: Calculate the Mean of the Entire Dataset (xˉ\bar{x})

  • Why this step? The overall mean (xˉ\bar{x}) is a crucial component for calculating the variance using the computational formula: σ2=xi2n(xˉ)2\sigma^2 = \frac{\sum x_i^2}{n} - (\bar{x})^2. We now have the total sum of observations and the total number of observations.

  • The mean of the entire dataset of 10 observations is: xˉ=i=110xin=14010=14\bar{x} = \frac{\sum_{i=1}^{10} x_i}{n} = \frac{140}{10} = 14

Step 3: Identify the Sum of Squares of All Observations (i=110xi2\sum_{i=1}^{10} x_i^2)

  • Why this step? This value is directly provided in the problem statement and is the other essential component for the variance formula.

  • From the problem statement: i=110xi2=2000\sum_{i=1}^{10} x_i^2 = 2000

Step 4: Calculate the Variance (σ2\sigma^2)

  • Why this step? Now that we have all the necessary components (xi2=2000\sum x_i^2 = 2000, xˉ=14\bar{x} = 14, and n=10n = 10), we can directly apply the computational formula for variance.

  • Using the formula: σ2=xi2n(xˉ)2\sigma^2 = \frac{\sum x_i^2}{n} - (\bar{x})^2 Substitute the values we found: σ2=200010(14)2\sigma^2 = \frac{2000}{10} - (14)^2 Perform the calculations: σ2=200196\sigma^2 = 200 - 196 σ2=4\sigma^2 = 4

Step 5: Calculate the Standard Deviation (σ\sigma)

  • Why this step? The standard deviation is the positive square root of the variance. This is the final quantity the question asks for.

  • Taking the positive square root of the variance: σ=σ2\sigma = \sqrt{\sigma^2} σ=4\sigma = \sqrt{4} σ=2\sigma = 2

3. Common Mistakes & Tips

  • Computational Formula is Your Friend: Always remember the computational formula for variance (σ2=xi2n(xˉ)2\sigma^2 = \frac{\sum x_i^2}{n} - (\bar{x})^2) as it often simplifies calculations significantly, especially in competitive exams.
  • Don't Confuse Variance and Standard Deviation: A frequent error is to calculate the variance and present it as the final answer. Remember to take the square root to get the standard deviation.
  • Accuracy in Arithmetic: Even with the correct formulas, small calculation errors can lead to incorrect answers. Double-check your arithmetic, especially when dealing with sums and squares.
  • Overall Mean vs. Subset Means: When dealing with combined data, ensure you calculate the overall mean for the entire dataset, not just average the means of the subsets.

4. Summary

In this problem, we systematically determined the standard deviation of a dataset of 10 observations. We began by using the given means of two subsets (first four and remaining six observations) to calculate the total sum of all observations. This total sum, combined with the total number of observations, allowed us to find the overall mean of the entire dataset. Finally, using the given sum of squares of all observations and the calculated overall mean, we applied the efficient computational formula for variance. Taking the square root of the variance yielded the standard deviation.

The final answer is 2\boxed{2}, which corresponds to option (B).

Practice More Statistics & Probability Questions

View All Questions