Question
If the data x 1 , x 2 ,......., x 10 is such that the mean of first four of these is 11, the mean of the remaining six is 16 and the sum of squares of all of these is 2,000 ; then the standard deviation of this data is :
Options
Solution
Here is a clear, educational, and well-structured solution to the problem:
1. Key Concepts and Formulas
To determine the standard deviation of a dataset, we primarily rely on the following statistical measures and formulas:
- Mean (): The average of all observations in a dataset. For a dataset , the mean is calculated as:
- Variance (): A measure of how much the data points deviate from the mean. While there's a definitional formula involving squared differences from the mean, for computational efficiency, especially when the sum of squares of observations () is given, we use the following formula:
- Standard Deviation (): The positive square root of the variance. It's preferred for interpretation as it is in the same units as the original data.
Our goal is to find the standard deviation () of the entire dataset. This requires us to first calculate the overall mean () and then use the given sum of squares () to find the variance (), before taking its square root.
2. Step-by-Step Solution
We are given a dataset of 10 observations: . Therefore, the total number of observations, .
The problem provides the following information:
- Mean of the first four observations () is .
- Mean of the remaining six observations () is .
- Sum of squares of all 10 observations is .
Let's proceed with the calculations:
Step 1: Calculate the Total Sum of All Observations ()
-
Why this step? To find the overall mean of the entire dataset, we first need the sum of all its observations. The problem provides means for two subsets, which allows us to find the sum of observations for each subset and then combine them. Recall that .
-
For the first four observations (): Number of observations = 4 Mean () = 11 Sum of these observations () = .
-
For the remaining six observations (): Number of observations = 6 Mean () = 16 Sum of these observations () = .
-
-
Total Sum of all 10 observations: The total sum is the sum of the sums of the two subsets:
Step 2: Calculate the Mean of the Entire Dataset ()
-
Why this step? The overall mean () is a crucial component for calculating the variance using the computational formula: . We now have the total sum of observations and the total number of observations.
-
The mean of the entire dataset of 10 observations is:
Step 3: Identify the Sum of Squares of All Observations ()
-
Why this step? This value is directly provided in the problem statement and is the other essential component for the variance formula.
-
From the problem statement:
Step 4: Calculate the Variance ()
-
Why this step? Now that we have all the necessary components (, , and ), we can directly apply the computational formula for variance.
-
Using the formula: Substitute the values we found: Perform the calculations:
Step 5: Calculate the Standard Deviation ()
-
Why this step? The standard deviation is the positive square root of the variance. This is the final quantity the question asks for.
-
Taking the positive square root of the variance:
3. Common Mistakes & Tips
- Computational Formula is Your Friend: Always remember the computational formula for variance () as it often simplifies calculations significantly, especially in competitive exams.
- Don't Confuse Variance and Standard Deviation: A frequent error is to calculate the variance and present it as the final answer. Remember to take the square root to get the standard deviation.
- Accuracy in Arithmetic: Even with the correct formulas, small calculation errors can lead to incorrect answers. Double-check your arithmetic, especially when dealing with sums and squares.
- Overall Mean vs. Subset Means: When dealing with combined data, ensure you calculate the overall mean for the entire dataset, not just average the means of the subsets.
4. Summary
In this problem, we systematically determined the standard deviation of a dataset of 10 observations. We began by using the given means of two subsets (first four and remaining six observations) to calculate the total sum of all observations. This total sum, combined with the total number of observations, allowed us to find the overall mean of the entire dataset. Finally, using the given sum of squares of all observations and the calculated overall mean, we applied the efficient computational formula for variance. Taking the square root of the variance yielded the standard deviation.
The final answer is , which corresponds to option (B).