Skip to main content
Back to Statistics & Probability
JEE Main 2020
Statistics & Probability
Statistics
Hard

Question

A data consists of n observations : x 1 , x 2 , . . . . . . ., x n . If i=1n(xi+1)2=9n\sum\limits_{i = 1}^n {{{\left( {{x_i} + 1} \right)}^2}} = 9n and i=1n(xi1)2=5n,\sum\limits_{i = 1}^n {{{\left( {{x_i} - 1} \right)}^2}} = 5n, then the standard deviation of this data is :

Options

Solution

1. Key Concepts and Formulas

To accurately determine the standard deviation of a data set, we rely on the following fundamental statistical definitions and computational formulas:

  • Standard Deviation (σ\sigma): A measure of the average amount of variability or dispersion of data points around the mean. The most common computational formula for standard deviation is: σ=i=1nxi2n(xˉ)2\sigma = \sqrt{\frac{\sum_{i=1}^n x_i^2}{n} - \left(\bar{x}\right)^2} where nn is the number of observations, xi2\sum x_i^2 is the sum of the squares of the observations, and xˉ\bar{x} is the mean.

  • Arithmetic Mean (xˉ\bar{x}): The average of all observations in the data set. xˉ=i=1nxin\bar{x} = \frac{\sum_{i=1}^n x_i}{n} where xi\sum x_i is the sum of all observations.

  • Properties of Summation: These properties are crucial for expanding and simplifying the given equations:

    • i=1n(Ai±Bi)=i=1nAi±i=1nBi\sum_{i=1}^n (A_i \pm B_i) = \sum_{i=1}^n A_i \pm \sum_{i=1}^n B_i (Distributivity over addition/subtraction)
    • i=1nc=nc\sum_{i=1}^n c = nc (Sum of a constant cc for nn terms)
    • i=1ncAi=ci=1nAi\sum_{i=1}^n c \cdot A_i = c \cdot \sum_{i=1}^n A_i (Constant multiple rule)

2. Step-by-Step Solution

Our goal is to find xi\sum x_i (to get xˉ\bar{x}) and xi2\sum x_i^2 from the given equations. We will expand the summations and form a system of linear equations.

Step 1: Expand and Simplify the First Given Equation We are given the equation: \sum_{i=1}^n {{\left( {{x_i} + 1} \right)}^2}} = 8n

  • Why this step? We need to transform the given summation into a form that explicitly involves xi2\sum x_i^2 and xi\sum x_i, which are the components needed for the standard deviation formula.

  • Action: First, expand the squared term (xi+1)2(x_i + 1)^2 using the identity (a+b)2=a2+2ab+b2(a+b)^2 = a^2 + 2ab + b^2: (xi+1)2=xi2+2xi+1(x_i + 1)^2 = x_i^2 + 2x_i + 1 Substitute this expansion back into the summation: i=1n(xi2+2xi+1)=8n\sum_{i=1}^n {\left( {x_i^2 + 2x_i + 1} \right)} = 8n

  • Action: Apply the summation properties: distribute the summation, pull out the constant factor, and sum the constant term. i=1nxi2+i=1n2xi+i=1n1=8n\sum_{i=1}^n {x_i^2} + \sum_{i=1}^n {2x_i} + \sum_{i=1}^n {1} = 8n i=1nxi2+2i=1nxi+n=8n\sum_{i=1}^n {x_i^2} + 2\sum_{i=1}^n {x_i} + n = 8n

  • Action: Isolate the terms involving xix_i by subtracting nn from both sides: i=1nxi2+2i=1nxi=8nn\sum_{i=1}^n {x_i^2} + 2\sum_{i=1}^n {x_i} = 8n - n i=1nxi2+2i=1nxi=7n(1)\sum_{i=1}^n {x_i^2} + 2\sum_{i=1}^n {x_i} = 7n \quad \dots(1) This is our first simplified equation.

Step 2: Expand and Simplify the Second Given Equation We are given the second equation: \sum_{i=1}^n {{\left( {{x_i} - 1} \right)}^2}} = 4n

  • Why this step? Similar to Step 1, we need to expand this summation to get another equation in terms of xi2\sum x_i^2 and xi\sum x_i, forming a system of equations.

  • Action: Expand the squared term (xi1)2(x_i - 1)^2 using the identity (ab)2=a22ab+b2(a-b)^2 = a^2 - 2ab + b^2: (xi1)2=xi22xi+1(x_i - 1)^2 = x_i^2 - 2x_i + 1 Substitute this expansion into the summation: i=1n(xi22xi+1)=4n\sum_{i=1}^n {\left( {x_i^2 - 2x_i + 1} \right)} = 4n

  • Action: Apply the summation properties: i=1nxi2i=1n2xi+i=1n1=4n\sum_{i=1}^n {x_i^2} - \sum_{i=1}^n {2x_i} + \sum_{i=1}^n {1} = 4n i=1nxi22i=1nxi+n=4n\sum_{i=1}^n {x_i^2} - 2\sum_{i=1}^n {x_i} + n = 4n

  • Action: Isolate the terms involving xix_i: i=1nxi22i=1nxi=4nn\sum_{i=1}^n {x_i^2} - 2\sum_{i=1}^n {x_i} = 4n - n i=1nxi22i=1nxi=3n(2)\sum_{i=1}^n {x_i^2} - 2\sum_{i=1}^n {x_i} = 3n \quad \dots(2) This is our second simplified equation.

Step 3: Solve the System of Equations for xi2\sum x_i^2 and xi\sum x_i We now have a system of two linear equations: (1) xi2+2xi=7n\sum x_i^2 + 2\sum x_i = 7n (2) xi22xi=3n\sum x_i^2 - 2\sum x_i = 3n

  • Why this step? We need the exact values of xi2\sum x_i^2 and xi\sum x_i to calculate the mean and standard deviation. Solving this system will provide those values.

  • Action (Finding xi2\sum x_i^2): Add Equation (1) and Equation (2) to eliminate the xi\sum x_i term. (xi2+2xi)+(xi22xi)=7n+3n(\sum x_i^2 + 2\sum x_i) + (\sum x_i^2 - 2\sum x_i) = 7n + 3n 2xi2=10n2\sum x_i^2 = 10n Divide by 2: xi2=5n\sum x_i^2 = 5n

  • Action (Finding xi\sum x_i): Subtract Equation (2) from Equation (1) to eliminate the xi2\sum x_i^2 term. (xi2+2xi)(xi22xi)=7n3n(\sum x_i^2 + 2\sum x_i) - (\sum x_i^2 - 2\sum x_i) = 7n - 3n 4xi=4n4\sum x_i = 4n Divide by 4: xi=n\sum x_i = n

Step 4: Calculate the Mean (xˉ\bar{x}) Now that we have xi\sum x_i, we can calculate the mean.

  • Why this step? The mean is a critical component required for the standard deviation formula.

  • Action: Use the formula xˉ=xin\bar{x} = \frac{\sum x_i}{n} and substitute xi=n\sum x_i = n. xˉ=nn\bar{x} = \frac{n}{n} xˉ=1\bar{x} = 1

Step 5: Calculate the Standard Deviation (σ\sigma) We have all the necessary components:

  • xi2=5n\sum x_i^2 = 5n

  • xˉ=1\bar{x} = 1

  • nn (number of observations)

  • Why this step? This is the final calculation to answer the question.

  • Action: Substitute these values into the standard deviation formula: σ=xi2n(xˉ)2\sigma = \sqrt{\frac{\sum x_i^2}{n} - \left(\bar{x}\right)^2} σ=5nn(1)2\sigma = \sqrt{\frac{5n}{n} - (1)^2} σ=51\sigma = \sqrt{5 - 1} σ=4\sigma = \sqrt{4} σ=2\sigma = 2

3. Common Mistakes & Tips

  • Algebraic Expansion: Be meticulous when expanding terms like (xi±c)2(x_i \pm c)^2. A common error is omitting the middle term (2cxi2cx_i) or getting its sign incorrect.
  • Summation Properties: Remember that i=1nc=nc\sum_{i=1}^n c = nc, not just cc. Also, ensure constant factors are correctly pulled outside the summation.
  • System of Equations: Treat xi2\sum x_i^2 and xi\sum x_i as distinct variables and solve the system carefully. Double-check arithmetic, especially during subtraction.
  • Variance vs. Standard Deviation: Remember to take the square root at the end. The quantity xi2n(xˉ)2\frac{\sum x_i^2}{n} - (\bar{x})^2 represents the variance (σ2\sigma^2), not the standard deviation itself.

4. Summary

This problem effectively tests your ability to apply algebraic manipulation alongside core statistical definitions. The solution involved:

  1. Expanding the given summations using algebraic identities and summation properties.
  2. Forming and solving a system of two linear equations to find xi2\sum x_i^2 and xi\sum x_i.
  3. Calculating the mean (xˉ\bar{x}) from xi\sum x_i.
  4. Finally, substituting these derived values into the standard deviation formula to obtain the answer. The careful handling of summations and simultaneous equations is key to solving such problems efficiently.

The final answer is 2\boxed{2}, which corresponds to option (A).

Practice More Statistics & Probability Questions

View All Questions