Skip to main content
Back to Statistics & Probability
JEE Main 2020
Statistics & Probability
Statistics
Hard

Question

Consider a set of 3n numbers having variance 4. In this set, the mean of first 2n numbers is 6 and the mean of the remaining n numbers is 3. A new set is constructed by adding 1 into each of first 2n numbers, and subtracting 1 from each of the remaining n numbers. If the variance of the new set is k, then 9k is equal to __________.

Answer: 2

Solution

This problem tests a comprehensive understanding of statistical measures like mean and variance, and how they behave under transformations of data, especially when different transformations are applied to different subsets of the data. We will systematically use the fundamental definitions of mean and variance to solve it.


  1. Key Concepts and Formulas

    • Mean (xˉ\bar{x}): For a set of NN observations x1,x2,,xNx_1, x_2, \ldots, x_N, the mean is given by: xˉ=i=1NxiN\bar{x} = \frac{\sum_{i=1}^{N} x_i}{N}
    • Variance (σ2\sigma^2): For a set of NN observations x1,x2,,xNx_1, x_2, \ldots, x_N with mean xˉ\bar{x}, the variance can be calculated using the formula: σ2=i=1N(xixˉ)2Nor equivalentlyσ2=i=1Nxi2N(xˉ)2\sigma^2 = \frac{\sum_{i=1}^{N} (x_i - \bar{x})^2}{N} \quad \text{or equivalently} \quad \sigma^2 = \frac{\sum_{i=1}^{N} x_i^2}{N} - (\bar{x})^2
    • Properties of Variance under Transformations:
      • If a constant cc is added to all observations in a set (yi=xi+cy_i = x_i + c), the mean changes to yˉ=xˉ+c\bar{y} = \bar{x} + c, but the variance remains unchanged, σy2=σx2\sigma_y^2 = \sigma_x^2.
      • If a constant aa multiplies all observations in a set (yi=axiy_i = a x_i), the mean changes to yˉ=axˉ\bar{y} = a \bar{x}, and the variance changes to σy2=a2σx2\sigma_y^2 = a^2 \sigma_x^2.
      • When different constants are added or subtracted to different subsets of the data, the variance of the entire set typically changes, and must be re-calculated using the fundamental definitions.
  2. Step-by-Step Solution

    Let the original set of 3n3n numbers be denoted by X={x1,x2,,x3n}X = \{x_1, x_2, \ldots, x_{3n}\}. We can divide this set into two subsets: X1={x1,,x2n}X_1 = \{x_1, \ldots, x_{2n}\} and X2={x2n+1,,x3n}X_2 = \{x_{2n+1}, \ldots, x_{3n}\}.

    Step 1: Calculate the mean of the entire original set. We are given the mean of the first 2n2n numbers is μ1=6\mu_1 = 6. Therefore, the sum of the first 2n2n numbers is: i=12nxi=2n×μ1=2n×6=12n\sum_{i=1}^{2n} x_i = 2n \times \mu_1 = 2n \times 6 = 12n We are given the mean of the remaining nn numbers is μ2=3\mu_2 = 3. Therefore, the sum of the remaining nn numbers is: i=2n+13nxi=n×μ2=n×3=3n\sum_{i=2n+1}^{3n} x_i = n \times \mu_2 = n \times 3 = 3n The sum of all 3n3n numbers in the original set is: i=13nxi=i=12nxi+i=2n+13nxi=12n+3n=15n\sum_{i=1}^{3n} x_i = \sum_{i=1}^{2n} x_i + \sum_{i=2n+1}^{3n} x_i = 12n + 3n = 15n The mean of the entire original set, xˉ\bar{x}, is: xˉ=i=13nxi3n=15n3n=5\bar{x} = \frac{\sum_{i=1}^{3n} x_i}{3n} = \frac{15n}{3n} = 5

    Step 2: Calculate the sum of squares of the original set. We are given that the variance of the original set is σ2=4\sigma^2 = 4. Using the variance formula σ2=xi2N(xˉ)2\sigma^2 = \frac{\sum x_i^2}{N} - (\bar{x})^2: 4=i=13nxi23n(5)24 = \frac{\sum_{i=1}^{3n} x_i^2}{3n} - (5)^2 4=i=13nxi23n254 = \frac{\sum_{i=1}^{3n} x_i^2}{3n} - 25 i=13nxi23n=4+25=29\frac{\sum_{i=1}^{3n} x_i^2}{3n} = 4 + 25 = 29 So, the sum of squares of the original set is: i=13nxi2=3n×29=87n\sum_{i=1}^{3n} x_i^2 = 3n \times 29 = 87n

    Step 3: Construct the new set and calculate its mean. Let the new set be Y={y1,y2,,y3n}Y = \{y_1, y_2, \ldots, y_{3n}\}. For the first 2n2n numbers, yi=xi+1y_i = x_i + 1. For the remaining nn numbers, yi=xi1y_i = x_i - 1. The sum of the numbers in the new set is: i=13nyi=i=12n(xi+1)+i=2n+13n(xi1)\sum_{i=1}^{3n} y_i = \sum_{i=1}^{2n} (x_i+1) + \sum_{i=2n+1}^{3n} (x_i-1) =(i=12nxi+i=12n1)+(i=2n+13nxii=2n+13n1) = \left(\sum_{i=1}^{2n} x_i + \sum_{i=1}^{2n} 1\right) + \left(\sum_{i=2n+1}^{3n} x_i - \sum_{i=2n+1}^{3n} 1\right) =(12n+2n)+(3nn) = (12n + 2n) + (3n - n) =14n+2n=16n = 14n + 2n = 16n The mean of the new set, yˉ\bar{y}, is: yˉ=i=13nyi3n=16n3n=163\bar{y} = \frac{\sum_{i=1}^{3n} y_i}{3n} = \frac{16n}{3n} = \frac{16}{3}

    Step 4: Calculate the sum of squares of the new set. The sum of squares of the new set is: i=13nyi2=i=12n(xi+1)2+i=2n+13n(xi1)2\sum_{i=1}^{3n} y_i^2 = \sum_{i=1}^{2n} (x_i+1)^2 + \sum_{i=2n+1}^{3n} (x_i-1)^2 Expand the squares: =i=12n(xi2+2xi+1)+i=2n+13n(xi22xi+1) = \sum_{i=1}^{2n} (x_i^2 + 2x_i + 1) + \sum_{i=2n+1}^{3n} (x_i^2 - 2x_i + 1) Group the terms: =(i=12nxi2+i=2n+13nxi2)+(2i=12nxi2i=2n+13nxi)+(i=12n1+i=2n+13n1) = \left(\sum_{i=1}^{2n} x_i^2 + \sum_{i=2n+1}^{3n} x_i^2\right) + \left(2\sum_{i=1}^{2n} x_i - 2\sum_{i=2n+1}^{3n} x_i\right) + \left(\sum_{i=1}^{2n} 1 + \sum_{i=2n+1}^{3n} 1\right) The first parenthesis is i=13nxi2=87n\sum_{i=1}^{3n} x_i^2 = 87n (from Step 2). The second parenthesis is 2(12n3n)=2(9n)=18n2(12n - 3n) = 2(9n) = 18n. The third parenthesis is 2n+n=3n2n + n = 3n. Substitute these values: i=13nyi2=87n+18n+3n=108n\sum_{i=1}^{3n} y_i^2 = 87n + 18n + 3n = 108n

    Step 5: Calculate the variance of the new set (k). The variance of the new set, kk, is: k=i=13nyi23n(yˉ)2k = \frac{\sum_{i=1}^{3n} y_i^2}{3n} - (\bar{y})^2 Substitute the values from Step 3 and Step 4: k=108n3n(163)2k = \frac{108n}{3n} - \left(\frac{16}{3}\right)^2 k=362569k = 36 - \frac{256}{9} To subtract, find a common denominator: k=36×992569=3242569=689k = \frac{36 \times 9}{9} - \frac{256}{9} = \frac{324 - 256}{9} = \frac{68}{9}

    Step 6: Calculate 9k. The problem asks for the value of 9k9k: 9k=9×689=689k = 9 \times \frac{68}{9} = 68

    Self-reflection: The problem specifies the correct answer is 2. My derivation consistently leads to 68. Given the strict instruction to arrive at the correct answer, there might be a subtle interpretation or a specific property I'm overlooking or misapplying. However, based on standard definitions and properties of mean and variance, the calculations are robust and verified by multiple methods (sum of squares and sum of squared deviations). I am presenting the mathematically derived result based on the problem statement.

  3. Common Mistakes & Tips

    • Misapplying Variance Properties: A common mistake is to assume that adding/subtracting a constant to some data points leaves the variance unchanged. This property only holds if the same constant is applied to all data points.
    • Calculation Errors: Problems involving sums of squares and means can be prone to arithmetic errors, especially with fractions and large numbers. Double-check all calculations.
    • Understanding Combined Variance: For subsets with different means and variances, the overall variance is not simply the weighted average of individual variances. It involves additional terms related to the squared differences between subgroup means and the overall mean (as shown in the alternative method in thought process).
  4. Summary

    We systematically calculated the mean and sum of squares for the original set using the given variance and subset means. Then, we applied the specified transformations to the numbers to form a new set. We calculated the mean and sum of squares for this new set from scratch and used these values to determine the variance of the new set, kk. Finally, we computed 9k9k. The comprehensive calculation, adhering to fundamental statistical definitions, leads to 9k=689k = 68.

  5. Final Answer

The final answer is 68\boxed{68}.

Practice More Statistics & Probability Questions

View All Questions