Question
Consider the following statements: (a) Mode can be computed from histogram (b) Median is not independent of change of scale (c) Variance is independent of change of origin and scale. Which of these is/are correct?
Options
Solution
Key Concepts and Formulas
- Measures of Central Tendency: These describe the center of a dataset. Examples include Mean, Median, and Mode.
- Mode: The value that appears most frequently in a dataset. For grouped data, it's estimated from the class with the highest frequency (modal class). The formula for grouped data is: where is the lower limit of the modal class, is the class width, is the frequency of the modal class, is the frequency of the preceding class, and is the frequency of the succeeding class.
- Median: The middle value of an ordered dataset.
- Measures of Dispersion: These describe the spread or variability of a dataset.
- Variance (): The average of the squared differences from the mean. For a dataset with mean , the variance is .
- Effect of Linear Transformations ():
- If is a measure of central tendency (Mean, Median, Mode) for dataset , then for , the new measure .
- If is the variance for dataset , then for , the new variance . Consequently, the standard deviation .
- Change of Origin: Adding/subtracting a constant ().
- Change of Scale: Multiplying by a constant ().
Step-by-Step Solution
Step 1: Analyze Statement (a) - Mode can be computed from histogram.
-
What we are doing: We are determining if a histogram provides sufficient information to calculate the mode.
-
Why: The mode is a fundamental measure of central tendency, and a histogram is a primary tool for visualizing frequency distributions.
-
Explanation:
- A histogram graphically represents the frequency distribution of data. The height of each bar corresponds to the frequency of the respective class interval.
- For ungrouped data (where each bar represents a single value), the mode is simply the value corresponding to the tallest bar. Its exact value can be read directly.
- For grouped continuous data, we cannot find the exact mode directly. However, the histogram clearly identifies the modal class, which is the class interval with the highest frequency (the tallest bar).
- The mode for grouped data is estimated using the formula: All the components of this formula—the lower limit of the modal class (), the class width (), the frequency of the modal class (), and the frequencies of the classes preceding () and succeeding () the modal class—can be directly read or derived from the histogram.
- Since all necessary values are available from the histogram, the mode (or its estimate for grouped data) can indeed be computed.
-
Conclusion: Statement (a) is correct.
Step 2: Analyze Statement (b) - Median is not independent of change of scale.
-
What we are doing: We are examining how the median responds to a change in scale (multiplication by a constant).
-
Why: Understanding the effect of transformations on statistical measures is crucial.
-
Explanation:
- The median is the middle value of an ordered dataset. It is a measure of central tendency.
- Consider a dataset with median . If we apply a linear transformation to create a new dataset , the new median is related to by the same transformation: .
- A change of scale specifically refers to multiplying each data point by a constant (i.e., ). So, .
- In this case, the new median becomes .
- If , then . This means the median changes when the scale changes. For example, if , . If we apply a scale change by , , then . Clearly, .
- Since the median's value changes with the scale factor (if ), it is not independent of a change of scale.
-
Conclusion: Statement (b) is correct.
Step 3: Analyze Statement (c) - Variance is independent of change of origin and scale.
-
What we are doing: We are investigating how variance is affected by both change of origin (addition/subtraction) and change of scale (multiplication).
-
Why: Variance is a measure of dispersion, and its behavior under transformations is different from measures of central tendency.
-
Explanation:
- Variance () measures the spread of data points around the mean.
- Effect of Change of Origin ():
- Let with mean and variance .
- Let where .
- The new mean will be .
- The deviation of each new data point from the new mean is:
- Since the deviations remain unchanged, their squares and sum of squares also remain unchanged.
- Therefore, the new variance .
- Variance is independent of a change of origin.
- Effect of Change of Scale ():
- Let where .
- The new mean will be .
- The deviation of each new data point from the new mean is:
- Squaring this deviation:
- The new variance .
- Variance is not independent of a change of scale; it is multiplied by the square of the scale factor ().
- Since the statement claims variance is independent of both change of origin and scale, but it is demonstrably not independent of change of scale, the statement is incorrect.
-
Conclusion: Statement (c) is incorrect.
Common Mistakes & Tips
- Distinguish Central Tendency vs. Dispersion: Measures of central tendency (mean, median, mode) transform linearly (), while measures of dispersion (variance, standard deviation) are affected differently. Variance is affected by , and standard deviation by .
- Variance and Scale Factor: Always remember the factor for variance when dealing with a change of scale. This is a very common point of error.
- "Computed" for Grouped Data: For grouped data, "computed" for mode often implies estimation using a specific formula, which relies entirely on information from the histogram.
Summary
We have systematically analyzed each statement. Statement (a) is correct because a histogram provides all necessary information to compute (or estimate) the mode using the relevant formula for grouped or ungrouped data. Statement (b) is correct because the median changes proportionally with a change of scale (multiplication by a constant), meaning it is not independent of scale. Statement (c) is incorrect because while variance is independent of change of origin (addition/subtraction), it is not independent of change of scale; it gets multiplied by the square of the scale factor. Therefore, only statements (a) and (b) are correct.
The final answer is , which corresponds to option (C).