What is a frequency polygon?+
A frequency polygon is a line graph used to represent a frequency distribution. It is created by plotting points at the midpoint of each class interval (x-axis) against the corresponding frequency (y-axis), then connecting consecutive points with straight lines. The polygon extends to the x-axis one class-width before the first point and one class-width after the last point, forming a closed shape. It is commonly used in statistics to visualize the shape of a distribution and compare multiple datasets.
How is a frequency polygon different from a histogram?+
A histogram uses filled rectangular bars to show frequency for each class interval. A frequency polygon uses a line connecting midpoints of those bars. Both convey the same information, but frequency polygons are preferred when overlaying two or more distributions on the same graph because lines do not block each other the way bars do. Frequency polygons also more clearly show the overall shape and trend of the distribution.
How do you find the midpoint of a class interval?+
Midpoint = (lower class limit + upper class limit) / 2. For the class 20 to 30, midpoint = (20+30)/2 = 25. For 100 to 120, midpoint = (100+120)/2 = 110. Midpoints are the representative x-values plotted on the frequency polygon. They are also used in the mean and standard deviation formulas for grouped data.
What is the formula for the mean from grouped data?+
Mean = sum(f_i * m_i) / sum(f_i), where m_i is the midpoint of each class and f_i is the frequency. This is an approximation because the midpoint assumes all observations in a class equal the midpoint, which is rarely exactly true. For midpoints 10, 20, 30, 40, 50 with frequencies 5, 12, 18, 10, 5: Mean = 1480/50 = 29.6.
What is the standard deviation for grouped data?+
Population SD from grouped data = sqrt[sum(f_i * (m_i - mean)^2) / N], where N = total frequency. This calculator uses the population (not sample) formula since grouped data is typically a complete tabulated distribution rather than a sample. For the exam score example with midpoints 10-50 and frequencies 5,12,18,10,5: variance = 123.84, SD = 11.13.
What is the modal class in a frequency polygon?+
The modal class is the class interval with the highest frequency. Its midpoint is the mode estimate for the grouped data. On the polygon, it appears as the highest peak. For midpoints 10, 20, 30, 40, 50 with frequencies 5, 12, 18, 10, 5, the modal class midpoint is 30. When two classes tie for highest frequency, the data is bimodal.
What is the median class in a frequency polygon?+
The median class is the class where the cumulative frequency first reaches or exceeds N/2. For N = 50, you look for the class where the running total of frequencies reaches 25. If cumulative frequencies are 5, 17, 35, the third class (cumF = 35) is the median class. Its midpoint is the grouped median estimate. For a more precise median, use interpolation within the class interval.
How many classes should a frequency polygon have?+
Between 5 and 15 classes typically works best. Sturges' rule suggests k = 1 + 3.322 * log10(n) for n observations. For n = 50, k = 1 + 3.322 * log10(50) = about 7 classes. Too few classes (3 or 4) hide important shape features. Too many classes (20 or more) produce a jagged polygon with many zero-frequency spikes.
What does the shape of a frequency polygon tell you?+
A symmetric bell shape suggests a roughly normal distribution with mean near the center. A right-skewed shape (long tail to the right) means most values are low with a few very high outliers, common in income data. A left-skewed shape means most values are high with a few very low outliers. A flat shape indicates roughly uniform data. A bimodal shape with two peaks suggests two subgroups in the data.
Can I compare two frequency polygons on the same chart?+
Yes, and this is one of the main advantages of frequency polygons over histograms. By drawing two or more polygons in different colors or line styles on the same axes, you can visually compare the distributions of two groups, such as test scores before and after an intervention, or sales performance in two regions. Make sure both polygons use the same class intervals for a fair comparison.
What is the difference between a frequency polygon and an ogive?+
A frequency polygon plots frequency (count per class) against midpoints. An ogive (cumulative frequency polygon) plots cumulative frequency against the upper class boundary. The frequency polygon shows where data is most concentrated (peaks at modes). The ogive always rises from zero to N and is used to estimate percentiles by reading off the y-axis at any x value.
Why does the frequency polygon extend to zero on both ends?+
The polygon is extended to the x-axis one class width before the first midpoint and one class width after the last midpoint, giving it anchor points at frequency zero on both sides. This creates a closed, well-defined geometric shape (the polygon) rather than a floating line. The enclosed area represents the total frequency. This convention also makes it easier to compare the shapes of two polygons drawn on the same axes.