What is a frequency distribution in statistics?+
A frequency distribution is a table that summarizes how often each value or range of values occurs in a dataset. It displays frequency (raw count), relative frequency (proportion), cumulative frequency (running total), and cumulative relative frequency (running proportion). It is the starting point for understanding the shape, center, and spread of any dataset.
How do you calculate relative frequency in a frequency distribution?+
Relative frequency = class frequency / total count (N). If 12 out of 50 observations fall in a class, the relative frequency is 12/50 = 0.24 or 24%. All relative frequencies in the table must sum to exactly 1.00 (100%). This calculator computes and displays relative frequencies automatically as percentages.
What is cumulative frequency and how do you find it?+
Cumulative frequency for a class is the sum of all frequencies from the first class up to and including the current class. For classes with frequencies 4, 7, 9, 5, the cumulative frequencies are 4, 11, 20, 25. The last cumulative frequency always equals N. Cumulative frequencies are used to find medians, quartiles, and percentile ranks from a table.
What is the difference between ungrouped and grouped frequency distribution?+
Ungrouped frequency distribution lists every unique value and its exact count. It suits discrete data with few distinct values (quiz scores 0-10, number of children). Grouped frequency distribution organizes data into class intervals (20-30, 30-40) and is better for continuous data (heights, incomes) or large datasets where most values appear only once.
How many classes should I use for a grouped frequency distribution?+
Sturges' Formula gives a starting point: k = 1 + 3.322 x log10(N). For N = 30 use about 6 classes; for N = 100 use about 7 to 8; for N = 1000 use about 11. Between 5 and 15 classes works for most practical datasets. Too few classes hides detail; too many creates mostly empty intervals. Adjust until the distribution shape is clear.
How is class width calculated for grouped frequency distribution?+
Class width = (maximum value - minimum value) / number of classes. For data ranging from 40 to 100 with 6 classes: width = (100 - 40) / 6 = 10. This calculator divides the range evenly among the chosen number of classes and places the maximum value in the last class to ensure all data is captured.
What is the midpoint of a class interval and why does it matter?+
The midpoint (class mark) is the average of the lower and upper class boundaries: midpoint = (lower + upper) / 2. For the class 60-70, the midpoint is 65. Midpoints are used to estimate the mean and variance from grouped data because you treat all observations in a class as if they equal the midpoint. This estimate is less precise than using raw data but is necessary when only the grouped table is available.
Can I find the mean from a frequency distribution table?+
Yes. For ungrouped data: mean = sum of (value x frequency) / N. For grouped data: mean = sum of (midpoint x frequency) / N. For example, if values 2, 3, 4 have frequencies 3, 5, 2, then mean = (2x3 + 3x5 + 4x2) / 10 = (6 + 15 + 8) / 10 = 2.9. This estimate uses midpoints for grouped data and is a weighted average.
What is the relationship between frequency distribution and probability?+
Relative frequency is an empirical estimate of probability. If value 5 appears 8 times in 40 observations, the relative frequency is 0.20, which estimates the probability of observing a 5. By the law of large numbers, as N increases, relative frequencies converge to true probabilities. A relative frequency distribution approximates the theoretical probability distribution of the population.
How do you find the mode from a frequency distribution?+
The mode is the value or class with the highest frequency. In an ungrouped table, look for the largest number in the frequency column. In a grouped table, the modal class is the class with the highest frequency. To estimate the exact mode within a modal class you can use the formula: mode = L + [(f1 - f0) / (2f1 - f0 - f2)] x w, where L is the lower boundary, f1 is the modal class frequency, f0 and f2 are adjacent class frequencies, and w is class width.
How do you find the median from a frequency distribution table?+
For ungrouped data, the median is the value where the cumulative frequency first reaches or exceeds N/2. For N = 20, find the class where cumulative frequency reaches 10. For grouped data, use linear interpolation: median = L + [(N/2 - CF) / f] x w, where L is the lower boundary of the median class, CF is the cumulative frequency before the median class, f is the median class frequency, and w is class width.
What is the difference between a frequency table and a frequency distribution?+
The terms are often used interchangeably. Strictly speaking, a frequency table is the raw count display, while a frequency distribution includes relative and cumulative columns that fully describe how the data is distributed. This calculator generates a complete frequency distribution: counts, proportions, running totals, and running proportions for each class or value.