Types of Data
Data in statistics can be broadly classified into two types: categorical and quantitative. Categorical data represents characteristics or attributes, often expressed as percentages or proportions (e.g., eye color or grade level). Quantitative data, on the other hand, consists of numerical values that can be averaged or measured (e.g., height or test scores).
Bivariate Categorical Data
When studying relationships, we often collect data on two variables simultaneously. For instance, we might look at “grade level” (e.g., junior or senior) and “homework completion” (on time or not). This type of data, involving two categorical variables, is called bivariate categorical data.
To explore relationships between these variables, we can use several visualization tools:
- Histogram: A bar chart displaying the frequency of occurrences for each category combination. For example, it can show how many juniors and seniors complete homework on time versus those who don’t.

- Frequency Chart: Similar to a histogram but shows percentages instead of counts, making it easier to compare proportions across categories, like the percentage of students in each grade who finish homework on time.

- Mosaic Plot: A visual tool that uses rectangles to represent the proportions of data in each category combination. The size of each rectangle reflects the frequency or proportion, helping to highlight relationships between the two variables.

Bivariate Quantitative Data
When both variables are numerical, such as plant height and amount of fertilizer used, we’re dealing with bivariate quantitative data. The goal is often to determine if there’s a correlation, like whether more fertilizer leads to taller plants.
A primary tool for visualizing this data is the
scatterplot. In a scatterplot, one variable is plotted on the x-axis and the other on the y-axis, allowing us to observe patterns or trends. For example, plotting fertilizer amounts against plant heights can reveal whether increased fertilizer correlates with taller plants (positive relationship) or shorter plants (negative relationship).

Beyond visualization, statistical methods like correlation analysis can quantify the strength and direction of the relationship, helping determine if it’s significant and whether one variable can predict the other.
Why Study Variable Relationships?
Analyzing relationships between variables helps us understand if and how they influence each other. This can enable predictions—for example, knowing one variable’s value might help predict the other. Equally important, discovering that two variables are
not related can be a significant finding, allowing us to rule out certain factors in a study.
Relationships between variables can be:
- Positive: As one variable increases, the other tends to increase (e.g., more study time, higher grades).
- Negative: As one variable increases, the other tends to decrease (e.g., more screen time, less sleep).
- No Relationship: No consistent pattern or trend exists between the variables.
Finding no relationship is just as valuable as finding one, as it helps narrow down factors affecting an outcome.
Question for Chapter Notes: Introducing Statistics: Are Variables Related?
Try yourself:
What type of data involves two categorical variables?Explanation
The type of data that involves two categorical variables is called bivariate categorical data.This means we are looking at two characteristics at the same time, like grade level and homework completion status.
- Bivariate categorical data: Involves two categorical variables.
- Quantitative data: Consists of numerical values.
- Categorical data: Represents characteristics or attributes.
- Bivariate quantitative data: Involves two numerical variables.
Report a problem
Key Terms to Review
- Bivariate Quantitative Data: Data involving two numerical variables, analyzed using scatterplots, correlation coefficients, or regression to uncover trends and relationships.
- Bivariate Categorical Data: Data involving two categorical variables, often displayed in contingency tables to show frequency or relationships between categories.
- Categorical Data: Variables grouped into distinct categories (e.g., gender, car type), used to analyze proportions and frequencies across groups.
- Frequency Chart: A visual tool showing the count or percentage of occurrences in each category, aiding in pattern identification.
- Histogram: A bar chart showing the frequency of data within intervals, useful for understanding the distribution of quantitative or categorical data.
- Mosaic Plot: A plot using rectangles to show proportions of categorical data, emphasizing relationships between two variables.
- Quantitative Data: Numerical data that can be measured or counted, used for statistical analysis and presented in charts or graphs.