## Table of Contents

- What is Bivariate Data?
- Bivariate Relationship
- What is a Bivariate Analysis?
- Types of Analysis for Bivariate Data
- Lesson Summary

Learn what bivariate data is. See a comparison of univariate vs. bivariate. Understand what a bivariate analysis is.
Updated: 12/13/2021

- What is Bivariate Data?
- Bivariate Relationship
- What is a Bivariate Analysis?
- Types of Analysis for Bivariate Data
- Lesson Summary

In a scientific study, researchers collect information about their variables of interest in the form of data. The data that is collected can be *univariate* or *bivariate*, depending on the nature of the **study**.

- Univariate data:
*Uni*means one and*variate*is another word for a variable. This refers to an instance in which a*single variable*is examined or described. For example, a researcher may measure the amount of time it takes people to complete a crossword puzzle as the sole variable of interest. - Bivariate data:
*Bi*means two. Therefore,**bivariate data**involves studying and comparing*two separate variables*. For example, a researcher may record how long it takes people to complete a crossword puzzle while measuring the stress levels of the participants. In this example, the two variables that the researcher is examining are time and stress.

As previously mentioned, univariate data involves collecting information about a single variable. Here are more examples of univariate data:

- Recording gender as students enter a room.
- Recording age as voters enter a polling booth.
- Asking participants to report their favorite type of music.

Univariate data is often used to understand the characteristics of a population, such as central tendency and variability. It can also be used to describe the basic findings of an experiment, such as reporting the **mean** and **standard deviation** for individual variables.

Univariate **data analysis** is not used to compare the relationship between different variables. To do so, researchers use *bivariate data* analyses.

**Bivariate data** is used to look for relationships between variables. One of the main purposes of determining whether or not two variables are related is to see if one variable causes the other. A causal link between two variables is typically found by determining if changes in one variable are caused by changes in another. This type of research involves two basic types of variables:

- Independent variable: The variable that a researcher manipulates in an experiment. The researcher predicts that changing the independent variable causes a meaningful change in another variable.
- Dependent variable: The variable that a researcher predicts will change by manipulating the independent variable. The important thing to note is that the researcher does not directly change this variable; it is changed or controlled by an outside factor (often the independent variable).

For example, a researcher is interested in whether or not students' attitudes toward a statistics class change based on the time of the class. To answer this question, the research measures students' attitudes from a morning class and an evening class. The researcher finds that students in the evening class have a more positive attitude toward statistics than students in the morning class.

What are the independent and dependent variables in the previous example? The *independent variable* is the time of the class because it is the variable that was manipulated by the researcher. Students' attitudes toward statistics are the *dependent variable*, or the variable that the researcher predicted might change in relation to the independent variable.

Now, take a moment to look at a few bivariate data examples.

Figure 1 shows the results of a study that measured anxiety levels and feelings of loneliness.

Figure 2 shows the results of a study that compared the amount of time students spent studying and their exam grades.

Figure 3 is a graphical representation of bivariate data. It shows the relationship between age and vocabulary scores in a sample of preschoolers. We will return to the idea of graphing bivariate data later.

Data analysis is the process of organizing, describing, and evaluating data in order to look for information that might help researchers make conclusions about various phenomena. It is used to help determine whether a particular research finding represents a real-world quality or some type of statistical fluke or error.

There are different **bivariate data analyses**.

A scatter plot is a graph used to show the values associated with two variables. As an example, take a moment to look at the graph from a previous example:

In this particular example, we have already identified age as the independent variable and vocabulary as the dependent variable. Notice that the independent is on the horizontal axis (i.e., the x-axis) and the dependent variable is on the vertical axis (i.e., the y-axis). This is a rule-of-thumb you should keep in mind when making and interpreting these types of graphs.

Take a moment to examine the following example for how to create a scatter plot.

Imagine you are interested in whether or not running influences how much people sleep. To answer this question, you have people run as many laps as they can and then record how many hours they sleep the following night. You obtained the following data:

Let's graph this data on a scatter plot one data point at a time. Each participant will be represented on the graph in accord to where X (i.e., laps ran) and Y (i.e., hours slept) intercept. First, plot the point where X=2 and Y=4 meet on the graph.

Then, plot the point where X=3 and Y=5 meet on the graph.

Do the same for where X=4 and Y=6 meet on the graph.

Finally, plot the point where X=5 and Y=7 meet on the graph.

A correlation analysis is a statistical test used to determine whether or not two variables are related. It does this by comparing scores on two different measures to tell if changes in one influence changes in the other. Variables can be correlated in one of two ways:

- A positive correlation is a relationship in which two variables
*move in the same direction*. For example, scores from the ACT are positively correlated with college GPA. In other words, students who have*higher*ACT scores also tend to have a*higher*college GPA. - A negative correlation is a relationship in which two variables
*move in the opposite direction*. In other words, a negative correlation occurs when*high*scores on one scale are accompanied by*low*scores on another scale (or vice versa). For example, a researcher finds that students who miss class*more often*score*lower*on final exams. As absences go up, final exam scores go down.

A very common saying in statistics and research methodology courses is *correlation does not equal causation*. Correlation analysis can only indicate whether or not two variables are related. To understand why, consider the two previous examples:

- Scoring high on the ACT does not cause a higher GPA. There has to be some other explanatory variable(s) that links ACT scores and GPA, such as intelligence or conscientiousness.
- Missing class does not cause one to score lower on a final exam. As in the previous example, there are other factors linking these variables. Students may not be as likely to ask a teacher to clarify a confusing concept if they do not attend class. Or, a student might miss class due to poor health, which may also hinder their test performance.

A scatter plot is often used to graph correlational data. For example:

Figure 10 is an example of a positive correlation. You can tell by the general direction of the data. Scores on the y-axis increase as scores on the x-axis increase. This creates a slope that is moving upwards.

Figure 11 is an example of a negative correlation. Notice the general direction of the data in this graph. Scores on the y-axis decrease as scores on the x-axis increase. This creates a slope that is moving downward.

Figure 12 is an example of what a scatterplot looks like when two variables are not correlated. Notice that there is no discernable increasing or decreasing pattern in the data. The data looks randomly scattered on the graph.

Regression analysis is a statistical test used to determine if one variable can predict the outcome of another variable. It does this by determining whether or not scores from one type of measure reliably correspond to scores from another measure. For example, scores from various standardized tests (e.g., ACT) can be used to predict other measures of academic achievement (e.g., GPA). In fact, this is one reason why various academic programs often require students to take one of these tests prior to admission. Scoring high enough on these tests is considered an indicator that a student can succeed in an academic setting.

Researchers collect data as a way to learn more about various real-world phenomena. **Data analysis** often involves univariate and bivariate analyses depending on the goal of the researcher. Univariate data involves analyzing a single variable and is often used to describe samples and populations. For example, researchers often report the **mean** and **standard deviation** for individual variables as a general description of the participants in a **study**. **Bivariate data** involves analyzing two separate variables and is used to determine whether or not two variables are related.

There are different methods of **bivariate data analysis**. A scatter plot is a graphical representation of bivariate data that illustrates the relationship between two variables on an x and y-axis. A correlation analysis is used to determine whether or not two variables are associated. Scatter plots and correlational analyses are often used in conjunction to interpret whether a correlation is positive or negative. Regression analysis is when one variable is used to predict the outcome of another variable.

To unlock this lesson you must be a Study.com Member.

Create your account

Frequently Asked Questions

"Bi" means two and "variate" is another word for a variable. So, bivariate refers to a statistical analysis that involves the comparison of two separate variables.

Bivariate data analysis is a statistical test that involves two separate variables. It is used to determine whether or not two variables are related.

Bivariate data can be used to determine whether or not two variables are related. If a relationship between variables is established, bivariate data is used to determine if there is a causal link between those variables.

An example of bivariate data is data collected from a study that compares levels of anxiety to the number of times participants pace back and forth in a room.

Are you a student or a teacher?

Already a member? Log In

BackWhat teachers are saying about Study.com

Already registered? Log in here for access

Related Study Materials

Browse by subject