Descriptive Statistics | Definitions, Types, Examples (2024)

Published on 4 November 2022 by Pritha Bhandari. Revised on 9 January 2023.

Descriptive statistics summarise and organise characteristics of a data set. A data set is a collection of responses or observations from a sample or entire population.

In quantitative research, after collecting data, the first step of statistical analysis is to describe characteristics of the responses, such as the average of one variable (e.g., age), or the relation between two variables (e.g., age and creativity).

The next step is inferential statistics, which help you decide whether your data confirms or refutes your hypothesis and whether it is generalisable to a larger population.

Table of contents

  1. Types of descriptive statistics
  2. Frequency distribution
  3. Measures of central tendency
  4. Measures of variability
  5. Univariate descriptive statistics
  6. Bivariate descriptive statistics
  7. Frequently asked questions

Types of descriptive statistics

There are 3 main types of descriptive statistics:

  • The distribution concerns the frequency of each value.
  • The central tendency concerns the averages of the values.
  • The variability or dispersion concerns how spread out the values are.

You can apply these to assess only one variable at a time, in univariate analysis, or to compare two or more, in bivariate and multivariate analysis.

You want to study the popularity of different leisure activities by gender. You distribute a survey and ask participants how many times they did each of the following in the past year:
  • Go to a library
  • Watch a movie at a theater
  • Visit a national park

Your data set is the collection of responses to the survey. Now you can use descriptive statistics to find out the overall frequency of each activity (distribution), the averages for each activity (central tendency), and the spread of responses for each activity (variability).

Frequency distribution

A data set is made up of a distribution of values, or scores. In tables or graphs, you can summarise the frequency of every possible value of a variable in numbers or percentages.

  • Simple frequency distribution table
  • Grouped frequency distribution table

For the variable of gender, you list all possible answers in the left-hand column. You count the number or percentage of responses for each answer and display it in the right-hand column.

Gender    Number
Male      182
Female    235
Other     27

From this table, you can see that more women than men or people with another gender identity took part in the study.
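
The counts in this table can also be expressed as percentages by dividing each count by the total number of responses. The following is a minimal Python sketch, assuming the three counts above are the complete set of answers; the variable names are illustrative only.

```python
# Counts taken from the gender table above.
counts = {"Male": 182, "Female": 235, "Other": 27}

# Total number of responses across all answers.
total = sum(counts.values())

# Share of each answer as a percentage, rounded to one decimal place.
percentages = {answer: round(100 * n / total, 1) for answer, n in counts.items()}

for answer, pct in percentages.items():
    print(f"{answer}: {counts[answer]} responses ({pct}%)")
```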

In a grouped frequency distribution, you can group numerical response values and add up the number of responses for each group. You can also convert each of these numbers to percentages.

Library visits in the past year    Percent
0–4                                6%
5–8                                20%
9–12                               42%
13–16                              24%
17+                                8%

From this table, you can see that most people visited the library between 5 and 16 times in the past year.
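
Grouping can be sketched in a few lines of Python. This example assumes the six example responses used in the worked calculations later in this article, so its percentages describe only those six values and will not match the full-sample table above. The `bins` layout and the `group_counts` helper are hypothetical names chosen for illustration.

```python
# First six survey responses (library visits in the past year).
visits = [15, 3, 12, 0, 24, 3]

# (label, lower bound, upper bound); None means there is no upper limit.
bins = [
    ("0–4", 0, 4),
    ("5–8", 5, 8),
    ("9–12", 9, 12),
    ("13–16", 13, 16),
    ("17+", 17, None),
]

def group_counts(values, bins):
    """Count how many values fall into each bin (bounds are inclusive)."""
    counts = {label: 0 for label, _, _ in bins}
    for v in values:
        for label, low, high in bins:
            if v >= low and (high is None or v <= high):
                counts[label] += 1
                break
    return counts

grouped = group_counts(visits, bins)

# Convert the group counts to percentages of all responses.
percent = {label: round(100 * n / len(visits), 1) for label, n in grouped.items()}
```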

Measures of central tendency

Measures of central tendency estimate the center, or average, of a data set. The mean, median and mode are 3 ways of finding the average.

Here we will demonstrate how to calculate the mean, median, and mode using the first 6 responses of our survey.

  • Mean
  • Median
  • Mode

The mean, or M, is the most commonly used method for finding the average.

To find the mean, simply add up all response values and divide the sum by the total number of responses. The total number of responses or observations is called N.

Mean number of library visits
Data set                     15, 3, 12, 0, 24, 3
Sum of all values            15 + 3 + 12 + 0 + 24 + 3 = 57
Total number of responses    N = 6
Mean                         Divide the sum of values by N to find M: 57/6 = 9.5
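
The same calculation can be written as a short Python sketch. The variable names are illustrative; `statistics.mean` from the standard library is used only as a cross-check.

```python
from statistics import mean

# First six survey responses (library visits in the past year).
visits = [15, 3, 12, 0, 24, 3]

n = len(visits)      # total number of responses, N
total = sum(visits)  # sum of all values
m = total / n        # mean, M

# The standard library computes the same value.
library_mean = mean(visits)
```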

The median is the value that’s exactly in the middle of a data set.

To find the median, order each response value from the smallest to the biggest. Then, the median is the number in the middle. If there are two numbers in the middle, find their mean.

Median number of library visits
Ordered data set    0, 3, 3, 12, 15, 24
Middle numbers      3, 12
Median              Find the mean of the two middle numbers: (3 + 12)/2 = 7.5
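
As a rough sketch, the procedure for the median could look like this in Python. The function name `median_of` is hypothetical; the standard library's `statistics.median` does the same job.

```python
def median_of(values):
    """Return the middle value, or the mean of the two middle values."""
    if not values:
        raise ValueError("median_of() needs at least one value")
    ordered = sorted(values)  # smallest to biggest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd number of values: a single middle value exists.
        return ordered[mid]
    # Even number of values: average the two middle values.
    return (ordered[mid - 1] + ordered[mid]) / 2

visits = [15, 3, 12, 0, 24, 3]
med = median_of(visits)
```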

The mode is simply the most popular or most frequent response value. A data set can have no mode, one mode, or more than one mode.

To find the mode, order your data set from lowest to highest and find the response that occurs most frequently.

Mode number of library visits
Ordered data set    0, 3, 3, 12, 15, 24
Mode                Find the most frequently occurring response: 3
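
Because a data set can have no mode, one mode, or several, a sketch that returns a list of modes may be more useful than one that returns a single number. The helper `modes` below is a hypothetical name, and treating "every value occurs once" as "no mode" is an assumption of this sketch; conventions differ between tools.

```python
from collections import Counter

def modes(values):
    """Return all most-frequent values, or [] if no value repeats."""
    if not values:
        return []
    counts = Counter(values)
    top = max(counts.values())
    if top == 1:
        # Every value occurs exactly once: treated here as "no mode".
        return []
    return sorted(v for v, c in counts.items() if c == top)

visits = [15, 3, 12, 0, 24, 3]
visit_modes = modes(visits)
```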

Measures of variability

Measures of variability give you a sense of how spread out the response values are. The range, standard deviation and variance each reflect different aspects of spread.

Range

The range gives you an idea of how far apart the most extreme response scores are. To find the range, simply subtract the lowest value from the highest value.

Ordered data set: 0, 3, 3, 12, 15, 24

Range: 24 – 0 = 24
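
In Python, the range of the example data could be computed as below (a minimal sketch; `value_range` is an illustrative name chosen to avoid shadowing the built-in `range`).

```python
visits = [15, 3, 12, 0, 24, 3]

# Highest value minus lowest value.
value_range = max(visits) - min(visits)
```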

Standard deviation

The standard deviation (s) is the average amount of variability in your dataset. It tells you, on average, how far each score lies from the mean. The larger the standard deviation, the more variable the data set is.

There are six steps for finding the standard deviation:

  1. List each score and find their mean.
  2. Subtract the mean from each score to get the deviation from the mean.
  3. Square each of these deviations.
  4. Add up all of the squared deviations.
  5. Divide the sum of the squared deviations by N – 1.
  6. Find the square root of the number you found.

In the table below, you complete Steps 1 through 4.

Raw data    Deviation from mean    Squared deviation
15          15 – 9.5 = 5.5         30.25
3           3 – 9.5 = -6.5         42.25
12          12 – 9.5 = 2.5         6.25
0           0 – 9.5 = -9.5         90.25
24          24 – 9.5 = 14.5        210.25
3           3 – 9.5 = -6.5         42.25
M = 9.5     Sum = 0                Sum of squares = 421.5

Step 5: 421.5/5 = 84.3

Step 6: √84.3 = 9.18

From learning that s = 9.18, you can say that on average, each score deviates from the mean by 9.18 points.
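
The six steps can be sketched directly in Python. The function name `sample_std` is hypothetical; it divides by N – 1, as in Step 5, which corresponds to `statistics.stdev` in the standard library.

```python
from math import sqrt

def sample_std(scores):
    """Sample standard deviation, following the six steps in order."""
    n = len(scores)
    if n < 2:
        raise ValueError("at least two scores are needed")
    m = sum(scores) / n                      # Step 1: find the mean
    deviations = [x - m for x in scores]     # Step 2: deviation from the mean
    squared = [d ** 2 for d in deviations]   # Step 3: square each deviation
    sum_of_squares = sum(squared)            # Step 4: add up squared deviations
    variance = sum_of_squares / (n - 1)      # Step 5: divide by N - 1
    return sqrt(variance)                    # Step 6: take the square root

visits = [15, 3, 12, 0, 24, 3]
s = sample_std(visits)
```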

Variance

The variance is the average of squared deviations from the mean. Variance reflects the degree of spread in the data set. The more spread the data, the larger the variance is in relation to the mean.

To find the variance, simply square the standard deviation. The symbol for variance is s².

Data set: 15, 3, 12, 0, 24, 3

s = 9.18

s² = 84.3
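
The relationship between the two measures can be checked with Python's standard library. This is a small sketch: `statistics.variance` and `statistics.stdev` both use the N – 1 denominator, matching the steps above.

```python
from statistics import stdev, variance

visits = [15, 3, 12, 0, 24, 3]

s2 = variance(visits)  # sample variance
s = stdev(visits)      # sample standard deviation

# Squaring the standard deviation recovers the variance
# (up to floating-point rounding).
difference = abs(s ** 2 - s2)
```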

Univariate descriptive statistics

Univariate descriptive statistics focus on only one variable at a time. It’s important to examine data from each variable separately using multiple measures of distribution, central tendency and spread. Programs like SPSS and Excel can be used to easily calculate these.

Visits to the library
N                     6
Mean                  9.5
Median                7.5
Mode                  3
Standard deviation    9.18
Variance              84.3
Range                 24
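
As an alternative to SPSS or Excel, a summary like the one above could be produced with Python's standard `statistics` module. This is only a sketch: the `summary` dictionary and its keys are illustrative, and `statistics.mode` returns the first most frequent value if there are ties.

```python
import statistics as st

visits = [15, 3, 12, 0, 24, 3]

# One univariate summary, mirroring the rows of the table above.
summary = {
    "N": len(visits),
    "Mean": st.mean(visits),
    "Median": st.median(visits),
    "Mode": st.mode(visits),
    "Standard deviation": round(st.stdev(visits), 2),
    "Variance": round(st.variance(visits), 2),
    "Range": max(visits) - min(visits),
}

for name, value in summary.items():
    print(f"{name}: {value}")
```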

If you only consider the mean as a measure of central tendency, your impression of the ‘middle’ of the data set can be skewed by outliers. The median and mode are much less affected by extreme values.
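
The effect of an outlier can be illustrated with a small Python sketch. The second list is hypothetical: it replaces the response 24 with an invented extreme value of 240 purely to show how the mean and median react.

```python
from statistics import mean, median

visits = [15, 3, 12, 0, 24, 3]

# Hypothetical variant: one response replaced by an extreme value.
with_outlier = [15, 3, 12, 0, 240, 3]

mean_before, mean_after = mean(visits), mean(with_outlier)
median_before, median_after = median(visits), median(with_outlier)

# The mean shifts substantially; the median does not move at all.
print(f"Mean:   {mean_before} -> {mean_after}")
print(f"Median: {median_before} -> {median_after}")
```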

Likewise, because the range is sensitive to extreme values, you should also consider the standard deviation and variance to get easily comparable measures of spread.

Bivariate descriptive statistics

If you’ve collected data on more than one variable, you can use bivariate or multivariate descriptive statistics to explore whether there are relationships between them.

In bivariate analysis, you simultaneously study the frequency and variability of two variables to see if they vary together. You can also compare the central tendency of the two variables before performing further statistical tests.

Multivariate analysis is the same as bivariate analysis but with more than two variables.

Contingency table

In a contingency table, each cell represents the intersection of two variables. Usually, an independent variable (e.g., gender) appears along the vertical axis and a dependent one appears along the horizontal axis (e.g., activities). You read ‘across’ the table to see how the independent and dependent variables relate to each other.

Number of visits to the library in the past year
Group       0–4    5–8    9–12    13–16    17+
Children    32     68     37      23       22
Adults      36     48     43      83       25
Interpreting a contingency table is easier when the raw data is converted to percentages. Percentages make each row comparable to the other by making it seem as if each group had only 100 observations or participants. When creating a percentage-based contingency table, you add the N for each independent variable on the end.

Visits to the library in the past year (Percentages)
Group       0–4    5–8    9–12    13–16    17+    N
Children    18%    37%    20%     13%      12%    182
Adults      15%    20%    18%     35%      11%    235
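
The conversion from raw counts to row percentages can be sketched in Python as follows. The counts come from the raw contingency table above; the `row_percentages` helper and the dictionary layout are hypothetical choices for illustration.

```python
# Column labels and raw counts from the contingency table.
bins = ["0–4", "5–8", "9–12", "13–16", "17+"]
counts = {
    "Children": [32, 68, 37, 23, 22],
    "Adults": [36, 48, 43, 83, 25],
}

def row_percentages(row):
    """Return (percentages rounded to whole numbers, row total N)."""
    n = sum(row)
    return [round(100 * c / n) for c in row], n

# Map each group to its row percentages and its N.
percent_table = {group: row_percentages(row) for group, row in counts.items()}

for group, (percents, n) in percent_table.items():
    cells = ", ".join(f"{b}: {p}%" for b, p in zip(bins, percents))
    print(f"{group} (N = {n}): {cells}")
```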

From this table, it is clearer that similar proportions of children and adults go to the library 17 or more times a year. Additionally, children most commonly went to the library between 5 and 8 times, while for adults, this number was between 13 and 16.

Scatter plots

A scatter plot is a chart that shows you the relationship between two or three variables. It’s a visual representation of the strength of a relationship.

In a scatter plot, you plot one variable along the x-axis and another one along the y-axis. Each data point is represented by a point in the chart.

You investigate whether people who visit the library more tend to watch a movie at a theater less. You plot the number of times participants watched movies at a theater along the x-axis and visits to the library along the y-axis.

From your scatter plot, you see that as the number of movies seen at movie theaters increases, the number of visits to the library decreases. Based on your visual assessment of a possible linear relationship, you perform further tests of correlation and regression.
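
A follow-up correlation check might look like the sketch below. The article does not give the underlying data, so the two lists are invented, hypothetical values used only to show the calculation; `pearson_r` is an illustrative helper that computes the Pearson correlation coefficient by hand.

```python
from math import sqrt

# HYPOTHETICAL data, not from the survey: one (x, y) pair per participant.
movies = [0, 2, 4, 6, 8, 10]      # x-axis: movies watched at a theater
library = [20, 17, 15, 10, 8, 3]  # y-axis: visits to the library

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a variable is constant")
    return sxy / sqrt(sxx * syy)

r = pearson_r(movies, library)
print(f"r = {r:.2f}")  # negative for a downward-sloping pattern
```

A value of r close to -1 would be consistent with the downward pattern described above, but a formal test is still needed before drawing conclusions.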

Frequently asked questions

What’s the difference between descriptive and inferential statistics?

Descriptive statistics summarise the characteristics of a data set. Inferential statistics allow you to test a hypothesis or assess whether your data is generalisable to the broader population.

What are the 3 main types of descriptive statistics?

The 3 main types of descriptive statistics concern the frequency distribution, central tendency, and variability of a dataset.

  • Distribution refers to the frequencies of different responses.
  • Measures of central tendency give you the average of the responses for each variable.
  • Measures of variability show you the spread or dispersion of your dataset.

What’s the difference between univariate, bivariate and multivariate descriptive statistics?

  • Univariate statistics summarise only one variable at a time.
  • Bivariate statistics compare two variables.
  • Multivariate statistics compare more than two variables.

Cite this Scribbr article

Bhandari, P. (2023, January 09). Descriptive Statistics | Definitions, Types, Examples. Scribbr. Retrieved 12 August 2024, from https://www.scribbr.co.uk/stats/descriptive-statistics-explained/

Pritha Bhandari

Pritha has an academic background in English, psychology and cognitive neuroscience. As an interdisciplinary researcher, she enjoys writing articles explaining tricky research concepts for students and academics.
