Inferential Statistics – A Guide With Definition & Examples
  • Oct 2022
  • 0

Inferential Statistics – A Guide With Definition & Examples

29th October 2022

You seek to make conclusions beyond the immediate facts by using inferential statistics. Inferential statistics, for example, are used to infer what the public would believe based on sample data.

Alternatively, we use inferential statistics to establish whether the difference between groups seen in this study is reliable or random chance.

 

Inferential Statistics – In a Nutshell

  • Inferential statistics aid in the conclusion of an entire population based on a sample of that population.

  • Inferential statistics examine a sample to conclude the population, whereas descriptive statistics describe the characteristics of a known dataset.

  • There are methods in inferential statistics for verifying and validating our conclusions from an experiment that incorporates hypothesis testing.

  • There are three types of hypothesis tests:

  • Regression analysis (Simple linear, Multiple linear)

  • Tests of comparability (T-test, ANOVA)

  • Correlation tests (Chi-square, Pearson's) are used to examine various variables and parameters.

 

Definition: Inferential statistics

Inferential statistics is a branch of statistics that collects and analyses data using a probabilistic approach.

It enables us to draw inferences and make references about a population based on a sample and apply them to a wider population.

There are numerous forms of inferential statistics, each of which is appropriate for a specific research strategy and sample characteristics.

It is used to compare two models to determine whether one is statistically more significant than the other.

The following statements are clear examples of inferential statistics:

  • Based on a survey, the mean weekly hours spent on gaming consoles by teenagers in the United Kingdom is 9.00 hours.

  • In 2025, the city b’s population will be 2.5 million.

 

Descriptive statistics vs. inferential statistics

Descriptive statistics use bar graphs, histograms, or pie charts to organize, summarize, and display the features of a data collection.

They use measures of central tendency such as mean, median, and mode as tools, as well as measures of dispersion and variability such as range, variance, and standard deviation.

Inferential statistics enable us to evaluate hypotheses and determine whether data is generalizable to a larger population.

Sample data is also used to create inferences and draw conclusions about persons, with the results expressed as probabilities.

 

Inferential statistics: Hypothesis testing

Hypothesis testing is a statistical analysis tool that uses inferential statistics. The goal is to use samples to compare populations across factors.

It consists of the following steps:

 

1. Determine the null and alternative Hypotheses

The null hypothesis (Ho) asserts that the population's value is considered to be true. The null hypothesis is contradicted by the alternative hypothesis (H1).

It is an educated guess of all eventualities that are not covered by the null hypothesis.

 

2. Selecting significance level

The criterion by which we judge whether the claim under consideration is true or false.

 

3. Determine the rejection region

These are the test statistic values for which the null hypothesis is rejected.

The samples are compared, and two decisions are made based on the level of significance. These are some examples:

  • Rejecting the null hypothesis: When the null hypothesis is true, the sample average has a low chance of occurrence if the probability of receiving a sample is less than 5%.

  • Failure to reject the null hypothesis: When the null hypothesis is true, the sample average has a high likelihood of occurring if the probability of attaining a sample mean is larger than 5%.

Inferential statistical tests are used to test hypotheses, which can be parametric (ANOVA, T-test), which is based on assumptions about the population distribution from which the sample is drawn, or non-parametric (Spearman's correlation), which is not dependent on an assumption.

 

4. Comparison test

This inferential statistics test determines whether there are discrepancies in the means, medians, or rankings of two or more groups' scores.

Comparison test

Parametric

What’s being compared?

Samples

t-test

?

Means

2 samples

ANOVA

?

Means

3+ samples

Mood’s median

?

Medians

2+ samples

Wilcoxon signed-rank

?

Distributions

2 samples

Wilcoxon rank-sum (Mann-Whitney U)

?

Sums of rankings

2 samples

Kruskal-Wallis H

?

Mean rankings

3+ samples

5. Correlation test

These inferential statistics tests examine how closely two variables are related.

Correlation test

Parametric?

Variables

Pearson’s r

?

Interval/ratio variables

Spearman’s r

?

Ordinal/interval/ratio variables which

Chi square tests of independence

?

Nominal/ordinal variables

6. Regression analysis

These inferential statistics tests show whether changes in predictor factors lead to changes in outcome variables.

Regression test

Predictor

Outcome

Simple linear regression

1 interval/ratio variable

1 interval/ratio variable

Multiple linear regression

2+ interval/ratio variable(s)

1 interval/ratio variable

Logistic regression

1+ any variable(s)

1 binary variable

Nominal regression

1+ any variable(s)

1 nominal variable

Ordinal regression

1+ any variable(s)

1 ordinal variable

Inferential statistics exampleT-test formula

The t-test value can be calculated with the following formula: 

Example:

After new sales training is given to employees, the mean sale goes up to £50 (a sample of 25 employees examined) with a standard deviation of £12. Before the training, the average sale was £100. Check if the training helped at α = 0.05.

Solution: The t-test in inferential statistics solves this problem with the formula:

x = 150, μ = 100, s= 12, n = 25

H0: μ=100

H1: μ=100

= 20.83

The degree of freedom is given by 25 – 1 = 24. Using the t table at α = 0.05, the critical value is T(0.05, 24) = 1.71. As 20.83 > 1.71 thus, H0 is rejected. The conclusion is that the training helped in increasing the average sales.

0 Comments


LOAD MORE COMMENTS

Leave Your Comment Here