Statistics and Probability (Part - 1) | Quantitative for GMAT

Table of contents
Statistics
Statistical Data
Inclusive and Exclusive distributions
Frequency Distribution Table
Cumulative Frequency Table
Graphical Representation of Data
Cumulative frequency curve or ogive
Measures of Central Tendency

Statistics

Fundamental Characteristics of Statistics

Statistics have the following important characteristics:

Statistics are aggregates of facts, not single observations.
Statistics are expressed quantitatively.
Within an investigation, statistics are related to each other and comparable, and they can be classified into various groups.
Statistics are collected for a pre-determined purpose.
In the collection of statistics, a reasonable standard of accuracy must be maintained.

Limitations of Statistics

Statistics have the following limitations:

Statistics is not suited to the study of qualitative phenomena such as honesty, intelligence or poverty.
Statistics deals with groups and does not study individuals.
The laws of statistics are not exact; they hold only on average.
Data collected for a definite purpose may not be suitable for another purpose.

Statistical Data

Statistical data are the facts which are collected for the purpose of investigation. There are two types of statistical data:

Primary data: Data collected by an investigator for the first time, for his or her own purpose, are called primary data. Because primary data are collected by the person who will use them, they are more reliable and relevant.
Secondary data: Data collected by another source and then used by the investigator for his or her own purpose are called secondary data. For example, the score of a cricket match taken from a newspaper is secondary data.

Thus data which are primary in the hands of one become secondary in the hands of the other.

Data collected from any source can also be divided into the following two types:

Raw data: Raw data are data obtained from the original source but not yet arranged in numerical order. They are also called 'ungrouped data'. For example, the marks of 10 students in maths are given as:
75, 96, 25, 32, 89, 62, 40, 79, 35, 55
An 'array' is an arrangement of raw numerical data in ascending or descending order of magnitude. The above data can be written as
25, 32, 35, 40, 55, 62, 75, 79, 89, 96
Grouped data: An array can be placed systematically in groups or categories. For example, the marks above can be grouped into class intervals, with a frequency recorded for each interval.
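
As a minimal sketch of such a grouping, the Python snippet below places the ten marks into exclusive class intervals. The class width of 20 and the starting point of 20 are assumptions chosen for illustration; the source does not specify which classes were used.

```python
# Marks of 10 students, taken from the text above.
marks = [75, 96, 25, 32, 89, 62, 40, 79, 35, 55]

# Assumed for illustration: exclusive classes of width 20 starting at 20,
# i.e. 20-40, 40-60, 60-80, 80-100 (the upper limit is not included in the class).
CLASS_START = 20
CLASS_WIDTH = 20
NUM_CLASSES = 4


def group_marks(values):
    """Return a list of ((lower, upper), frequency) pairs."""
    table = []
    for i in range(NUM_CLASSES):
        lower = CLASS_START + i * CLASS_WIDTH
        upper = lower + CLASS_WIDTH
        frequency = sum(1 for v in values if lower <= v < upper)
        table.append(((lower, upper), frequency))
    return table


grouped = group_marks(marks)
for (lower, upper), frequency in grouped:
    print(f"{lower}-{upper}: {frequency}")
```

With these assumed classes the frequencies are 3, 2, 3 and 2, which add up to the 10 students.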

Some Basic Definitions

Variate: Variate is a quantity that may vary from observation to observation.
Range: The range is the difference between the maximum and minimum observations.
Class Interval: When data are divided into groups, each group is called a class interval.
Class Limit: Every class interval has two limits. The smallest observation of the interval is called the lower limit and the largest observation of the interval is called the upper limit.
Class Mark: The mid value of any class is called its class mark.
$$\text{Class Mark} = \frac{\text{Lower limit} + \text{Upper limit}}{2}$$
Class Size: Class size is defined as the difference between two successive class marks. It is also the difference between the true upper and true lower limits of any class interval.
Frequency: The number of observations falling in a particular class is called its frequency, or class frequency.
Cumulative Frequency: The cumulative frequency of any class is obtained by adding the frequencies of all the preceding classes to its own frequency, i.e. it is the sum of all frequencies up to and including that class.

Inclusive and Exclusive distributions

Inclusive Distribution: When the upper limit of a class does not coincide with the lower limit of the next class, the distribution is called an inclusive distribution, e.g. classes such as 1-10, 11-20, 21-30.
Exclusive Distribution: An exclusive distribution is one in which the upper limit of one class coincides with the lower limit of the next class, e.g. classes such as 0-10, 10-20, 20-30.
True Class Limit: In the case of exclusive classes, the upper and lower limits are themselves the true upper limits and true lower limits.
In the case of inclusive classes with whole-number limits, the true lower and upper limits are obtained by subtracting 0.5 from the lower limit and adding 0.5 to the upper limit.
True upper limits and true lower limits are also known as the boundaries of the class.
Tally: The tally method is used to keep the chance of error in counting to a minimum. A bar (|) called a tally mark is put against an item each time it occurs. The fifth occurrence of any item is represented by drawing a diagonal cross tally across the first four tallies.
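
The conversion from inclusive limits to true limits can be sketched in Python as follows. The function name `true_limits` and the sample classes are hypothetical. The sketch assumes at least two classes with the same gap between each pair of consecutive classes; half of that gap (0.5 when the limits are whole numbers) is subtracted from each lower limit and added to each upper limit.

```python
def true_limits(classes):
    """Convert inclusive class limits to true limits (class boundaries).

    Half of the gap between one class's upper limit and the next class's
    lower limit is subtracted from every lower limit and added to every
    upper limit.
    """
    gap = classes[1][0] - classes[0][1]  # e.g. 11 - 10 = 1
    half = gap / 2
    return [(lower - half, upper + half) for lower, upper in classes]


# Hypothetical inclusive classes, chosen only for illustration.
inclusive = [(1, 10), (11, 20), (21, 30)]
boundaries = true_limits(inclusive)
print(boundaries)  # [(0.5, 10.5), (10.5, 20.5), (20.5, 30.5)]
```

After the conversion, the upper boundary of each class coincides with the lower boundary of the next, so the classes are in exclusive form.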

Frequency Distribution Table

The tabular arrangement of data showing the frequency of each item is called a frequency distribution table. It presents raw data in a form from which the information they contain can be easily understood.

Frequency distributions are of two types:

Discrete frequency distribution: In the first column of the frequency table we write all possible values of the variable from the lowest to the highest, in the second column we write tally marks, and in the third column we show the frequency of each item. In this method the data are not divided into groups or classes. It is used only where the values in the raw data are largely repeating and the difference between the greatest and the smallest observations is not very large.
Continuous or Grouped Frequency Distribution: In this type of frequency distribution the data are divided into groups or classes. It is used when the difference between the greatest and the smallest observations is large.
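
A discrete frequency distribution with its three columns (value, tally, frequency) can be sketched in Python as below. The observations are hypothetical, and the text rendering of a group of five tallies as `||||/` is an assumption made for plain-text output.

```python
from collections import Counter

# Hypothetical raw data: number of children in 12 families (illustrative only).
observations = [2, 1, 3, 2, 0, 2, 1, 3, 2, 1, 2, 5]


def tally(count):
    """Render a count as tally marks, writing every group of five as '||||/'."""
    fives, rest = divmod(count, 5)
    groups = ["||||/"] * fives
    if rest:
        groups.append("|" * rest)
    return " ".join(groups)


frequency = Counter(observations)

# One row per possible value from the lowest to the highest observation,
# including values that never occur (frequency 0).
table = [
    (value, tally(frequency[value]), frequency[value])
    for value in range(min(observations), max(observations) + 1)
]

for value, marks, count in table:
    print(f"{value:>5}  {marks:<8}  {count}")
```

The value 4 never occurs in this sample, so its row has an empty tally column and a frequency of 0.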

Preparation of a frequency distribution table:

The following steps are taken to prepare a frequency distribution table:

First of all we arrange the data in an array.
Then we draw a table consisting of 3 columns. The first column is used for the class, the second column for the tally and the third column for the frequency.
Then in the first column we write the classes, keeping the lowest and the highest scores in view.
In the second column we put tally marks against each class according to the scores.
Then we write the frequency of each class in the third column after counting the tallies.
The figures in the first and third columns taken together represent the frequency table.

Cumulative Frequency Table

Cumulative frequency table is obtained from the ordinary frequency table by successively adding the several frequencies. Thus to form a cumulative frequency table we add a column of cumulative frequency in the frequency distribution table. It is obvious that the cumulative frequency of the last class is the sum of the frequencies of all the classes.

Cumulative frequency series are of two types:

Less than series
More than series
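
Both series can be computed by running sums, as in the following Python sketch. The classes and frequencies are hypothetical values chosen for illustration.

```python
from itertools import accumulate

# Hypothetical grouped data (exclusive classes), for illustration only.
classes = [(0, 10), (10, 20), (20, 30), (30, 40)]
frequencies = [4, 7, 6, 3]

# Less than series: running total up to and including each class.
less_than = list(accumulate(frequencies))

# More than series: total of each class and all the classes after it.
more_than = list(accumulate(reversed(frequencies)))[::-1]

for (lower, upper), f, lt, mt in zip(classes, frequencies, less_than, more_than):
    print(f"{lower}-{upper}: f={f}, less than {upper}: {lt}, {lower} or more: {mt}")
```

The last entry of the less than series and the first entry of the more than series both equal the total frequency, here 20.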

Graphical Representation of Data

Data can be represented graphically. There are various methods of graphical representation of a frequency distribution. Here we shall study only four of them:

Bar Graphs
Histogram
Frequency Polygon
Cumulative frequency curve or ogive

Bar Graphs

The frequency distribution of a discrete variable is best represented by a bar graph. The height of each bar is proportional to the frequency of the corresponding variate-value. In a bar graph the bars must be kept distinct to show that the variate-values are distinct. The bars are of equal width and are drawn with equal spacing between them on the x-axis, which depicts the variable. The frequencies are shown on the y-axis.

Histogram

A histogram is a graphical representation of a grouped frequency distribution with continuous classes. It consists of a set of rectangles whose bases are the class sizes and whose heights, for equal class intervals, are proportional to the class frequencies. There is no gap between two successive rectangles.

Frequency Polygon

A frequency polygon is a graph of a frequency distribution. It is a line graph of class frequency plotted against class mark.
A frequency polygon can be obtained by two methods:
(1) By using a histogram: A frequency polygon can be obtained by joining the mid-points of the tops of the rectangles of a histogram. For this we obtain the mid-points of the upper horizontal sides of each rectangle and then join these mid-points by dotted lines to get the frequency polygon. The ends of a frequency polygon are preferably extended to the mid-points of imagined class intervals adjacent to the first and last class intervals.

(2) Frequency polygon without using a histogram: The following procedure is used to draw a frequency polygon without a histogram.

Calculate the class marks x₁, x₂, ..., x_n of the given class intervals.
Mark the class marks x₁, x₂, ..., x_n along the X-axis and the frequencies f₁, f₂, ..., f_n along the Y-axis.
Plot the points (x₁, f₁), (x₂, f₂), ..., (x_n, f_n).
Obtain the mid-points of two imagined class intervals of zero frequency, one before the first interval and one after the last interval.
Join the points (x₁, f₁), (x₂, f₂), ..., (x_n, f_n) by line segments and complete the frequency polygon by joining the first and last points to the mid-points of the imagined classes adjacent to them.
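
The vertices produced by this procedure can be sketched in Python as below. The function name `polygon_points` and the data are hypothetical, and the sketch assumes equal-width classes so that the imagined zero-frequency classes have the same width as the real ones. Plotting is left out; only the coordinates are computed.

```python
def polygon_points(classes, frequencies):
    """Return the vertices of a frequency polygon as (class mark, frequency).

    Assumes equal-width classes, so the imagined zero-frequency classes
    on either side have the same width as the real ones.
    """
    width = classes[0][1] - classes[0][0]
    marks = [(lower + upper) / 2 for lower, upper in classes]
    first = (marks[0] - width, 0)  # mid-point of the imagined class before the first
    last = (marks[-1] + width, 0)  # mid-point of the imagined class after the last
    return [first] + list(zip(marks, frequencies)) + [last]


# Hypothetical data for illustration.
classes = [(0, 10), (10, 20), (20, 30), (30, 40)]
frequencies = [4, 7, 6, 3]
points = polygon_points(classes, frequencies)
print(points)
```

Joining consecutive points in this list by straight line segments gives the polygon, which starts and ends on the X-axis.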

Cumulative frequency curve or ogive

The graphical representation of a cumulative frequency distribution is known as a cumulative frequency curve or an ogive.
An ogive can be constructed by the following two methods:
Less than method: A less than ogive can be constructed by the following steps:

First of all we put the class intervals in exclusive form if they are given in inclusive form.
Then we construct a less than type cumulative frequency distribution by adding the frequency of each class to the sum of the frequencies of the classes before it.
Now we mark upper class limits along the X-axis and cumulative frequencies along the Y-axis.
We plot the points (upper class limit, corresponding cumulative frequency) and join them by a free hand curve.
The lower limit of the first class interval becomes the upper limit of an imagined class with frequency 0. We join the imagined point (lower limit of first class, 0) to the first point of the curve.

In this way we get the required curve called an Ogive by less than type method.

More than Type:
We apply the following steps to construct a more than type ogive:

Step (1): First of all we make class intervals in exclusive form if it is given in inclusive form.
Step (2): Then we construct a more than type cumulative frequency distribution.
Step (3): Now we mark lower class limits along the x-axis and cumulative frequencies along the y-axis.
Step (4): We plot the points (lower class limit, corresponding cumulative frequency) and join them by a free hand curve.
Step (5): The upper limit of the last class interval becomes the lower limit of an imagined class interval with frequency 0. We join the imagined point (upper limit of last class, 0) to the last point of the curve to end the ogive.

In this way we get the required curve called an ogive by more than type method.
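
The points to be plotted for both types of ogive can be sketched in Python as follows. The function names and the data are hypothetical, and the classes are assumed to be already in exclusive form. The curve itself would then be drawn through these points.

```python
from itertools import accumulate


def less_than_ogive(classes, frequencies):
    """Points (upper limit, cumulative frequency), starting at (first lower limit, 0)."""
    uppers = [upper for _, upper in classes]
    cumulative = list(accumulate(frequencies))
    return [(classes[0][0], 0)] + list(zip(uppers, cumulative))


def more_than_ogive(classes, frequencies):
    """Points (lower limit, cumulative frequency), ending at (last upper limit, 0)."""
    lowers = [lower for lower, _ in classes]
    cumulative = list(accumulate(reversed(frequencies)))[::-1]
    return list(zip(lowers, cumulative)) + [(classes[-1][1], 0)]


# Hypothetical data for illustration (classes already in exclusive form).
classes = [(0, 10), (10, 20), (20, 30), (30, 40)]
frequencies = [4, 7, 6, 3]

less_points = less_than_ogive(classes, frequencies)
more_points = more_than_ogive(classes, frequencies)
print(less_points)
print(more_points)
```

The less than ogive rises from 0 to the total frequency, while the more than ogive falls from the total frequency to 0.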

Measures of Central Tendency

An average of a distribution is a single value which represents a group of values in a simple and concise manner. It is representative of the entire distribution. Averages generally lie in the central part of the distribution and are therefore called Measures of Central Tendency.
An ideal measure of central tendency should have the following properties:

It should be rigidly defined.
It should be based on all observations.
It should be easy to calculate and readily comprehensible.
It should be affected as little as possible by fluctuations of sampling.
It should not be unduly affected by extreme values.

The following three measures of central tendency are used for analysing data:

Arithmetic mean
Median
Mode

Arithmetic mean for ungrouped data (A. M.)

The arithmetic mean is the most commonly used measure of central tendency. It is obtained by dividing the sum of the observations by the number of observations. The A. M. of n observations x₁, x₂, x₃, ..., x_n is given by
$$\text{A.M.} = \bar{x} = \frac{x_1 + x_2 + \cdots + x_n}{n} = \frac{1}{n}\sum_{i=1}^{n} x_i$$
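
As a short illustration, the Python sketch below computes the arithmetic mean (sum of the observations divided by their number) for the ten marks listed earlier in these notes. The function name `arithmetic_mean` is chosen here for illustration.

```python
# Marks of 10 students, from the raw data example earlier in the notes.
marks = [75, 96, 25, 32, 89, 62, 40, 79, 35, 55]


def arithmetic_mean(values):
    """Sum of the observations divided by the number of observations."""
    return sum(values) / len(values)


mean = arithmetic_mean(marks)
print(mean)  # 58.8
```

The marks add up to 588, so the mean of the 10 observations is 58.8.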

Properties of Arithmetic Mean:

If x̄ is the mean of n observations x₁, x₂, ..., x_n, then the mean of the observations x₁ + a, x₂ + a, ..., x_n + a is x̄ + a, i.e. if each observation is increased by a, then the mean is also increased by a.
If x̄ is the mean of n observations x₁, x₂, ..., x_n, then the mean of the observations x₁ – a, x₂ – a, ..., x_n – a is x̄ – a, i.e. if each observation is decreased by a, then the mean is also decreased by a.
If x̄ is the mean of x₁, x₂, ..., x_n, then the mean of ax₁, ax₂, ..., ax_n is ax̄, where a is any number different from zero, i.e. if each observation is multiplied by a non-zero number a, then the mean is also multiplied by a.
If x̄ is the mean of n observations x₁, x₂, ..., x_n, then the mean of x₁/a, x₂/a, ..., x_n/a is x̄/a, where a ≠ 0, i.e. if each observation is divided by a non-zero number, then the mean is also divided by it.
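
These four properties can be checked numerically. The Python sketch below uses hypothetical observations and a hypothetical constant a = 3, with exact fractions so that the comparisons are not affected by floating-point rounding.

```python
from fractions import Fraction


def mean(values):
    """Exact arithmetic mean as a Fraction."""
    return Fraction(sum(values), len(values))


x = [4, 7, 10, 15]  # hypothetical observations
a = 3               # hypothetical non-zero constant
m = mean(x)         # 36 / 4 = 9

# Adding, subtracting, multiplying or dividing every observation by a
# changes the mean in the same way.
assert mean([v + a for v in x]) == m + a
assert mean([v - a for v in x]) == m - a
assert mean([a * v for v in x]) == a * m
assert mean([Fraction(v, a) for v in x]) == m / a

print(m, m + a, m - a, a * m, m / a)
```

A numerical check of this kind illustrates the properties for one data set; it is not a proof.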

Arithmetic mean of Grouped Data:
Let x₁, x₂, x₃, ..., x_n be n observations whose frequencies are f₁, f₂, f₃, ..., f_n respectively; then the arithmetic mean of this distribution is given by
$$\bar{x} = \frac{f_1 x_1 + f_2 x_2 + \cdots + f_n x_n}{f_1 + f_2 + \cdots + f_n} = \frac{\sum_{i=1}^{n} f_i x_i}{\sum_{i=1}^{n} f_i}$$
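
A minimal Python sketch of the grouped mean, computed as the sum of the products f·x divided by the sum of the frequencies f, is given below. The values and frequencies are hypothetical. For a grouped distribution with class intervals, the class marks are commonly used as the x values, which is the assumption made here.

```python
def grouped_mean(values, frequencies):
    """Sum of (frequency * value) divided by the sum of the frequencies."""
    total = sum(f * x for x, f in zip(values, frequencies))
    return total / sum(frequencies)


# Hypothetical data: class marks of the classes 0-10, 10-20, 20-30, 30-40.
class_marks = [5, 15, 25, 35]
frequencies = [4, 7, 6, 3]

result = grouped_mean(class_marks, frequencies)
print(result)  # 19.0
```

Here the sum of the products is 20 + 105 + 150 + 105 = 380 and the sum of the frequencies is 20, giving a mean of 19.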

Combined Mean:
Let x̄₁ and x̄₂ be the means of two groups of observations with n₁ and n₂ observations respectively; then the combined mean x̄ of the two groups is given by
$$\bar{x} = \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2}{n_1 + n_2}$$
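
The combined mean weights each group mean by the size of its group. The Python sketch below uses hypothetical group sizes and means purely for illustration.

```python
def combined_mean(n1, mean1, n2, mean2):
    """Mean of two groups taken together, weighting each mean by its group size."""
    return (n1 * mean1 + n2 * mean2) / (n1 + n2)


# Hypothetical example: 30 students with mean 60 and 20 students with mean 70.
result = combined_mean(30, 60, 20, 70)
print(result)  # 64.0
```

The result, 64, lies closer to 60 than to 70 because the first group is larger.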

Merits of Arithmetic Mean:

A. M. is rigidly defined.
It is very simple. One can easily understand and calculate it.
It is uniquely defined.
It is based upon all the observations.
A. M. is least affected by sampling fluctuations.
The mean lends itself to further mathematical analysis.
A. M. is relatively reliable.

Demerits of Arithmetic Mean:

A. M. cannot be used for qualitative characteristics like richness, beauty, poverty etc.
The A. M. of given data cannot be determined by inspection, nor can it be located graphically.
If any observation is missing then A.M. cannot be calculated.
A. M. is very much affected by extreme values. In case of extreme items, A. M. gives a distorted picture of the distribution and no longer remains representative of the distribution.
If the extreme class is open, e.g. below 10 or above 100 then A. M. cannot be calculated.
If the data from which the mean was calculated are not given, the A. M. alone may lead to wrong conclusions.
A. M. cannot be used in the study of ratios, rates etc.

Uses of Arithmetic Mean:

A. M. is extensively used in practical statistics.
Estimates can be obtained using A. M.
A. M. is used for different purposes by different persons like it is used for calculating average marks of the students. It is also used by businessmen to find out profit per unit article, output per machine, average monthly income and expenditure etc.

Median

Median is defined as the value of that item of the arrayed data which divides the whole data into two equal parts. Hence we have following definition of median:
The middle item of the arrayed data is called its median.

Calculation of median of raw data:

If the number of observations n is odd, then the median is the value of the ((n + 1)/2)th observation.
If n is even, then there are two middle terms, i.e. the (n/2)th observation and the (n/2 + 1)th observation.
The median of the given data is then the mean of these two middle observations.
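
Both cases can be sketched in Python as follows. The function name `median` is chosen here for illustration, and the even-sized example reuses the ten marks from the raw data example earlier in these notes.

```python
def median(values):
    """Middle value of the arrayed data (mean of the two middle values if n is even)."""
    ordered = sorted(values)  # arrange the data in an array
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]  # the ((n + 1) / 2)th observation
    # the (n / 2)th and (n / 2 + 1)th observations
    return (ordered[mid - 1] + ordered[mid]) / 2


marks = [75, 96, 25, 32, 89, 62, 40, 79, 35, 55]
print(median(marks))      # n = 10 (even): (55 + 62) / 2 = 58.5
print(median(marks[:5]))  # n = 5 (odd): array is 25, 32, 75, 89, 96, median 75
```

Python lists are indexed from 0, so the kth observation of the array is `ordered[k - 1]`.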

The document Statistics and Probability (Part - 1) | Quantitative for GMAT is a part of the GMAT Course Quantitative for GMAT.

FAQs on Statistics and Probability (Part - 1) - Quantitative for GMAT

1. What is the difference between statistics and probability?

Ans. Statistics is the branch of mathematics that deals with the collection, analysis, interpretation, presentation, and organization of data. It involves techniques for summarizing and describing data, making inferences or predictions, and testing hypotheses. On the other hand, probability is the study of uncertainty and the likelihood of events occurring. It provides a framework for quantifying uncertainty and enables us to make decisions and predictions based on the likelihood of different outcomes.

2. How is probability used in statistics?

Ans. Probability is used in statistics to analyze and interpret data. It helps in determining the likelihood of different outcomes or events based on the available data. Probability provides a mathematical framework for making predictions and drawing conclusions from data. It also allows statisticians to calculate probabilities of events occurring and to assess the uncertainty associated with statistical estimates.

3. What are the basic concepts in probability theory?

Ans. The basic concepts in probability theory include:
Sample space: It is the set of all possible outcomes of a random experiment.
Event: It is a subset of the sample space, representing a specific outcome or a collection of outcomes.
Probability: It is a measure of the likelihood of an event occurring. It is a number between 0 and 1, where 0 represents impossibility and 1 represents certainty.
Random variable: It is a variable that takes on different values based on the outcomes of a random experiment. It can be discrete or continuous.
Probability distribution: It describes the probabilities associated with different values of a random variable. It can be represented through a probability mass function (for discrete variables) or a probability density function (for continuous variables).

4. How is statistics used in decision-making?

Ans. Statistics is used in decision-making by providing information and insights based on data analysis. It helps in understanding patterns, trends, and relationships in data, which can assist in making informed decisions. Statistics can be used to evaluate the effectiveness of different options, assess risks and uncertainties, and support the development of strategies or policies. Through statistical analysis, decision-makers can make predictions, test hypotheses, and quantify the likelihood of different outcomes.

5. What are the different types of statistical data analysis techniques?

Ans. Some of the different types of statistical data analysis techniques include:
Descriptive statistics: It involves summarizing and describing data using measures such as mean, median, mode, standard deviation, and range.
Inferential statistics: It involves making inferences or predictions about a population based on a sample. It includes techniques such as hypothesis testing, confidence intervals, and regression analysis.
Exploratory data analysis: It involves exploring and visualizing data to discover patterns, relationships, and outliers.
Data mining: It involves extracting knowledge or information from large datasets using techniques such as clustering, classification, and association rule mining.
Time series analysis: It involves analyzing data collected over time to identify trends and seasonal patterns and to forecast future values.

About this Document

Sep 20, 2025 Last updated
