Q1: The correlation coefficient, denoted by \(r\), measures the strength and direction of a linear relationship between two variables. What is the range of possible values for \(r\)? (a) \(-1 \leq r \leq 1\) (b) \(0 \leq r \leq 1\) (c) \(-\infty < r=""><> (d) \(0 \leq r \leq 100\)
Solution:
Ans: (a) Explanation: The correlation coefficient \(r\) always ranges from -1 to 1. A value of \(r = 1\) indicates a perfect positive linear relationship, \(r = -1\) indicates a perfect negative linear relationship, and \(r = 0\) indicates no linear relationship. Option (b) only includes positive values, option (c) suggests infinite values which is incorrect, and option (d) treats it as a percentage which is not how correlation is measured.
Q2: A scatter plot shows that as the number of hours studied increases, test scores also increase. If the correlation coefficient is calculated to be \(r = 0.85\), what type of correlation does this represent? (a) Strong negative correlation (b) Weak positive correlation (c) Strong positive correlation (d) No correlation
Solution:
Ans: (c) Explanation: A correlation coefficient of \(r = 0.85\) is close to 1, indicating a strong positive correlation. Values of \(r\) close to 1 or -1 indicate strong relationships, while values close to 0 indicate weak relationships. Since 0.85 is positive and greater than 0.7 (commonly considered the threshold for strong correlation), it represents a strong positive relationship between hours studied and test scores.
Q3: Which of the following correlation coefficients indicates the strongest linear relationship between two variables? (a) \(r = 0.45\) (b) \(r = -0.92\) (c) \(r = 0.68\) (d) \(r = 0.03\)
Solution:
Ans: (b) Explanation: The strength of a correlation is determined by the absolute value of \(r\), not its sign. The absolute values are: |0.45| = 0.45, |-0.92| = 0.92, |0.68| = 0.68, and |0.03| = 0.03. Since 0.92 is the largest absolute value, \(r = -0.92\) indicates the strongest linear relationship, even though it is negative (meaning the variables move in opposite directions).
Q4: A researcher finds that the correlation between ice cream sales and drowning incidents is \(r = 0.78\). What can be concluded from this correlation? (a) Ice cream sales cause drowning incidents (b) Drowning incidents cause ice cream sales (c) There is a strong positive association, but correlation does not imply causation (d) The data must be incorrect because these variables are unrelated
Solution:
Ans: (c) Explanation: The fundamental principle is that correlation does not imply causation. While there is a strong positive correlation (\(r = 0.78\)), this does not mean one variable causes the other. In this case, both variables are likely influenced by a confounding variable (temperature or summer season). When it's hot, both ice cream sales and swimming (and thus drowning incidents) increase. Options (a) and (b) incorrectly assume causation, and option (d) is wrong because unrelated variables can still show correlation due to lurking variables.
Q5: If all data points in a scatter plot lie exactly on a straight line with a negative slope, what is the value of the correlation coefficient? (a) \(r = 0\) (b) \(r = 1\) (c) \(r = -1\) (d) \(r = -0.5\)
Solution:
Ans: (c) Explanation: When all data points lie exactly on a straight line, there is a perfect linear relationship. If the line has a negative slope (meaning as one variable increases, the other decreases), the correlation coefficient is \(r = -1\). This represents perfect negative correlation. If the slope were positive, \(r\) would equal 1. Values between -1 and 1 occur when points do not fall exactly on a line.
Q6: The formula for the Pearson correlation coefficient includes which of the following components? (a) Mean and median of the data sets (b) Standard deviations and covariance of the variables (c) Range and interquartile range (d) Mode and frequency distribution
Solution:
Ans: (b) Explanation: The Pearson correlation coefficient is calculated using the formula \(r = \frac{\text{Cov}(X,Y)}{s_X \cdot s_Y}\), where Cov(X,Y) is the covariance between variables X and Y, and \(s_X\) and \(s_Y\) are the standard deviations of X and Y respectively. The other options list statistical measures that are not directly used in calculating the correlation coefficient.
Q7: A correlation coefficient of \(r = 0.02\) suggests which of the following? (a) A strong positive linear relationship (b) A strong negative linear relationship (c) Almost no linear relationship (d) A perfect linear relationship
Solution:
Ans: (c) Explanation: A correlation coefficient very close to zero (such as \(r = 0.02\)) indicates almost no linear relationship between the two variables. The variables are essentially independent in terms of linear association. Values close to 0 (typically between -0.3 and 0.3) suggest weak or negligible correlation. Strong correlations have absolute values above 0.7, and perfect correlations are exactly ±1.
Q8: Which statement about correlation is TRUE? (a) A correlation of \(r = 0\) means the variables are completely unrelated in all ways (b) Correlation can detect any type of relationship between variables (c) Correlation only measures the strength of linear relationships (d) A high correlation always means one variable causes changes in the other
Solution:
Ans: (c) Explanation: The correlation coefficient specifically measures the strength and direction of linear relationships only. Variables can have strong non-linear relationships (like quadratic or exponential) but still show \(r = 0\). Option (a) is incorrect because \(r = 0\) only means no linear relationship exists; non-linear relationships may still be present. Option (b) is wrong because correlation only detects linear patterns. Option (d) is incorrect because correlation does not imply causation.
Section B: Fill in the Blanks
Q9: The correlation coefficient is a numerical measure that describes the __________ and __________ of a linear relationship between two quantitative variables.
Solution:
Ans: strength and direction Explanation: The correlation coefficient provides two key pieces of information: the strength (how closely the data points cluster around a line, indicated by the absolute value of \(r\)) and the direction (whether the relationship is positive or negative, indicated by the sign of \(r\)).
Q10: If the correlation coefficient between two variables is negative, this indicates that as one variable increases, the other variable __________.
Solution:
Ans: decreases Explanation: A negative correlation means the variables move in opposite directions. When one variable increases, the other decreases. This is represented by a negative value of \(r\) and appears as a downward-sloping pattern in a scatter plot.
Q11: A correlation coefficient of \(r = 1\) represents a __________ positive linear relationship.
Solution:
Ans: perfect Explanation: When \(r = 1\), there is a perfect positive linear relationship, meaning all data points lie exactly on a straight line with a positive slope. This is the strongest possible positive correlation.
Q12: The principle that states "a correlation between two variables does not necessarily mean that one causes the other" is known as correlation does not imply __________.
Solution:
Ans: causation Explanation: This is a fundamental principle in statistics: correlation does not imply causation. Just because two variables are correlated does not mean that changes in one variable cause changes in the other. There may be confounding variables or the relationship may be coincidental.
Q13: The Pearson correlation coefficient is calculated by dividing the __________ of two variables by the product of their standard deviations.
Solution:
Ans: covariance Explanation: The formula for the Pearson correlation coefficient is \(r = \frac{\text{Cov}(X,Y)}{s_X \cdot s_Y}\), where the numerator is the covariance between X and Y, and the denominator is the product of their standard deviations. This standardizes the covariance to produce a value between -1 and 1.
Q14: In a scatter plot, when data points are widely scattered with no clear pattern, the correlation is described as __________.
Solution:
Ans: weak (or no correlation) Explanation: When data points in a scatter plot show no clear linear pattern and are widely scattered, this indicates a weak or negligible correlation, with \(r\) close to zero. The variables do not have a strong linear relationship.
Section C: Word Problems
Q15: A teacher records the number of hours 10 students spend on social media per day and their GPAs. The correlation coefficient is calculated to be \(r = -0.76\). Interpret this correlation coefficient in the context of the problem, including both its strength and direction.
Solution:
Ans: Final Answer: There is a strong negative correlation between hours spent on social media and GPA. This means that as students spend more hours on social media, their GPAs tend to decrease. The correlation is considered strong because the absolute value of 0.76 is relatively close to 1.
Q16: A sports analyst examines the relationship between the number of practice hours per week and the number of goals scored by 15 soccer players. The data yields a correlation coefficient of \(r = 0.88\). What does this value tell us about the relationship? Additionally, can the analyst conclude that more practice hours cause more goals to be scored? Explain.
Solution:
Ans: Final Answer: The correlation coefficient of \(r = 0.88\) indicates a strong positive correlation between practice hours and goals scored. As practice hours increase, goals scored tend to increase as well. However, the analyst cannot conclude that more practice hours cause more goals to be scored because correlation does not imply causation. Other factors such as natural talent, coaching quality, or physical fitness could also influence the number of goals scored.
Q17: The following data shows the relationship between temperature (in °F) and hot chocolate sales at a café:
Temperature: 30, 35, 40, 45, 50
Sales: 85, 78, 70, 65, 58
Based on this data, would you expect the correlation coefficient to be positive, negative, or close to zero? Explain your reasoning without calculating the exact value.
Solution:
Ans: Final Answer: The correlation coefficient would be expected to be negative. As the temperature increases from 30°F to 50°F, the hot chocolate sales decrease from 85 to 58. This shows an inverse relationship where higher temperatures correspond to lower sales, which is characteristic of a negative correlation.
Q18: A researcher studies 20 cities and finds that the correlation between the number of churches and the number of bars is \(r = 0.82\). A student concludes that "churches cause bars to open nearby." Explain why this conclusion is flawed and provide a more reasonable explanation for the observed correlation.
Solution:
Ans: Final Answer: The student's conclusion is flawed because correlation does not imply causation. The presence of churches does not cause bars to open. A more reasonable explanation is that both variables are influenced by a confounding variable: population size. Larger cities have more people, which leads to both more churches (to serve religious communities) and more bars (to serve social/entertainment needs). The correlation exists because both variables increase with population, not because one causes the other.
Q19: A scatter plot of study time versus exam scores for a class shows all points lying exactly on a straight line that rises from left to right. Without performing any calculations, determine the correlation coefficient for this data and justify your answer.
Solution:
Ans: Final Answer: The correlation coefficient is \(r = 1\). When all data points lie exactly on a straight line that rises from left to right (positive slope), there is a perfect positive linear relationship. This corresponds to a correlation coefficient of exactly 1, indicating that study time and exam scores increase together in perfect proportion.
Q20: A biologist measures the wingspan and body length of 12 butterflies of the same species. She calculates the correlation coefficient and finds \(r = 0.15\). Interpret this correlation in context. Should the biologist conclude that wingspan and body length are related? Explain.
Solution:
Ans: Final Answer: The correlation coefficient of \(r = 0.15\) indicates a very weak positive correlation between wingspan and body length. This value is very close to zero, suggesting that there is almost no linear relationship between these two variables in the butterflies studied. The biologist should conclude that wingspan and body length do not appear to be linearly related, or if there is a relationship, it is too weak to be meaningful. However, this doesn't rule out the possibility of a non-linear relationship or the need for a larger sample size.
ppt, Extra Questions, practice quizzes, Previous Year Questions with Solutions, Worksheet (with Solutions): Correlation Coefficients, shortcuts and tricks, video lectures, Semester Notes, mock tests for examination, Exam, Worksheet (with Solutions): Correlation Coefficients, Important questions, pdf , Summary, Viva Questions, study material, past year papers, Free, Sample Paper, Worksheet (with Solutions): Correlation Coefficients, Objective type Questions, MCQs;