Table of contents
Multiple Choice Questions
Fill in the Blanks
True or False
Short Answer Questions
Long Answer Questions
Multiple Choice Questions
Q.1: Which of the following is a major application of data science?
a) Game development
b) Targeted advertising
c) Mobile app creation
d) Hardware design
Q.2: What type of machine learning model is used in the restaurant food waste prediction project?
a) Clustering
b) Regression
c) Classification
d) Reinforcement learning
Q.3: Which Python library is primarily used for numerical and logical operations on arrays?
a) Pandas
b) Matplotlib
c) NumPy
d) Scikit-learn
Q.4: What type of data does Pandas handle efficiently?
a) Image data
b) Tabular data with heterogeneously-typed columns
c) Audio data
d) Unstructured text data
Q.5: Which type of plot is used to represent the frequency distribution of a continuous variable?
a) Scatter plot
b) Bar chart
c) Histogram
d) Pie plot
Q.6: What does the K-Nearest Neighbor (KNN) algorithm primarily rely on to make predictions?
a) Random sampling
b) Surrounding points or neighbors
c) Predefined rules
d) Statistical averages
Q.7: What is the purpose of the interquartile range (IQR) in a box plot?
a) To show the mean of the data
b) To represent the spread between the 25th and 75th percentiles
c) To identify the mode of the data
d) To calculate the variance
Q.8: Which of the following is a source of online data collection?
a) Manual record-keeping
b) Open-sourced government portals
c) Physical surveys
d) Direct observations
Q.9: What type of data issue involves incorrect values like a decimal in a phone number column?
a) Missing data
b) Outliers
c) Erroneous data
d) Null values
Q.10: In the KNN algorithm, why is K often chosen as an odd number in classification problems?
a) To reduce computation time
b) To ensure a tiebreaker in majority voting
c) To increase model complexity
d) To eliminate outliers
Fill in the Blanks
Q.1: Data science combines Python with mathematical concepts like __________, data analysis, and probability.
Q.2: The __________ canvas in problem scoping identifies who is experiencing the problem.
Q.3: In the restaurant project, the dataset includes the __________ of each dish produced daily.
Q.4: The Python library __________ is used for creating visualizations like bar graphs and scatter plots.
Q.5: The statistical measure __________ represents the most frequent value in a sequence.
True or False
Q.1: Data science algorithms are not used in search engines like Google.
Q.2: NumPy arrays can contain multiple data types, unlike Python lists.
Q.3: Pandas is built on top of NumPy for enhanced data manipulation.
Q.4: A histogram is used to represent discontinuous data with gaps.
Q.5: In the KNN algorithm, a lower K value (e.g., K=1) makes predictions more stable.
Short Answer Questions
Q.1: What is the goal of the restaurant food waste prediction project?
Q.2: Name two benefits of using data science in airline route planning.
Q.3: How does Pandas handle missing data in datasets?
Q.4: Explain the difference between a scatter plot and a histogram in data visualization.
Q.5: What is the purpose of the K-Nearest Neighbor (KNN) algorithm in data science?
Long Answer Questions
Q.1: Describe the 4Ws problem canvas and how it is applied to the restaurant food waste problem.
Q.2: Explain the steps involved in evaluating the regression model for the restaurant food waste prediction project.
Q.3: Discuss the differences between NumPy arrays and Python lists, highlighting their key features.
Q.4: Explain how Matplotlib is used for data visualization, including the types of plots it can create.
Q.5: Describe the K-Nearest Neighbor (KNN) algorithm and explain how the choice of K affects its predictions, using the fruit sweetness example.