Table of contents | |
Problem Scoping | |
Data Acquisition | |
Data Exploration | |
Modelling | |
Evaluation |
Whenever we begin a new project, we encounter a number of challenges. In fact, we are surrounded with issues! These issues might be minor or major; sometimes we overlook them, and other times we require immediate attention.
To understand a problem, determine the different aspects that affect the problem, and define the project’s goal are problem scoping.
Follow the following steps to identify the problem scoping from the project –
The 4 W’s of Problem Scoping are Who, What, Where, and Why. This 4 W’s helps to identify and understand the problem in a better manner.
After you’ve completed the above 4Ws, make a summary of what you’ve learned. The problem statement template is the name for this summary. This template summarizes all of the important points in one place. So, if the same problem comes again, this statement will make it much easier to fix.
Problem Statement Template with space to fill details according to your Goal:
The method of collecting correct and dependable data to work with is known as data acquisition. Data can be in the form of text, video, photos, audio, and so on, and it can be gathered from a variety of places such as websites, journals, and newspapers.
Data is a representation of facts or instructions about an entity that can be processed or conveyed by a human or a machine, such as numbers, text, pictures, audio clips, videos, and so on.
There is two type of data –
a. Structured Data
When data is in a standardized format, has a well-defined structure, follows a consistent order, and is easily accessible by humans and program. This data is in the form of numbers, characters, special characters etc.
b. Unstructured Data
Unstructured data is information that doesn’t follow traditional data models and is therefore difficult to store and manage. Video, audio, and image files, as well as log files, are all examples of unstructured data.
Dataset is a collection of data in tabular format. Dataset contains numbers or values that are related to a specific subject. For example, students’ test scores in a class is a dataset.
The dataset is divided into two parts
There are six ways to collect data.
Exploration helps you gain a better understanding of a dataset, making it easier to explore and use it later. It also helps to quickly understand the data’s trends, and patterns.
Data visualization charts are graphical representations of data that use symbols to convey a story and help people understand large volumes of information.
Venn Diagram of AI
The rule-based approach to AI modeling is when the developer defines the relationship or patterns in data. The machine follows the developer’s rules or instructions and completes its job properly.
An AI model is a program that has been trained to recognize patterns using a set of data. AI modeling is the process of creating algorithms, also known as models, that may be educated to produce intelligent results. This is the process of programming code to create a machine artificially.
Refers to AI modeling in which the developer hasn’t specified the relationship or patterns in the data. Random data is provided to the computer in this method, and the system is left to figure out patterns and trends from it. When the data is unlabeled and too random for a human to make sense of, this method is usually used.
The concept of Decision Trees is similar to that of Story Speaker. It’s a rule-based AI model that uses numerous judgments (or rules) to assist the machine in determining what an element is. The following is the basic structure of a decision tree:
When building a decision tree, it’s common for the dataset to have redundant material that’s of no use. As a result, you should make a list of the parameters that directly affect the output and use only those when designing a decision tree.
For a single dataset, there may be several decision trees that lead to correct prediction. The most straightforward option should be selected.
After a model has been created and trained, it must be thoroughly tested in order to determine its efficiency and performance; this is known as evaluation.
40 videos|35 docs|6 tests
|
1. What is the importance of problem scoping in the data science process? |
2. How does data acquisition play a role in the data science process? |
3. What are some common techniques used for data exploration in data science? |
4. How is modeling approached in the data science process? |
5. How is the evaluation of models conducted in data science projects? |
|
Explore Courses for Class 10 exam
|