Botany Exam  >  Botany Notes  >  Bioinformatics  >  Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1)

Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1) | Bioinformatics - Botany PDF Download

Download, print and study this document offline
Please wait while the PDF view is loading
 Page 1


Biological sequence database: NCBI 
 0 
 
 
 
 
 
 
 
 
 
 
Subject : Bioinformatics 
Lesson : Biological sequence database: National Center for Biotechnology 
Information (NCBI ) 
Lesson Developer : Sandip Das 
College/ Department: Department of Botany, University of Delhi 
 
 
 
 
 
 
 
 
 
 
Page 2


Biological sequence database: NCBI 
 0 
 
 
 
 
 
 
 
 
 
 
Subject : Bioinformatics 
Lesson : Biological sequence database: National Center for Biotechnology 
Information (NCBI ) 
Lesson Developer : Sandip Das 
College/ Department: Department of Botany, University of Delhi 
 
 
 
 
 
 
 
 
 
 
Biological sequence database: NCBI 
 1 
Table of Contents 
Chapter: Biological sequence database: National Center for 
Biotechnology Information (NCBI)  
? Introduction  
? Databases at NCBI 
? Literature 
? Bookshelf 
? Pubmed 
? Nucleic Acid 
? dbEST 
? dbGSS 
? dbGSS 
? Popset 
? dbGaP 
? dbVar 
o Genome 
o Taxonomy 
o PubChem 
o Expression analysis 
o Protein 
 
? Summary  
? Exercise/ Practice 
? Glossary 
? References/ Bibliography/ Further Reading 
 
 
 
 
National Center for Biotechnology Information (NCBI) 
NCBI has emerged as the primary free-to-access source of data and analysis tools in the 
field of computational biology.  The free-access nature of NCBI is possible as the policy of 
funding and publication in most countries dictates that the researcher mandatorily deposits 
the information generated using public-fund into a free-to-access central repository. In 
return, the repository (such as NCBI or EMBL) assigns a unique identification number, often 
termed as accession number, to the data that also can be used to identify the depositor and 
Page 3


Biological sequence database: NCBI 
 0 
 
 
 
 
 
 
 
 
 
 
Subject : Bioinformatics 
Lesson : Biological sequence database: National Center for Biotechnology 
Information (NCBI ) 
Lesson Developer : Sandip Das 
College/ Department: Department of Botany, University of Delhi 
 
 
 
 
 
 
 
 
 
 
Biological sequence database: NCBI 
 1 
Table of Contents 
Chapter: Biological sequence database: National Center for 
Biotechnology Information (NCBI)  
? Introduction  
? Databases at NCBI 
? Literature 
? Bookshelf 
? Pubmed 
? Nucleic Acid 
? dbEST 
? dbGSS 
? dbGSS 
? Popset 
? dbGaP 
? dbVar 
o Genome 
o Taxonomy 
o PubChem 
o Expression analysis 
o Protein 
 
? Summary  
? Exercise/ Practice 
? Glossary 
? References/ Bibliography/ Further Reading 
 
 
 
 
National Center for Biotechnology Information (NCBI) 
NCBI has emerged as the primary free-to-access source of data and analysis tools in the 
field of computational biology.  The free-access nature of NCBI is possible as the policy of 
funding and publication in most countries dictates that the researcher mandatorily deposits 
the information generated using public-fund into a free-to-access central repository. In 
return, the repository (such as NCBI or EMBL) assigns a unique identification number, often 
termed as accession number, to the data that also can be used to identify the depositor and 
Biological sequence database: NCBI 
 2 
several other features.  The following section will introduce you to a variety of databases 
dealing with a wide range of disciplines. Please do note that although the data may be 
organized separately for the sake of simplicity and clarity, in reality, all the databases are 
inter-linked and can be navigated from one to the other. The databases are also associated 
with their appropriate analysis tools. 
The following section lists some of the databases that have been created at NCBI. For the 
sake of simplicity, the databases in this lesson have been divided into three sections-section 
I dealing with publication, literature and small scale DNA/RNA sequencing projects; section 
II-dealing with whole genome, epigenome, maps of genomes, taxonomy and chemical 
structures; and section III dealing with resources for RNA and protein that are required for 
“functional genomics” . These sections marked as I, II and III will be dealt in their 
respective chapters. 
 
 
Databases-I: 
Literature (PubMed, PubMed Central; NCBI Bookshelf): 
 DNA and RNA (Refseq, nucleotide, EST, GSS, WGS, PopSet, trace archive, SRA): 
Databases-II: 
Genomes (Map Viewer, Genome workbench, Plant Genome Central, Genome 
Reference Consortium, Epigenomics, Genomics Structural variation): 
Maps: 
Taxonomy: 
PubChem Substance:  
Databases-III: 
Expression analysis-GEO 
Proteins (Reference sequences, GenPept, UniProt/SwissProt, PRF, PDB, Protein 
clusters, Structure, UniGene, CDD): 
 
Entrez is the single point database search and retrieval system that allows a user to 
perform the search and retrieve action against “all” or a “specific” database in an 
interlinked manner. 
 
Page 4


Biological sequence database: NCBI 
 0 
 
 
 
 
 
 
 
 
 
 
Subject : Bioinformatics 
Lesson : Biological sequence database: National Center for Biotechnology 
Information (NCBI ) 
Lesson Developer : Sandip Das 
College/ Department: Department of Botany, University of Delhi 
 
 
 
 
 
 
 
 
 
 
Biological sequence database: NCBI 
 1 
Table of Contents 
Chapter: Biological sequence database: National Center for 
Biotechnology Information (NCBI)  
? Introduction  
? Databases at NCBI 
? Literature 
? Bookshelf 
? Pubmed 
? Nucleic Acid 
? dbEST 
? dbGSS 
? dbGSS 
? Popset 
? dbGaP 
? dbVar 
o Genome 
o Taxonomy 
o PubChem 
o Expression analysis 
o Protein 
 
? Summary  
? Exercise/ Practice 
? Glossary 
? References/ Bibliography/ Further Reading 
 
 
 
 
National Center for Biotechnology Information (NCBI) 
NCBI has emerged as the primary free-to-access source of data and analysis tools in the 
field of computational biology.  The free-access nature of NCBI is possible as the policy of 
funding and publication in most countries dictates that the researcher mandatorily deposits 
the information generated using public-fund into a free-to-access central repository. In 
return, the repository (such as NCBI or EMBL) assigns a unique identification number, often 
termed as accession number, to the data that also can be used to identify the depositor and 
Biological sequence database: NCBI 
 2 
several other features.  The following section will introduce you to a variety of databases 
dealing with a wide range of disciplines. Please do note that although the data may be 
organized separately for the sake of simplicity and clarity, in reality, all the databases are 
inter-linked and can be navigated from one to the other. The databases are also associated 
with their appropriate analysis tools. 
The following section lists some of the databases that have been created at NCBI. For the 
sake of simplicity, the databases in this lesson have been divided into three sections-section 
I dealing with publication, literature and small scale DNA/RNA sequencing projects; section 
II-dealing with whole genome, epigenome, maps of genomes, taxonomy and chemical 
structures; and section III dealing with resources for RNA and protein that are required for 
“functional genomics” . These sections marked as I, II and III will be dealt in their 
respective chapters. 
 
 
Databases-I: 
Literature (PubMed, PubMed Central; NCBI Bookshelf): 
 DNA and RNA (Refseq, nucleotide, EST, GSS, WGS, PopSet, trace archive, SRA): 
Databases-II: 
Genomes (Map Viewer, Genome workbench, Plant Genome Central, Genome 
Reference Consortium, Epigenomics, Genomics Structural variation): 
Maps: 
Taxonomy: 
PubChem Substance:  
Databases-III: 
Expression analysis-GEO 
Proteins (Reference sequences, GenPept, UniProt/SwissProt, PRF, PDB, Protein 
clusters, Structure, UniGene, CDD): 
 
Entrez is the single point database search and retrieval system that allows a user to 
perform the search and retrieve action against “all” or a “specific” database in an 
interlinked manner. 
 
Biological sequence database: NCBI 
 3 
 
 
 
Figure : Various databases at NCBI can be accessed through the Entrez portal 
Source: http://www.ncbi.nlm.nih.gov/sites/gquery 
 
The National Center for Biotechnology Center (NCBI) site is conveniently organized into 
four major domains and these domains are interlinked :  
1. Databases,  
2. Tools,  
3. Data submission and  
4. Education  
 
The following figure depicts the interlinked nature of these domains and can be reached 
by 
1. Open the ncbi page by typing in www.ncbi.nlm.nih.gov in  the web browser 
2. Click the “search” button on the home page without enetering any keyword . 
3. On the  top left hand corner of the webpage, click on the “site map” to reach the 
page. 
Page 5


Biological sequence database: NCBI 
 0 
 
 
 
 
 
 
 
 
 
 
Subject : Bioinformatics 
Lesson : Biological sequence database: National Center for Biotechnology 
Information (NCBI ) 
Lesson Developer : Sandip Das 
College/ Department: Department of Botany, University of Delhi 
 
 
 
 
 
 
 
 
 
 
Biological sequence database: NCBI 
 1 
Table of Contents 
Chapter: Biological sequence database: National Center for 
Biotechnology Information (NCBI)  
? Introduction  
? Databases at NCBI 
? Literature 
? Bookshelf 
? Pubmed 
? Nucleic Acid 
? dbEST 
? dbGSS 
? dbGSS 
? Popset 
? dbGaP 
? dbVar 
o Genome 
o Taxonomy 
o PubChem 
o Expression analysis 
o Protein 
 
? Summary  
? Exercise/ Practice 
? Glossary 
? References/ Bibliography/ Further Reading 
 
 
 
 
National Center for Biotechnology Information (NCBI) 
NCBI has emerged as the primary free-to-access source of data and analysis tools in the 
field of computational biology.  The free-access nature of NCBI is possible as the policy of 
funding and publication in most countries dictates that the researcher mandatorily deposits 
the information generated using public-fund into a free-to-access central repository. In 
return, the repository (such as NCBI or EMBL) assigns a unique identification number, often 
termed as accession number, to the data that also can be used to identify the depositor and 
Biological sequence database: NCBI 
 2 
several other features.  The following section will introduce you to a variety of databases 
dealing with a wide range of disciplines. Please do note that although the data may be 
organized separately for the sake of simplicity and clarity, in reality, all the databases are 
inter-linked and can be navigated from one to the other. The databases are also associated 
with their appropriate analysis tools. 
The following section lists some of the databases that have been created at NCBI. For the 
sake of simplicity, the databases in this lesson have been divided into three sections-section 
I dealing with publication, literature and small scale DNA/RNA sequencing projects; section 
II-dealing with whole genome, epigenome, maps of genomes, taxonomy and chemical 
structures; and section III dealing with resources for RNA and protein that are required for 
“functional genomics” . These sections marked as I, II and III will be dealt in their 
respective chapters. 
 
 
Databases-I: 
Literature (PubMed, PubMed Central; NCBI Bookshelf): 
 DNA and RNA (Refseq, nucleotide, EST, GSS, WGS, PopSet, trace archive, SRA): 
Databases-II: 
Genomes (Map Viewer, Genome workbench, Plant Genome Central, Genome 
Reference Consortium, Epigenomics, Genomics Structural variation): 
Maps: 
Taxonomy: 
PubChem Substance:  
Databases-III: 
Expression analysis-GEO 
Proteins (Reference sequences, GenPept, UniProt/SwissProt, PRF, PDB, Protein 
clusters, Structure, UniGene, CDD): 
 
Entrez is the single point database search and retrieval system that allows a user to 
perform the search and retrieve action against “all” or a “specific” database in an 
interlinked manner. 
 
Biological sequence database: NCBI 
 3 
 
 
 
Figure : Various databases at NCBI can be accessed through the Entrez portal 
Source: http://www.ncbi.nlm.nih.gov/sites/gquery 
 
The National Center for Biotechnology Center (NCBI) site is conveniently organized into 
four major domains and these domains are interlinked :  
1. Databases,  
2. Tools,  
3. Data submission and  
4. Education  
 
The following figure depicts the interlinked nature of these domains and can be reached 
by 
1. Open the ncbi page by typing in www.ncbi.nlm.nih.gov in  the web browser 
2. Click the “search” button on the home page without enetering any keyword . 
3. On the  top left hand corner of the webpage, click on the “site map” to reach the 
page. 
Biological sequence database: NCBI 
 4 
 
 Figure: Various databases are organized into four major domains and are interlinked 
Source: http://www.ncbi.nlm.nih.gov/guide/sitemap/ 
 
 
Databases of NCBI 
The following section introduces you to some of the following databases at NCBI 
Databases-I: 
Literature (PubMed, PubMed Central; NCBI Bookshelf): 
 DNA and RNA (Refseq, nucleotide, EST, GSS, WGS, PopSet, trace archive, SRA): 
 
Literature:  
? Bookshelf provides free access and allows users to browse and retrieve a wealth of 
information in life sciences and healthcare. The information may be in the form of 
books documents and policy information from various government agencies and 
publishers. The bookshelf titles are organized subject-wise, by Type or by Publisher 
in a searchable or browsable format. 
Read More
16 docs

FAQs on Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1) - Bioinformatics - Botany

1. What is the National Center for Biotechnology Information (NCBI)?
Ans. The National Center for Biotechnology Information (NCBI) is a part of the United States National Library of Medicine (NLM), which is a branch of the National Institutes of Health (NIH). It is a comprehensive resource that provides access to various biological databases, including genetic sequences, proteins, and scientific literature.
2. How can the NCBI's biological sequence databases be useful for botany?
Ans. The NCBI's biological sequence databases can be extremely valuable for botany as they provide access to a vast amount of genetic information. Botanists can use these databases to study the DNA sequences of different plant species, identify genes responsible for specific traits, and compare sequences among different plants. This information can contribute to a better understanding of plant evolution, genetic diversity, and even the development of new agricultural techniques.
3. What are some of the major biological sequence databases offered by the NCBI?
Ans. The NCBI provides access to several major biological sequence databases, including GenBank, RefSeq, and the Sequence Read Archive (SRA). GenBank is a comprehensive database that contains DNA sequences from various organisms, while RefSeq provides curated and annotated sequences for reference genomes. The SRA database stores raw sequencing data, allowing researchers to access and analyze the original experimental data.
4. How can researchers submit their own biological sequences to the NCBI databases?
Ans. Researchers can submit their own biological sequences to the NCBI databases through the submission portal called BankIt. BankIt allows users to upload and submit their sequences, along with relevant metadata and documentation. The submitted sequences undergo a validation process before being incorporated into the respective databases.
5. Can I access scientific literature related to botany through the NCBI?
Ans. Yes, the NCBI provides access to a vast collection of scientific literature through its database called PubMed. PubMed contains millions of articles from various scientific journals, including those related to botany. Researchers and botany enthusiasts can search for specific articles, journals, or topics within PubMed to access relevant scientific literature for their studies or research.
16 docs
Download as PDF
Explore Courses for Botany exam
Signup for Free!
Signup to see your scores go up within 7 days! Learn & Practice with 1000+ FREE Notes, Videos & Tests.
10M+ students study on EduRev
Related Searches

study material

,

Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1) | Bioinformatics - Botany

,

video lectures

,

Sample Paper

,

practice quizzes

,

Summary

,

Important questions

,

Viva Questions

,

Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1) | Bioinformatics - Botany

,

Extra Questions

,

mock tests for examination

,

Previous Year Questions with Solutions

,

shortcuts and tricks

,

Exam

,

MCQs

,

Lecture 4 - Biological sequence database: National Center for Biotechnology Information (PART 1) | Bioinformatics - Botany

,

pdf

,

ppt

,

Semester Notes

,

Free

,

past year papers

,

Objective type Questions

;