Single Sequence Record in FASTA Format

Q

How to read a Single Sequence Record in FASTA Format?

✍: FYIcenter.com

A

If you want to store additional information to a DNA or protein sequence, you can use the Bio.SeqRecord class, which contains the following properties:

seq – The sequence itself as a Seq object.
id – The primary ID used to identify the sequence.
name – A “common” name for the sequence.
description – A human readable description for the sequence.
letter annotations – Holds per-letter-annotations using a dictionary of additional information about the letters in the sequence.
annotations – A dictionary of additional information about the sequence.
features – A list of SeqFeature objects with more structured information about the features on the sequence.
dbxrefs - A list of database cross-references for the sequence.

We can download an example of a Sequence Record in FASTA Format.

fyicenter$ wget https://raw.githubusercontent.com/biopython/biopython/master/Tests/GenBank/NC_005816.fna

-rw-r--r--. 1 fyicenter staff  9853 Jan 27 23:55 NC_005816.fna

Then we can create a Bio.SeqRecord object with the SeqIO.read() function.

fyicenter$ python 
>>> from Bio import SeqIO

>>> record = SeqIO.read("NC_005816.fna", "fasta")
>>> print(record)
ID: gi|45478711|ref|NC_005816.1|
Name: gi|45478711|ref|NC_005816.1|
Description: gi|45478711|ref|NC_005816.1| Yersinia pestis biovar Microtus str. 91001 plasmid pPCP1, complete sequence
Number of features: 0
Seq('TGTAACGAACGGTGCAATAGTGATCCACACCCAACGCCTGAAATCAGATCCAGG...CTG')

As you can see, the FASTA format does not provide enough properties and a good structure for Biopython to parse from.

⇒ Single Sequence Record in GenBank Format

⇐ What Are Translation Tables

⇑ Biopython - Tools for Biological Computation

⇑⇑ OBF (Open Bioinformatics Foundation) Tools

2023-04-04, 812🔥, 0💬

Install Biopython
How to install Biopython? The easiest way to install Biopython is to use the "pip" command as shown below. 1. Make sure that Python 3 is installed. fyicenter$ python --version Python 3.8.8 2. Install Biopython. fyicenter$ pip install biopython Requirement already satisfied: numpy in ... Installing c... 2023-02-04, 939🔥, 0💬

mRNA, Protein and Translation
How to derive protein sequence from a mRNA sequence? Biologically, the protein sequence is produced by a translation process from a mRNA sequence. We can simulate this biological translation process using the translation() function. fyicenter$ python >>> from Bio.Seq import Seq ... 2023-03-17, 896🔥, 0💬

What Are Translation Tables
What Are Translation Tables? Translation tables, also called codon tables, are conversion tables that map 3-nucleobase combinations into amino acids to form protein sequences. It is known that all organisms do not use exactly the same translation table. But they vary from a standard translation tabl... 2023-03-17, 881🔥, 0💬

Retrieve Record Summary with Bio.Entrez.esummary()
How to Retrieve Record Summary with Bio.Entrez.esummary() function? Bio.Entrez.esummary() function allows you to retrieve summary information of a given record in a given NCBI Databas. It uses the Entrez Web services provided by www.ncbi.nlm.nih.gov. 1. Retrieve summary information from the "nlmcata... 2023-08-09, 841🔥, 0💬

List NCBI Databases with Bio.Entrez.einfo()
How to List NCBI Databases with Bio.Entrez.einfo() function? Bio.Entrez.einfo() function allows you to list NCBI Databases and their related information. It uses the Entrez Web services provided by www.ncbi.nlm.nih.gov. 1. Get a list of NCBI databases. fyicenter$ python >>> from... 2023-07-08, 825🔥, 0💬

Single Sequence Record in FASTA Format
How to read a Single Sequence Record in FASTA Format? If you want to store additional information to a DNA or protein sequence, you can use the Bio.SeqRecord class, which contains the following properties: seq – The sequence itself as a Seq object. id – The primary ID used to identify the sequence. ... 2023-04-04, 813🔥, 0💬

Single Sequence Record in GenBank Format
How to read a Single Sequence Record in GenBank Format? The GenBank format for DNA or protein sequences contains more properties and a better structure that FASTA format. You can follow these steps to download GenBank file example and create a Bio.SeqRecord object. 1. Download an example of a Sequen... 2023-04-04, 801🔥, 0💬

Play with the Bio.Seq Module
How to import the Bio.Seq module and use its functions? Here are some examples on how to import the Bio.Seq module and use its functions. 1. Import the Bio.Seq module and create a Bio.Seq object. fyicenter$ python >>> from Bio.Seq import Seq >>> my_seq = Seq... 2023-02-04, 756🔥, 0💬

Search NCBI Databases with Bio.Entrez.esearch()
How to Search NCBI Databases with Bio.Entrez.esearch() function? Bio.Entrez.esearch() function allows you to search NCBI Databases with a given criteria. It uses the Entrez Web services provided by www.ncbi.nlm.nih.gov. 1. Search in PubMed for publications that include Biopython in their title. It r... 2023-08-09, 727🔥, 0💬

All rights in the contents of this web site are reserved by the individual author. fyicenter.com does not guarantee the truthfulness, accuracy, or reliability of any contents.