Single Sequence Record in FASTA Format
How to read a Single Sequence Record in FASTA Format?
✍: FYIcenter.com
If you want to store additional information with a DNA or protein sequence, you can use the SeqRecord class from the Bio.SeqRecord module, which holds the sequence together with properties such as an ID, a name, a description, and a list of features.
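As a minimal sketch of those properties, the following builds a SeqRecord by hand and prints a few of its attributes. It assumes Biopython is installed; the sequence, ID, name, and description values are hypothetical and chosen only for illustration.

```python
from Bio.Seq import Seq
from Bio.SeqRecord import SeqRecord

# Hypothetical values, used only to illustrate the attributes.
record = SeqRecord(
    Seq("TGTAACGAACGG"),
    id="example_id",
    name="example_name",
    description="an illustrative record",
)

print(record.id)           # identifier string
print(record.name)         # short name
print(record.description)  # free-text description
print(len(record.seq))     # length of the stored sequence
print(record.features)     # list of features, empty when none are given
print(record.annotations)  # dictionary of annotations, empty when none are given
```

A record read from a file exposes the same attributes, so this is also a quick way to check what a given file format managed to fill in.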
We can download an example of a Sequence Record in FASTA Format.
fyicenter$ wget https://raw.githubusercontent.com/biopython/biopython/master/Tests/GenBank/NC_005816.fna
fyicenter$ ls -l NC_005816.fna
-rw-r--r--. 1 fyicenter staff 9853 Jan 27 23:55 NC_005816.fna
Then we can create a Bio.SeqRecord object with the SeqIO.read() function.
fyicenter$ python
>>> from Bio import SeqIO
>>> record = SeqIO.read("NC_005816.fna", "fasta")
>>> print(record)
ID: gi|45478711|ref|NC_005816.1|
Name: gi|45478711|ref|NC_005816.1|
Description: gi|45478711|ref|NC_005816.1| Yersinia pestis biovar Microtus str. 91001 plasmid pPCP1, complete sequence
Number of features: 0
Seq('TGTAACGAACGGTGCAATAGTGATCCACACCCAACGCCTGAAATCAGATCCAGG...CTG')
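As you can see, the FASTA format gives Biopython very little structure to parse. A record is just one header line followed by the sequence, so the ID and the name are both taken from the first word of the header, the description is the whole header line, and the number of features is 0.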
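To see why the ID and the name come out identical, here is a rough sketch of how a FASTA header line can be split. It is an illustration in plain Python, not Biopython's actual parser; the function name split_fasta_header is hypothetical, and the header string is reconstructed from the description printed above.

```python
def split_fasta_header(header_line):
    """Return (record_id, description) for one FASTA header line.

    Illustrative only: the ID is the first whitespace-delimited word,
    and the description is the whole header without the leading '>'.
    """
    title = header_line.strip()
    if title.startswith(">"):
        title = title[1:].strip()
    parts = title.split(None, 1)
    record_id = parts[0] if parts else ""
    return record_id, title


# Header reconstructed from the description shown in the output above.
header = (
    ">gi|45478711|ref|NC_005816.1| Yersinia pestis biovar Microtus "
    "str. 91001 plasmid pPCP1, complete sequence"
)
record_id, description = split_fasta_header(header)
print("ID:", record_id)
print("Description:", description)
```

With nothing else in the header to draw on, a parser has no separate source for the name, which is consistent with the ID and the name being the same in the output above.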
⇒ Single Sequence Record in GenBank Format
2023-04-04