Read Sequence Alignments with Bio.AlignIO
How to read sequence alignments with the Bio.AlignIO module?
✍: FYIcenter.com
The Bio.AlignIO module allows you to read and write sequence alignments
as MultipleSeqAlignment objects.
Save the following sequence alignment as a file named PF05371_seed.faa, in FASTA format.
>COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA
>Q9T0Q8_BPIKE/1-52
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKLFKKFVSRA
>COATB_BPI22/32-83
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRLFKKFSSKA
>COATB_BPM13/24-72
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
>COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFASKA
>Q9T0Q9_BPFD/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
>COATB_BPIF1/22-73
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
Read the above sequence alignment file with the Bio.AlignIO.read() function.
fyicenter$ python
>>> from Bio import AlignIO
>>> alignment = AlignIO.read("PF05371_seed.faa", "fasta")
>>>
>>> print(alignment)
Alignment with 7 rows and 52 columns
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRL...SKA COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKL...SRA Q9T0Q8_BPIKE/1-52
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRL...SKA COATB_BPI22/32-83
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPM13/24-72
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA Q9T0Q9_BPFD/1-49
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKL...SRA COATB_BPIF1/22-73
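Once the alignment is loaded, the MultipleSeqAlignment object can be inspected row by row or column by column, and written back out in another format. The following is a minimal sketch, not part of the original session. It assumes Biopython is installed, and it inlines only three of the seven records through a StringIO handle so that it runs without PF05371_seed.faa on disk. The variable names (fasta_text, gap_counts, clustal_text) are illustrative choices.

```python
from io import StringIO

from Bio import AlignIO

# Three of the seven records from PF05371_seed.faa, inlined so the sketch
# runs without the file on disk (an assumption made for this example).
fasta_text = """\
>COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA
>COATB_BPM13/24-72
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA
>COATB_BPIF1/22-73
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
"""

# AlignIO.read() accepts an open handle as well as a file name.
alignment = AlignIO.read(StringIO(fasta_text), "fasta")

# Rows are sequences; columns are aligned positions.
num_rows = len(alignment)
num_columns = alignment.get_alignment_length()
print(num_rows, "rows,", num_columns, "columns")

# Iterating over the alignment yields one SeqRecord per row.
gap_counts = {record.id: str(record.seq).count("-") for record in alignment}
for record_id, gaps in gap_counts.items():
    print(record_id, "gaps:", gaps)

# Slicing with [:, i] returns column i as a plain string.
first_column = alignment[:, 0]
print("First column:", first_column)

# AlignIO.write() returns the number of alignments written.
out = StringIO()
written = AlignIO.write(alignment, out, "clustal")
clustal_text = out.getvalue()
print(clustal_text)
```

To work with the full file instead, replace the StringIO handle with the file name, as in AlignIO.read("PF05371_seed.faa", "fasta").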
2023-09-05