Collections:
Read Sequence Alignments with Bio.AlignIO
How to Read Sequence Alignments with Bio.AlignIO package?
✍: FYIcenter.com
Bio.AlignIO module allows you to read and write Sequence Alignments
as MultipleSeqAlignment objects.
Enter the following sequence alignment file, PF05371_seed.faa, in FASTA format.
>COATB_BPIKE/30-81 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRLFKKFSSKA >Q9T0Q8_BPIKE/1-52 AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKLFKKFVSRA >COATB_BPI22/32-83 DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRLFKKFSSKA >COATB_BPM13/24-72 AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPZJ2/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFASKA >Q9T0Q9_BPFD/1-49 AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKLFKKFTSKA >COATB_BPIF1/22-73 FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKLFKKFVSRA
Read the above sequence alignment file with the Bio.AlignIO.read() function.
fyicenter$ python
>>> from Bio import AlignIO
>>> alignment = AlignIO.read("PF05371_seed.faa", "fasta")
>>>
>>> print(alignment)
Alignment with 7 rows and 52 columns
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIRL...SKA COATB_BPIKE/30-81
AEPNAATNYATEAMDSLKTQAIDLISQTWPVVTTVVVAGLVIKL...SRA Q9T0Q8_BPIKE/1-52
DGTSTATSYATEAMNSLKTQATDLIDQTWPVVTSVAVAGLAIRL...SKA COATB_BPI22/32-83
AEGDDP---AKAAFNSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPM13/24-72
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA COATB_BPZJ2/1-49
AEGDDP---AKAAFDSLQASATEYIGYAWAMVVVIVGATIGIKL...SKA Q9T0Q9_BPFD/1-49
FAADDATSQAKAAFDSLTAQATEMSGYAWALVVLVVGATVGIKL...SRA COATB_BPIF1/22-73
⇒ Calculate Substitutions in Alignments
⇐ Scan Prosite Databas with Bio.ExPASy.ScanProsite.scan()
2023-09-05, 970🔥, 0💬
Popular Posts:
Molecule Summary: ID: FYI-1003951 Names: InChIKey: FAUPJUJQOJXWNE-UHFFFAOYS A-NSMILES: CCc4nc(C(N)=O...
Molecule Summary: ID: FYI-1003951 Names: InChIKey: FAUPJUJQOJXWNE-UHFFFAOYS A-NSMILES: CCc4nc(C(N)=O...
Molecule Summary: ID: FYI-1003721 Names: InChIKey: YGTXGNMTLDKYGX-HMMYKYKNS A-NSMILES: Cc2nn(C(=O)c1...
Molecule Summary: ID: FYI-1006077 Names: InChIKey: IMEXUOSFWFZJGE-QWQFASRJS A-NSMILES: C[C@H]6CCC=[N...
Molecule Summary: ID: FYI-1002316 Names: InChIKey: XFPMHRMLIAVJRI-UHFFFAOYS A-MSMILES: O=C([O-])c1cc...