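Parse PDB Entry with Bio.PDB.PDBParser Module

Q

How to Parse a PDB Entry with the get_structure() function of the Bio.PDB.PDBParser module?

✍: FYIcenter.com

A

The get_structure() method of the PDBParser class in the Bio.PDB.PDBParser module allows you to parse PDB (Protein Data Bank) data files in PDB format. Files in mmCIF format are parsed the same way with the MMCIFParser class in the Bio.PDB.MMCIFParser module.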

1. Download a PDB entry, 1FAT for example, in PDB format.

fyicenter$ curl http://files.rcsb.org/view/1fat.pdb > 1fat.pdb
fyicenter$ ls -l *.pdb
-rw-r--r--. 1 fyicenter staff 662580 Jan 2 09:52 1fat.pdb

2. Parse the file with the get_structure() function.

>>> from Bio.PDB.PDBParser import PDBParser
>>> parser = PDBParser()
>>> structure = parser.get_structure("1fat", "1fat.pdb")
  .../Bio/PDB/StructureBuilder.py:89: PDBConstructionWarning: WARNING: Chain A is discontinuous at line 7975.
  warnings.warn(
  .../Bio/PDB/StructureBuilder.py:89: PDBConstructionWarning: WARNING: Chain B is discontinuous at line 7991.
  warnings.warn(
  .../Bio/PDB/StructureBuilder.py:89: PDBConstructionWarning: WARNING: Chain C is discontinuous at line 8007.
  warnings.warn(
  .../Bio/PDB/StructureBuilder.py:89: PDBConstructionWarning: WARNING: Chain D is discontinuous at line 8023.
  warnings.warn(
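
The PDBConstructionWarning messages above do not stop the parse: get_structure() still returns a structure. If they clutter a session, one option is to create the parser with QUIET=True, a PDBParser constructor argument that suppresses these construction warnings. A minimal sketch follows; the helper name load_structure_quietly is only an illustration and is not part of Biopython.

```python
from Bio.PDB.PDBParser import PDBParser


def load_structure_quietly(structure_id, pdb_file):
    """Parse a PDB-format file (path or open handle) without construction warnings.

    load_structure_quietly is a hypothetical helper name used for illustration.
    """
    # QUIET=True tells PDBParser to ignore PDBConstructionWarning messages
    # such as "Chain A is discontinuous".
    parser = PDBParser(QUIET=True)
    return parser.get_structure(structure_id, pdb_file)
```

With the file from step 1, load_structure_quietly("1fat", "1fat.pdb") should return the same kind of Structure object as in step 2, without the warning lines.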

3. Walk through the PDB structure, which is organized as structure, model, chain, residue, and atom.

>>> print(structure)
<Structure id=1fat>

>>> len(structure)
1

>>> model = structure[0]
>>> print(model)
<Model id=0>

>>> len(model)
4

>>> model.child_dict
{'A': <Chain id=A>, 'B': <Chain id=B>, 'C': <Chain id=C>, 'D': <Chain id=D>}

>>> chain = model["A"]
>>> print(chain)
<Chain id=A>

>>> len(chain)
239

>>> residues = list(chain)
>>> residue = residues[0]
>>> print(residue)
<Residue SER het=  resseq=1 icode= >

>>> len(residue)
6

>>> atoms = list(residue)
>>> atoms
[<Atom N>, <Atom CA>, <Atom C>, <Atom O>, <Atom CB>, <Atom OG>]

>>> atom = atoms[0]
>>> print("Element: {}, Mass: {}, XYZ: {}".format(atom.element, atom.mass, atom.coord))
Element: N, Mass: 14.0067, XYZ: [22.898 12.385 31.874]

4. Access a single atom by its model, chain, residue number, and atom name.

>>> atom = structure[0]["A"][100]["CA"]
>>> print("Element: {}, Mass: {}, XYZ: {}".format(atom.element, atom.mass, atom.coord))
Element: C, Mass: 12.0107, XYZ: [ 28.073 -11.331  56.355]
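
Each level in the chained lookup above is a dictionary-style access, so a missing model, chain, residue, or atom name raises KeyError. The integer 100 is shorthand for the full residue id (" ", 100, " "); hetero residues and waters carry a non-blank first field and need the full tuple. The sketch below wraps the lookup so that a missing entry returns None, and uses the subtraction operator on two Atom objects, which Biopython defines as the distance between them. The function names find_atom and atom_distance are hypothetical, and structure is assumed to be the object parsed in step 2.

```python
def find_atom(structure, chain_id, resseq, atom_name, model_id=0):
    """Return the requested Atom, or None if any level of the lookup is missing.

    find_atom is a hypothetical helper, not a Biopython function.
    """
    try:
        return structure[model_id][chain_id][resseq][atom_name]
    except KeyError:
        # Raised when the model, chain, residue, or atom name does not exist.
        return None


def atom_distance(atom_a, atom_b):
    """Return the distance between two Bio.PDB Atom objects as a plain float."""
    # Subtracting two Atom objects yields the Euclidean distance between
    # their coordinates (angstroms for PDB files).
    return float(atom_a - atom_b)
```

For example, find_atom(structure, "A", 100, "CA") should return the same atom as structure[0]["A"][100]["CA"], while a chain id that is not in the file returns None instead of raising an error.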

5. Iterate over all atoms in all residues of all chains in all models.

>>> for model in structure:
...   for chain in model:
...     for residue in chain:
...       for atom in residue:
...         print("Element: {}, Mass: {}, XYZ: {}".format(atom.element, atom.mass, atom.coord))
...
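
The nested loops in step 5 can also be flattened: a Structure object provides a get_atoms() generator that yields every atom below it. The sketch below uses it to tally atoms per element and to sum the atom masses. The function name summarize_atoms is hypothetical, and QUIET=True is used only to keep the output free of construction warnings.

```python
from collections import Counter

from Bio.PDB.PDBParser import PDBParser


def summarize_atoms(pdb_file, structure_id="structure"):
    """Return (atoms per element, total mass) for a PDB-format file or handle.

    summarize_atoms is a hypothetical helper used for illustration.
    """
    parser = PDBParser(QUIET=True)
    structure = parser.get_structure(structure_id, pdb_file)
    # get_atoms() walks all models, chains, and residues of the structure.
    counts = Counter(atom.element for atom in structure.get_atoms())
    total_mass = sum(atom.mass for atom in structure.get_atoms())
    return counts, total_mass
```

Calling summarize_atoms("1fat.pdb", "1fat") would give the element tally and total mass for the entry downloaded in step 1. Note that a structure with several models counts the atoms of every model.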

2023-05-09

All rights in the contents of this web site are reserved by the individual author. fyicenter.com does not guarantee the truthfulness, accuracy, or reliability of any contents.