pdbread
Read data from Protein Data Bank (PDB) file
Syntax
PDBStruct
= pdbread(File
)
PDBStruct
=
pdbread(File
, 'ModelNum', ModelNumValue
)
PDBStruct
= pdbread(File
,'TimeOut', TimeOutValue
)
Input Arguments
File | Either of the following:
Tip You can use the |
ModelNumValue | Positive integer specifying a model in a PDB-formatted file. |
TimeOutValue | Connection timeout in seconds, specified as a positive scalar. The default value is 5. For details, see here. |
Output Arguments
PDBStruct | MATLAB structure containing a field for each PDB record. |
Description
The Protein Data Bank (PDB) database is an archive of experimentally determined 3-D biological macromolecular structure data. For more information about the PDB format, see:
reads
the data from PDB-formatted text file PDBStruct
= pdbread(File
)File
and
stores the data in the MATLAB structure, PDBStruct
,
which contains a field for each PDB record. The following table summarizes
the possible PDB records and the corresponding fields in the MATLAB structure PDBStruct
:
PDB Database Record | Field in the MATLAB Structure |
---|---|
HEADER | Header |
OBSLTE | Obsolete |
TITLE | Title |
CAVEAT | Caveat |
COMPND | Compound |
SOURCE | Source |
KEYWDS | Keywords |
EXPDTA | ExperimentData |
AUTHOR | Authors |
REVDAT | RevisionDate |
SPRSDE | Superseded |
JRNL | Journal |
REMARK 1 | Remark1 |
REMARK N Note N equals 2 through 999. | Remarkn Note n equals 2 through 999. |
DBREF | DBReferences |
SEQADV | SequenceConflicts |
SEQRES | Sequence |
FTNOTE | Footnote |
MODRES | ModifiedResidues |
HET | Heterogen |
HETNAM | HeterogenName |
HETSYN | HeterogenSynonym |
FORMUL | Formula |
HELIX | Helix |
SHEET | Sheet |
TURN | Turn |
SSBOND | SSBond |
LINK | Link |
HYDBND | HydrogenBond |
SLTBRG | SaltBridge |
CISPEP | CISPeptides |
SITE | Site |
CRYST1 | Cryst1 |
ORIGXn | OriginX |
SCALEn | Scale |
MTRIXn | Matrix |
TVECT | TranslationVector |
MODEL | Model |
ATOM | Atom |
SIGATM | AtomSD |
ANISOU | AnisotropicTemp |
SIGUIJ | AnisotropicTempSD |
TER | Terminal |
HETATM | HeterogenAtom |
CONECT | Connectivity |
reads
only the model specified by PDBStruct
=
pdbread(File
, 'ModelNum', ModelNumValue
)ModelNumValue
from
the PDB-formatted text file
and
stores the data in the MATLAB structure File
PDBStruct
.
If ModelNumValue
does not correspond to
an existing mode number in File
, then pdbread
reads
the coordinate information of all the models.
sets the connection timeout (in seconds) to read data from the PDB database.PDBStruct
= pdbread(File
,'TimeOut', TimeOutValue
)
The Sequence Field
The Sequence
field is also a structure containing
sequence information in the following subfields:
NumOfResidues
ChainID
ResidueNames
— Contains the three-letter codes for the sequence residues.Sequence
— Contains the single-letter codes for the sequence residues.
Note
If the sequence has modified residues, then the ResidueNames
subfield
might not correspond to the standard three-letter amino acid codes.
In this case, the Sequence
subfield will contain
the modified residue code in the position corresponding to the modified
residue. The modified residue code is provided in the ModifiedResidues
field.
The Model Field
The Model
field is also a structure or an
array of structures containing coordinate information. If the MATLAB structure
contains one model, the Model
field is a structure
containing coordinate information for that model. If the MATLAB structure
contains multiple models, the Model
field is an
array of structures containing coordinate information for each model.
The Model
field contains the following subfields:
Atom
AtomSD
AnisotropicTemp
AnisotropicTempSD
Terminal
HeterogenAtom
The Atom Field
The Atom
field is also an array of structures
containing the following subfields:
AtomSerNo
AtomName
altLoc
resName
chainID
resSeq
iCode
X
Y
Z
occupancy
tempFactor
segID
element
charge
AtomNameStruct
— Contains three subfields:chemSymbol
,remoteInd
, andbranch
.
Examples
Use the
getpdb
function to retrieve structure information from the Protein Data Bank (PDB) for the nicotinic receptor protein with identifier1abt
, and then save the data to the PDB-formatted filenicotinic_receptor.pdb
in the MATLAB Current Folder.getpdb('1abt', 'ToFile', 'nicotinic_receptor.pdb');
Read the data from the
nicotinic_receptor.pdb
file into a MATLAB structurepdbstruct
.pdbstruct = pdbread('nicotinic_receptor.pdb');
Read only the second model from the
nicotinic_receptor.pdb
file into a MATLAB structurepdbstruct_Model2
.pdbstruct_Model2 = pdbread('nicotinic_receptor.pdb', 'ModelNum', 2);
View the atomic coordinate information in the model fields of both MATLAB structures
pdbstruct
andpdbstruct_Model2
.pdbstruct.Model ans = 1x4 struct array with fields: MDLSerNo Atom Terminal pdbstruct_Model2.Model ans = MDLSerNo: 2 Atom: [1x1205 struct] Terminal: [1x2 struct]
Read the data from a URL into a MATLAB structure,
gfl_pdbstruct
.gfl_pdbstruct = pdbread('http://www.rcsb.org/pdb/files/1gfl.pdb');
Version History
Introduced before R2006a
See Also
genpeptread
| getpdb
| pdbdistplot
| pdbsuperpose
| pdbtransform
| pdbwrite