| |
|
|
|
|
|
 |
 |
|
 |
| |
|
| |
|
|
| |
|
|
|
|
The details of our curation/data preparation processes for each of
the category are as follow: |
| |
| |
Isoform details are curated from published literatures documenting them. The literature
references can be found in each of the detail page.
|
| |
| |
|
Datasets of Somatic Mutations and Germline Mutations were downloaded from IARC
TP53 Mutation Database (version of database
downloaded: R11).
Users of the knowledgebase should refer to the Disclaimer
for IARC.
Data Processing
The following IARC records were excluded from the knowledgebase:
- Records without PubMed IDs.
- Records with ambiguous information, such as
- deletion records without known number of nucleotides deleted,
- intron mutations without exact nucleotide
mutation positions,
- records which are inconsistent with respect to the
reference
p53 nucleotide sequence
(GenBank Accession Number X54156).
IARC mutation data is represented as follows:
- Information on block deletions is stored with start
and end nucleotide positions. Hence, the mutation frequencies
include deletions that are found within these deletion blocks.
For example, a deletion of 9bps starting from nucleotide position
14,492 (codon 275) includes all records of deletions found in codons
275, 276 and 277.
- Tandem mutations are concurrent mutations found on
consecutive nucleotide positions. For such mutations,
information on the start and end nucleotide
positions correspond to the first
and second nucleotide positions of the mutation
record.
- Changes were made to the following 'Mutation Types':
- “G:C > T:A at CpG” is changed to “G:C > T:A”,
- “CC tandem” is changed to “Tandem”.
*Note : The mutation frequencies generated by the knowledgebase
differs from IARC's because
of different
data
representations (refer to FAQ).
Given the possibility of errors that could arise from the simple merger of datasets from the IARC and T. Soussi web sites, we are in the process of reviewing how best to incorporate correctly information from these two sources. As such, we are only using data from IARC for a start until this issue is resolved.
|
| |
| |
|
The p53 polymorphism data were obtained from five resources, i.e.
The International HapMap
Project,
Perlegen Science, NIEHS (National
Institute of Environmental Health Sciences), IARC (International
Agency for Research on Cancer) and other published literatures
documenting them.
Users of the p53 Knowledgebase should refer to:
The HapMap, Perlegen and NIEHS are online databases for polymorphism and
genotyping data of several populations. Information such as the
location, alleles,
frequencies and the population were extracted from these online
databases. For the NIEHS data, a location is considered polymorphic
only if the minor allele frequency is at least 1%. All the
polymorphism data are mapped to the reference p53 nucleotide sequence
(GenBank Accession Number X54156).
In addition to the information above, functional positions of the
polymorphic site (whether they are intronic, synonymous, non-synonymous
or untranslated) were also checked.
Furthermore,
short descriptions such as deletion, insertion, or
ambiguity were given.
The IARC data and other literature served as additional sources
for polymorphism data, such as alleles and positions. Information
such as the population and allelic frequencies were supplemented from other
databases whenever available.
Besides the polymorphism data, disease association relating to the polymorphism
site were also curated. Disease association information include
disease name, population studied,
association ("Yes" or "No"), additional remarks and
the literature references.
|
| |
| |
|
The p53 haplotypes were inferred from the International
HapMap Project's data using HapBlock.
HapBlock was used for block reconstruction as it is free for academic
use and provides several haplotype block and tag SNP definitions.
Users of the p53 Knowledgebase should refer to HapMap's
Guidelines for Data Use and HapBlock's
Terms of Usage. Genotyping data, frequency information, and general pedigree information
for the Caucasian and the African population is required for inferring
the haplotypes.
Currently, the 50 kb upstream and 50
kb downstream regions of the p53 gene were included for the inference
and display of haplotype blocks across the p53 gene.
Child information from
the Caucasian and African populations were excluded
from the inference. In addition, SNPs with minor allele frequency
less than 5% were removed.
Haploytype blocks were inferred using the LD based
method as described by Gabriel et. al. and
tag SNP definition by using r2 measurement as described
by Carlson et. al.
|
| |
| |
p53-related molecules data such as their SWISSPROT
ID, Entrez
Gene ID, Gene
Ontology, KEGG
pathways, Interpro
entry,
Pfam
entry and PDB entries was
obtained from the molecule's UCSC
genome browser's entry. Additional information such as the Homologene
ID and the BIND interaction
between the molecule and p53 was also included from their respective
sources.
Users of the p53 Knowledgebase should refer to the following:
|
| |
| |
|
Transcription factors information was curated from online databases (TRED,
TRANSFAC, BIND, BioCarta and KEGG)
and other published literatures documenting them (Only data from
experiments using human cell cultures were considered). Data pertaining
to p53 promoters was
obtained from TRED and TRANSFAC whereas
data pertaining to proteins interacting with transcription factors
of p53 was obtained from BIND and BioCarta.
Pathway information involving the transcription factors of p53
was obtained from KEGG.
Users
of the p53 Knowledgebase should refer to the following:
|
| |
| |
|
Target genes information was curated from online databases (TRANSFAC)
and other published literatures documenting them. Binding site sequences were
extracted for each target gene from published literatures and their nucleotide
positions verified using the UCSC
Genome Browser's May 2004 assembly. The official symbol for target
gene is obtained from Entrez
Genes. Additional relevant journal references,
if any,
are taken from Bibliography section. Additional
binding site information, if any, is obtained by following links to BIND in
the Interactions section.
Users of the p53 Knowledgebase
should refer to the following:
|
| |
| |
The protein interaction section of the Knowledgebase aims to
give a comprehensive coverage of all human proteins, human protein complexes,
viral proteins and small molecules interacting with p53 protein in human.
The
interactions
include
protein-protein bindings and post-translational modifications (phosphorylation,
NEDDylation, ubiquitination, etc).
The p53 protein interaction data is taken from AfCS-Nature
Signaling Gateway.
It consists of BIND records
with supporting literatures. Each protein interaction record in the Knowledgebase
corresponds to a single molecule
(protein , protein complex or small molecule) that interacts with the
human p53 protein. These records are supported by at least one experimental
result as documented
in published literatures.
BIND records referring to the same interaction and were merged into a single
record. Information on experimental systems, techniques and binding
sites were verified with the cited literature before being included into
our database. Extra information such as cause and regulation of interaction
as well as molecular, physiological and localization effects resulting
from the interactions (which are not found in the AfCS-Nature
Signaling Gateway and BIND) were also curated from the published literatures.
Users of the p53 Knowledgebase should refer to the following:
|
| |
| |
|
Information on the p53 antibodies was primarily obtained from
Dr. Borivoj Vojtesek, Institute
of Molecular and Cell Biology.
|
| |
| |
|
Post Translational Modifications details were curated from published literatures
documenting them. The literature references can be
found in each of the detail page.
|
| |
| |
|
The p53 Regulatory Pathways page aims to consolidate information
from various public and commercial data sources with relevance to
the p53 molecule and its pathways. This page consists of the following
sections:
This section centers on the p53 core regulatory network, constructed
based on a review article by Harris and Levine (2005). This network
is a summary of positive and negative feedback loops involving p53
and is described in more detail in the book chapter written by members
of the BioPathways Group at BII (Aguda et al., 2005). Data in this
section was obtained by doing searches from the commercially available
Pathway
Assist software. For each interaction shown in the core
network, a search of direct interactions was performed. Pathway
Assist returns a network diagram of biological associations from
the ResNet database.
This section contains pathways obtained from public databases.
The pathways listed in this page are those that involve p53 and
provide links directly to the relevant public database. Listed below
are the databases from which links are provided:
Users of the p53 Knowledgebase should refer to:
This section is a list of links to resources
that may be useful for those doing pathways analysis.
|
| |
| |
|
The PDB Structure section includes all the experimentally solved
RCSB's PDB structures that contain
at least part of the human p53 protein. Data extracted for each PDB
entry includes:
- The complex partner of the p53 peptide in the
structure
- Chains in which the p53 peptides are located.
- Mutation information about the p53 protein in
the structure.
- The annotation of the PDB entry from RCSB's
beta site.
- PubMed ID of the literature.
The start and end amino acid positions of the p53 peptide in each structure
was further verified by aligning the peptide sequence with the
full-length p53 sequence.
Users of the p53 Knowledgebase should
refer to RCSB
PDB's Advisory Notice. |
| |
| |
- Olivier M, Eeles R, Hollstein M, Khan MA, Harris CC, Hainaut P. (2002). The IARC TP53 Database: new online mutation analysis and recommendations to users. Hum Mutat. 6:607-614.
- Beroud C, Soussi T. (2003). The UMD-p53 database: new mutations
and analysis tools. Hum Mutat. 21:176-181
- The International HapMap Consortium. (2003). The International HapMap Project. Nature. 426:789-796.
- NIEHS SNPs. NIEHS Environmental
Genome Project, University of Washington, Seattle, WA (URL: http://egp.gs.washington.edu) [September 2005 accessed].
- Carlson CS, Eberle MA, Rieder MJ, Yi Q, Kruglyak L, Nickerson DA. (2004). Selecting a maximally informative set of single-nucleotide polymorphisms for association analysis using linkage disequilibrium. Am J Hum Genet. 74:106-120.
- Gabriel
SB, Schaffner SF, Ngyuen H, Moore JM, Roy J, Blumenstiel B, Higgins
J, DeFelice M, Lochner A, Faggart M. (2002). The structure of
haplotype blocks in the human genome. Science. 296:2225-2229.
- Zhang
K, Qin Z, Chen T, Liu JS, Waterman MS, Sun F. (2005).
HapBlock: haplotype block partitioning and tag SNP selection
software using a set of dynamic programming algorithms. Bioinformatics 21:131–134.
- Boeckmann
B, Bairoch A, Apweiler R, Blatter MC, Estreicher A, Gasteiger
E, Martin MJ, Michoud K, O'Donovan C, Phan I,
Pilbout S, Schneider M. (2003). The SWISS-PROT protein
knowledgebase and its supplement TrEMBL in 2003. Nucleic
Acids Res. 31:365-370
- Matys,
V., Fricke, E., Geffers, R., Gossling,
E., Haubrock, M., Hehl, R., Hornischer, K., Karas, D., Kel,
A.E., Kel-Margoulis, O.V., Kloos, D.U., Land, S., Lewicki-Potapov,
B., Michael, H., Münch, R., Reuter, I., Rotert, S., Saxel,
H., Scheer, M., Thiele, S. and Wingender, E. (2003). TRANSFAC:
transcriptional regulation, from patterns to profiles. Nucleic
Acids Res. 31: 374-378.
- Zhao
F, Xuan Z, Liu L, Zhang MQ. (2005). TRED: a Transcriptional
Regulatory Element Database and a platform for in silico
gene regulation studies. Nucleic Acids Res. 33:D103-D107.
- Alfarano
C, Andrade CE, Anthony K, Bahroos N, Bajec M, Bantoft K,
Betel D, Bobechko B, Boutilier K, Burgess E, Buzadzija K,
Cavero R, D'Abreo C, Donaldson I, Dorairajoo D, Dumontier MJ,
Dumontier MR, Earles V, Farrall R, Feldman H, Garderman E,
Gong Y, Gonzaga R, Grytsan V, Gryz E, Gu V, Haldorsen E, Halupa
A, Haw R, Hrvojic A, Hurrell L, Isserlin R, Jack F, Juma F,
Khan A, Kon T, Konopinsky S, Le V, Lee E, Ling S, Magidin M,
Moniakis J, Montojo J, Moore S, Muskat B, Ng I, Paraiso JP,
Parker B, Pintilie G, Pirone R, Salama JJ, Sgro S, Shan T,
Shu Y, Siew J, Skinner D, Snyder K, Stasiuk R, Strumpf D, Tuekam
B, Tao S, Wang Z, White M, Willis R, Wolting C, Wong S, Wrong
A, Xin C, Yao R, Yates B, Zhang S, Zheng K, Pawson T, Ouellette
BF, Hogue CW. (2005). The Biomolecular Interaction Network
Database and related tools 2005 update. Nucleic Acids Res. 33:D418-D424.
- Ogata
H, Goto S, Sato K, Fujibuchi W, Bono H, Kanehisa M. KEGG: Kyoto Encyclopedia of Genes and Genomes. (1999). Nucleic
Acids Res. 27:29-34.
- Karolchik
D, Baertsch R, Diekhans M, Furey TS, Hinrichs A, Lu YT, Roskin
KM, Schwartz M, Sugnet CW, Thomas DJ, Weber RJ,
Haussler D, Kent WJ. (2003). The
UCSC Genome Browser Database. Nucleic Acids Res. 31:51-4.
- Harris
SL, Levine AJ. (2005). The p53 pathway: positive and negative
feedback loops. Oncogene 24:2899-2908.
- Nikitin
A, Egorov S, Daraselia N, Mazo I. (2003). Pathway
studio--the analysis and navigation of molecular networks.
Bioinformatics. 19:2155-2157.
- Dahlquist
KD, Salomonis N, Vranizan K, Lawlor SC, Conklin BR.
(2002). GenMAPP, a new tool for viewing and analyzing microarray
data on biological pathways.
Nat Genet. 31:19-20.
- Joshi-Tope
G, Gillespie M, Vastrik I, D'Eustachio P, Schmidt E, de Bono
B, Jassal B, Gopinath GR, Wu GR, Matthews L, Lewis S,
Birney E, Stein L. (2005). Reactome: a knowledgebase of biological
pathways. Nucleic Acids Res. 33:D428-D432.
- Kohn
KW, Aladjem MI, Weinstein JN, Pommier Y. (2005). Molecular
Interaction Maps of Bioregulatory Networks: A General Rubric
for Systems Biology. Mol Biol Cell. [Epub
ahead of print]
-
Berman HM, Battistuz T, Bhat TN, Bluhm WF, Bourne PE, Burkhardt K, Feng Z, Gilliland GL, Iype L, Jain S, Fagan P, Marvin J, Padilla D, Ravichandran V, Schneider B, Thanki N, Weissig H, Westbrook JD, Zardecki C. (2000).
The Protein Data Bank. Nucleic Acids Res. 28:235-242.
|
|
|
| |
|
|
|
|
|