BioTech FYI Center

Nucleotide Sequence Databases

1. Nucleotide Sequence Databases

1.1. International Nucleotide Sequence Database Collaboration

Database name
Full name and/or description
URL
DDBJ - DNA Data Bank of Japan
All known nucleotide and protein sequences
EMBL - Nucleotide Sequence Database
All known nucleotide and protein sequences
GenBank
All known nucleotide and protein sequences

1.2. DNA sequences: genes, motifs and regulatory sites

1.2.1. Coding and non-coding DNA

Database name
Full name and/or description
URL
ACLAME
A classification of genetic mobile elements
CUTG
Codon usage tabulated from GenBank
Genetic Codes
Genetic codes in various organisms and organelles
Entrez Gene
Gene-centered information at NCBI
HERVd
Human endogenous retrovirus database
Hoppsigen
Human and mouse homologous processed pseudogenes
Imprinted Gene Catalogue
Imprinted genes and parent-of-origin effects in animals
Islander
Pathogenicity islands and prophages in bacterial genomes
MICdb
Prokaryotic microsatellites
NPRD
Nucleosome positioning region database
STRBase
Short tandem DNA repeats database
TIGR Gene Indices
Organism-specific databases of EST and gene sequences
Transterm
Codon usage, start and stop signals
UniGene
Non-redundant set of eukaryotic gene-oriented clusters
UniVec
Vector sequences, adapters, linkers and primers used in DNA cloning; can be used to check for vector contamination
VectorDB
Characterization and classification of nucleic acid vectors
Xpro
Eukaryotic protein-encoding DNA sequences, both intron-containing and intron-less genes

1.2.2. Gene structure, introns and exons, splice sites

Database name
Full name and/or description
URL
ASAP
Alternative spliced isoforms
ASD
Alternative splicing database at EBI; includes three databases: AltSplice, AltExtron and AEdb
ASDB
Alternative splicing database: protein products and expression patterns of alternatively spliced genes
ASHESdb
Alternatively spliced human genes by exon skipping database
EASED
Extended alternatively spliced EST database
ECgene
Genome annotation for alternative splicing
EDAS
EST-derived alternative splicing database
ExInt
Exon-intron structure of eukaryotic genes
HS3D
Homo sapiens splice sites dataset
Intronerator
Alternative splicing in C. elegans and C. briggsae
SpliceDB
Canonical and non-canonical mammalian splice sites
SpliceInfo
Modes of alternative splicing in the human genome
SpliceNest
A tool for visualizing splicing of genes from EST data

1.2.3. Transcriptional regulator sites and transcription factors

Database name
Full name and/or description
URL
ACTIVITY
Functional DNA/RNA site activity
DBTBS
Bacillus subtilis promoters and transcription factors
DoOP
Database of orthologous promoters: chordates and plants
DPInteract
Binding sites for E. coli DNA-binding proteins
EPD
Eukaryotic promoter database
HemoPDB
Hematopoietic promoter database: transcriptional regulation in hematopoiesis
JASPAR
PSSMs for transcription factor DNA-binding sites
MAPPER
Putative transcription factor binding sites in various genomes
PLACE
Plant cis-acting regulatory DNA elements
PlantCARE
Plant promoters and cis-acting regulatory elements
PlantProm
Plant promoter sequences for RNA polymerase II
PRODORIC
Prokaryotic database of gene regulation networks
PromEC
E. coli promoters with experimentally identified transcriptional start sites
http://bioinfo.md.huji.ac.il/marg/promec
SELEX_DB
DNA and RNA binding sites for various proteins, found by systematic evolution of ligands by exponential enrichment
TESS
Transcription element search system
TRACTOR db
Transcription factors in gamma-proteobacteria database
TRANSCompel
Composite regulatory elements affecting gene transcription in eukaryotes
TRANSFAC
Transcription factors and binding sites
TRED
Transcriptional regulatory element database
TRRD
Transcription regulatory regions of eukaryotic genes
