Controlled vocabulary for /db_xref qualifier

A new qualifier, /db_xref, was introduced in version 1.08 (December 1, 1995) of the Feature Table definitions. This qualifier serves as a vehicle for linking DNA sequence records to external databases.

The text below outlines the format and the present list of allowed database cross references. Inquiries about the addition of other database types should be made to one of the collaborating databases, listed above.

Qualifier: /db_xref="database:identifier"

Definition: database cross-reference: pointer to related information in another database

Scope: all feature keys

Value format: "database:identifier", where database is the name of the database containing related information, and identifier is the internal identifier of the related information according to the naming conventions of the cross-referenced database.

Examples:

cross reference to GDB identifier: /db_xref="GDB:39999"

cross reference to Swiss-Prot entry: /db_xref="Swiss-Prot:P12345"
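
The value format can be checked mechanically. The following is a minimal sketch in Python, not part of the Feature Table specification: the helper name `parse_db_xref` and the regular expression are assumptions made for illustration. It splits on the first colon only, since some identifiers contain colons themselves.

```python
import re

# Illustrative pattern (an assumption, not taken from the specification):
# the database name may not contain a colon or a double quote, while the
# identifier may contain colons, so the first colon is the separator.
_DB_XREF = re.compile(r'^/db_xref="([^":]+):([^"]+)"$')


def parse_db_xref(qualifier):
    """Return (database, identifier) for one /db_xref qualifier string.

    Raises ValueError if the string is not of the form
    /db_xref="database:identifier".
    """
    match = _DB_XREF.match(qualifier.strip())
    if match is None:
        raise ValueError(f"not a well-formed /db_xref qualifier: {qualifier!r}")
    return match.group(1), match.group(2)


print(parse_db_xref('/db_xref="GDB:39999"'))       # ('GDB', '39999')
print(parse_db_xref('/db_xref="VGNC:VGNC:4927"'))  # ('VGNC', 'VGNC:4927')
```

A value with a missing closing quote, such as `/db_xref="GDB:39999`, is rejected with a `ValueError` rather than silently accepted.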

Case is significant for all database names. Any member database of the International Collaboration (DDBJ, EMBL/EBI and GenBank/NCBI) may recommend the addition or removal of databases on this list at any time, and need not wait for the release cycle of the Feature Table documentation.
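
Because case matters in database names, a checker should compare names exactly. As a convenience it can also report names that differ from a listed name only in case. The sketch below assumes a hypothetical helper, `check_database`, and uses only a small excerpt of the list in this document; a real checker would load the full, current list.

```python
# Excerpt of the controlled vocabulary, for illustration only.
ALLOWED_DATABASES = {"GDB", "GeneID", "taxon", "dbSNP", "FLYBASE", "WormBase"}

# Lower-cased lookup used only to detect case mistakes.
_BY_LOWERCASE = {name.lower(): name for name in ALLOWED_DATABASES}


def check_database(name):
    """Classify a database name as 'ok', 'wrong-case', or 'unknown'."""
    if name in ALLOWED_DATABASES:
        return "ok"          # exact, case-sensitive match
    if name.lower() in _BY_LOWERCASE:
        return "wrong-case"  # listed, but written with different case
    return "unknown"


print(check_database("taxon"))    # ok
print(check_database("Taxon"))    # wrong-case
print(check_database("flybase"))  # wrong-case
print(check_database("MyLocalDB"))  # unknown
```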

Each entry gives the database name, a description of the database, and one or more example /db_xref values.

Presently the list includes:

AceView/WormGenes AceView Worm Genome /db_xref="AceView/WormGenes:vha-6"
AFTOL Assembling the Fungal Tree of Life /db_xref="AFTOL:959"
AntWeb Ant Database /db_xref="AntWeb:CASENT0058943-D01"
APHIDBASE Aphid Genome Database /db_xref="APHIDBASE:ACYPI007424"
ApiDB Apicomplexan Database Resources /db_xref="ApiDB:cgd1_1090"
ApiDB_CryptoDB Cryptosporidium Genome Resources /db_xref="ApiDB_CryptoDB:cgd7_20"
ApiDB_PlasmoDB Plasmodium Genome Resources /db_xref="ApiDB_PlasmoDB:PF11_0344"
ApiDB_ToxoDB Toxoplasma Genome Resources /db_xref="ApiDB_ToxoDB:49.m00014"
Araport Arabidopsis Information Portal /db_xref="Araport:AT1G01010"
ASAP A Systematic Annotation Package for Community Analysis of Genomes /db_xref="ASAP:ABE-0000006"
ATCC American Type Culture Collection database /db_xref="ATCC:123456"
ATCC(in host) American Type Culture Collection database /db_xref="ATCC(in host):123456"
ATCC(dna) American Type Culture Collection database /db_xref="ATCC(dna):123456"
Axeldb A Xenopus laevis database /db_xref="Axeldb:32B3.1"
BDGP_EST Berkeley Drosophila Genome Project EST database /db_xref="BDGP_EST:123456"
BDGP_INS Berkeley Drosophila Genome Project database - Insertion /db_xref="BDGP_INS:123456"
BEEBASE BeeBase /db_xref="BEEBASE:GB55480"
BEETLEBASE Tribolium Genome Database - Insertion /db_xref="BEETLEBASE:TC030551"
BEI BEI Resources /db_xref="BEI:NR-50065"
BGD Bovine Genome Database /db_xref="BGD:BT10004"
BOLD Barcode of Life database /db_xref="BOLD:EPAF263"
CABRI Common Access to Biological Resources and Information project /db_xref="CABRI:ACC 424"
CCAP Culture Collection of Algae and Protozoa /db_xref="CCAP:1460/15"
CDD Conserved Domain Database /db_xref="CDD:02194"
CGD Candida Genome Database /db_xref="CGD:CAL0005934"
dbEST EST database maintained at the NCBI. /db_xref="dbEST:123456", /db_xref="dbEST:BP535535"
dbProbe NCBI Probe database, a public registry of nucleic acid reagents /db_xref="dbProbe:38"
dbSNP Variation database maintained at the NCBI. /db_xref="dbSNP:4647", /db_xref="dbSNP:rs133073"
dbSTS STS database maintained at the NCBI. /db_xref="dbSTS:456789", /db_xref="dbSTS:BV210161"
dictyBase Dictyostelium genome database /db_xref="dictyBase:DDB0191090"
EcoGene Database of Escherichia coli Sequence and Function /db_xref="EcoGene:EG11277"
ECOCYC EcoCyc E. coli database /db_xref="ECOCYC:sroC", /db_xref="ECOCYC:C0343"
ENSEMBL Database of automatically annotated genomic data /db_xref="ENSEMBL:HUMAN-Clone-AC005612", /db_xref="ENSEMBL:HUMAN-Gene-ENSG00000007102"
EnsemblGenomes Extending Ensembl across the taxonomic space /db_xref="EnsemblGenomes:AAC73116", /db_xref="EnsemblGenomes:b0005"
EPD Eukaryotic Promoter Database /db_xref="EPD:EP00576"
ERIC Enteropathogen Resource Integration Center /db_xref="ERIC:ABY-0246137"
ESTLIB EBI's EST library identifier /db_xref="ESTLIB:1200"
FANTOM_DB Database of Functional Annotation of Mouse /db_xref="FANTOM_DB:0610005A07"
FBOL International Fungal Working Group Fungal Barcoding /db_xref="FBOL:2224"
FLYBASE Database of genetic and molecular data of Drosophila. /db_xref="FLYBASE:FBgn0000024"
Fungorum Index Fungorum /db_xref="Fungorum:ID550186"
GABI Network of Different Plant Genomic Research Projects /db_xref="GABI:HA05J18"
GDB Human Genome Database accession numbers /db_xref="GDB:G00-128-600"
GeneDB Curated gene database for Schizosaccharomyces pombe, Leishmania major and Trypanosoma brucei /db_xref="GeneDB:SPCC285.16c"
GeneID Entrez Gene Database (replaces NCBI Locus Link) /db_xref="GeneID:3054987"
GI GenInfo identifier, used as a unique identifier for nucleotide and protein sequences /db_xref="GI:1234567890"
GO Gene Ontology Database identifier /db_xref="GO:123"
GOA Gene Ontology Annotation Database Identifier /db_xref="GOA:P01100"
Greengenes 16S rRNA gene database /db_xref="Greengenes:269185"
GRIN Germplasm Resources Information Network /db_xref="GRIN:1005973"
HGNC Human Gene Nomenclature Database /db_xref="HGNC:2041"
H-InvDB H-Invitational Database /db_xref="H-InvDB:HIT000000001", /db_xref="H-InvDB:HIX0000001"
HMP Human Microbiome Project /db_xref="HMP:0536"
HOMD Human Oral Microbiome Database /db_xref="HOMD:tax_078", /db_xref="HOMD:seq_1603"
HPM Human Proteome Map /db_xref="HPM:8106"
HSSP Database of homology-derived secondary structure of proteins /db_xref="HSSP:12GS"
IKMC International Knockout Mouse Consortium /db_xref="IKMC:66303"
IMGT/GENE-DB Immunogenetics database, immunoglobulin and T-cell receptor genes /db_xref="IMGT/GENE-DB:IGKC"
IMGT/LIGM Immunogenetics database, immunoglobulins and T-cell receptors /db_xref="IMGT/LIGM:U03895"
IMGT/HLA Immunogenetics database, human MHC /db_xref="IMGT/HLA:HLA00031"
InterPro InterPro protein sequence database /db_xref="InterPro:IPR002928"
IntrepidBio Intrepid BioInformatics /db_xref="IntrepidBio:5259707746"
IRD Influenza Research Database /db_xref="IRD:CEIRS-CIP045-123456.2"
ISFinder Insertion sequence elements database /db_xref="ISFinder:ISA1083-2"
ISHAM-ITS ITS reference database for pathogenic fungi /db_xref="ISHAM-ITS:MITS310"
JCM Japan Collection of Microorganisms /db_xref="JCM:1339"
JGIDB JGI Genome Portal /db_xref="JGIDB:Chluvu1_81011"
JGI's Phytozome Comparative genomics of plants /db_xref="Phytozome:Glyma0021s00410", /db_xref="Phytozome:POPTR_1446s00200"
LocusID NCBI LocusLink ID (discontinued March 2005) /db_xref="LocusID:51199"
MaizeGDB Maize Genome Database unique identifiers /db_xref="MaizeGDB:635633"
MarpolBase Genome Database for Marchantia polymorpha /db_xref="MarpolBase:Mp1g00010.1"
MedGen Human Medical Genetics /db_xref="MedGen:C0010674"
MGI Mouse Genome Informatics /db_xref="MGI:1894891"
MIM Mendelian Inheritance in Man numbers /db_xref="MIM:123456"
miRBase The microRNA database /db_xref="miRBase:MI0001857"
MycoBank Fungal Databases, Nomenclature and Species Banks /db_xref="MycoBank:MB519473"
NBRC NITE Biological Resource Center /db_xref="NBRC:3189"
NextDB Nematode Expression Pattern DataBase /db_xref="NextDB:CELK01662"
niaEST NIA Mouse cDNA Project /db_xref="niaEST:L0304H12-3"
NMPDR National Microbial Pathogen Data Resource /db_xref="NMPDR:fig|306254.1.peg.183"
NRESTdb Natural Rubber EST database /db_xref="NRESTdb:Y01A01"
OrthoMCL Ortholog Groups of Protein Sequences /db_xref="OrthoMCL:OG5_130679"
Osa1 Rice Genome Annotation Project /db_xref="Osa1:LOC_Os01g12345"
Pathema Pathema Genome Resource /db_xref="Pathema:BA_4405", /db_xref="Pathema:191218"
PBmice PiggyBac Mutagenesis Information Center /db_xref="PBmice:38"
PDB Biological macromolecule three dimensional structure database /db_xref="PDB:12GS"
PFAM Collection of protein families /db_xref="PFAM:PF00003"
PGN Plant Genome Network /db_xref="PGN:aam01-1ms3-a05"
PIR Protein Information Resource accession numbers /db_xref="PIR:S12345"
PomBase Database of Structural and Functional Data for Schizosaccharomyces pombe /db_xref="PomBase:SPBC1709.20"
PSEUDO EMBL pseudo protein identifier /db_xref="PSEUDO:CAC44644.1"
PseudoCap Pseudomonas Genome Database /db_xref="PseudoCap:PA0001"
RAP-DB Rice Annotation Project Database /db_xref="RAP-DB:Os01g1234567"
RATMAP Rat Genome Database /db_xref="RATMAP:5"
RBGE_garden Royal Botanic Garden Edinburgh Living Collections /db_xref="RBGE_garden:20021433"
RBGE_herbarium Royal Botanic Garden Edinburgh Herbarium /db_xref="RBGE_herbarium:E00217291"
RFAM RNA families database of alignments and CMs /db_xref="RFAM:RF00230"
RGD Rat Genome Database /db_xref="RGD:620528"
RiceGenes Rice database accession numbers /db_xref="RiceGenes:AA231856"
RNAcentral The non-coding RNA sequence database /db_xref="RNAcentral:URS00001B9622"
RZPD Resource Centre Primary Database Clone Identifiers /db_xref="RZPD:IMAGp998I142450Q6"
SEED The SEED Database /db_xref="SEED:fig|83331.1.peg.1"
SGD Saccharomyces Genome Database /db_xref="SGD:L0000470"
SGN SOL Genomics Network /db_xref="SGN:E553090"
SK-FST Saskatoon Arabidopsis T-DNA mutant population - SK Collection /db_xref="SK-FST:FST:SK32219"
SoyBase Glycine max Genome Database /db_xref="SoyBase:Satt005"
SRPDB Signal Recognition Particle Database /db_xref="SRPDB:Arth.aure._CP000474.fasta"
SubtiList Bacillus subtilis genome sequencing project /db_xref="SubtiList:BG10001"
taxon NCBI's taxonomic identifier /db_xref="taxon:4932"
TAIR The Arabidopsis Information Resource /db_xref="TAIR:AT1F51370"
TIGRFAM TIGR protein families /db_xref="TIGRFAM:TIGR00094"
TubercuList TubercuList knowledge base /db_xref="TubercuList:Rv3322c"
UNILIB Unified Library Database, a library-level view of the EST and SAGE libraries present in dbEST, UniGene and SAGEmap /db_xref="UNILIB:1002"
UniProtKB/Swiss-Prot Section of the UniProt Knowledgebase containing annotated records, which include curator-evaluated computational analysis as well as information extracted from the literature /db_xref="UniProtKB/Swiss-Prot:P12345"
UniProtKB/TrEMBL Section of the UniProt Knowledgebase containing computationally analysed records waiting for full manual annotation /db_xref="UniProtKB/TrEMBL:Q00177"
UniSTS Database of Sequence Tagged Sites /db_xref="UniSTS:486599"
UNITE Molecular database for the identification of fungi /db_xref="UNITE:UDB000157"
VBASE2 Integrative database of germ-line V genes from the immunoglobulin loci of human and mouse /db_xref="VBASE2:humIGKV165"
VectorBase Bioinformatics Resource Center for Invertebrate Vectors of Human Pathogens /db_xref="VectorBase:ENSANGG00000007825"
VGNC Vertebrate Gene Nomenclature Committee /db_xref="VGNC:VGNC:4927"
ViPR Virus Pathogen Resource /db_xref="ViPR:HRV-A34_p1058_sR263_2008"
WorfDB C. elegans ORFeome cloning project /db_xref="WorfDB:pos-1"
WormBase Caenorhabditis elegans Genome Database /db_xref="WormBase:R13H7"
Xenbase Xenopus laevis and tropicalis biology and genomics resource /db_xref="Xenbase:XB-GENE-1019547"
ZFIN Zebrafish Information Network /db_xref="ZFIN:ZDB-GENE-011205-17"

Revised October 30, 2014