DDBJ launches a new archive database with analytical tools for next-generation sequence data

Eli Kaminuma; Jun Mashima; Yuichi Kodama; Takashi Gojobori; Osamu Ogasawara; Kousaku Okubo; Toshihisa Takagi; Yasukazu Nakamura

doi:10.1093/nar/gkp847

DDBJ launches a new archive database with analytical tools for next-generation sequence data

Nucleic Acids Res. 2010 Jan;38(Database issue):D33-8. doi: 10.1093/nar/gkp847. Epub 2009 Oct 22.

Authors

Eli Kaminuma¹, Jun Mashima, Yuichi Kodama, Takashi Gojobori, Osamu Ogasawara, Kousaku Okubo, Toshihisa Takagi, Yasukazu Nakamura

Affiliation

¹ Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization for Information and Systems, Yata, Mishima 411-8510, Japan.

Abstract

The DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) has collected and released 1,701,110 entries/1,116,138,614 bases between July 2008 and June 2009. A few highlighted data releases from DDBJ were the complete genome sequence of an endosymbiont within protist cells in the termite gut and Cap Analysis Gene Expression tags for human and mouse deposited from the Functional Annotation of the Mammalian cDNA consortium. In this period, we started a novel user announcement service using Really Simple Syndication (RSS) to deliver a list of data released from DDBJ on a daily basis. Comprehensive visualization of a DDBJ release data was attempted by using a word cloud program. Moreover, a new archive for sequencing data from next-generation sequencers, the 'DDBJ Read Archive' (DRA), was launched. Concurrently, for read data registered in DRA, a semi-automatic annotation tool called the 'DDBJ Read Annotation Pipeline' was released as a preliminary step. The pipeline consists of two parts: basic analysis for reference genome mapping and de novo assembly and high-level analysis of structural and functional annotations. These new services will aid users' research and provide easier access to DDBJ databases.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Animals
Computational Biology / methods*
Computational Biology / trends
Databases, Genetic*
Databases, Nucleic Acid*
Databases, Protein
Genome, Bacterial
Humans
Information Storage and Retrieval / methods
Internet
Japan
Software