Computational drug- and compound-centered analyses often require mapping attributes across multiple drug databases. In this study, we provide a Neo4j repository that integrates two of the most prominent open-source drug databases, DrugBank and ChEMBL, with the goal of establishing an integrated data visualization and analysis tool for drug discovery studies. The drugs present in DrugBank are mapped to their counterparts in ChEMBL. Integrating these resources and harmonizing them as a knowledge graph serialized in Neo4j leads to the identification of relationships between drugs and related features that are otherwise spread across two different resources. A common data format, a prerequisite for populating the Neo4j database, enables users to identify new relationships central to drug discovery research, such as Drug Target Interactions (DTI). The resource is freely available at: https://github.com/ambf0632/CompoundDB4j.
Keywords: ChEMBL; Cheminformatics; Data integration; DrugBank; Neo4j.
© 2020 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
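As an illustration of the mapping step described in the abstract, the following is a minimal sketch and not the authors' implementation. It assumes that DrugBank and ChEMBL records can be joined on a shared structure key (the field name `inchikey` is an assumption), and it emits parameterized Cypher `MERGE` statements. The node labels `Drug` and `Compound`, the relationship type `SAME_AS`, and all identifiers in the sample records are hypothetical placeholders.

```python
from typing import Dict, List, Tuple

# Hypothetical placeholder records; real exports carry many more fields.
DRUGBANK: List[Dict[str, str]] = [
    {"drugbank_id": "DB_EXAMPLE_1", "inchikey": "KEY-A"},
    {"drugbank_id": "DB_EXAMPLE_2", "inchikey": "KEY-B"},
]
CHEMBL: List[Dict[str, str]] = [
    {"chembl_id": "CHEMBL_EXAMPLE_1", "inchikey": "KEY-A"},
    {"chembl_id": "CHEMBL_EXAMPLE_3", "inchikey": "KEY-C"},
]

# Assumed schema: labels and relationship type are illustrative only.
CYPHER = (
    "MERGE (d:Drug {drugbank_id: $drugbank_id}) "
    "MERGE (c:Compound {chembl_id: $chembl_id}) "
    "MERGE (d)-[:SAME_AS]->(c)"
)


def map_drugs(
    drugbank: List[Dict[str, str]], chembl: List[Dict[str, str]]
) -> List[Tuple[str, str]]:
    """Pair DrugBank and ChEMBL records that share the same structure key."""
    # Index ChEMBL records by key so each DrugBank lookup is O(1).
    by_key = {rec["inchikey"]: rec["chembl_id"] for rec in chembl}
    return [
        (rec["drugbank_id"], by_key[rec["inchikey"]])
        for rec in drugbank
        if rec["inchikey"] in by_key
    ]


def to_cypher(pairs: List[Tuple[str, str]]) -> List[Tuple[str, Dict[str, str]]]:
    """Turn matched pairs into (query, parameters) tuples for a Neo4j session."""
    return [
        (CYPHER, {"drugbank_id": db_id, "chembl_id": ch_id})
        for db_id, ch_id in pairs
    ]


pairs = map_drugs(DRUGBANK, CHEMBL)
statements = to_cypher(pairs)
```

Each `(query, parameters)` tuple could then be passed to a Neo4j session for execution. Using `MERGE` rather than `CREATE` keeps repeated loads from duplicating nodes or relationships.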