Recent advances in neuro-imaging allowed big brain-initiatives and consortia to create vast resources of brain data that can be mined by researchers for their individual projects. Exploring the relationship between genes, brain circuitry, and behavior is one of the key elements of neuroscience research. This requires fusion of spatial connectivity data at varying scales, such as whole brain correlated gene expression, structural and functional connectivity. With ever-increasing resolution, these tend to exceed the past state-of-the art in size and complexity by several orders of magnitude. Since current analytical workflows in neuroscience involve time-consuming manual data-aggregation, incorporating efficient techniques for handling big connectivity data is a necessity. We propose a novel data structure enabling the interactive exploration of heterogeneous neurobiological connectivity data with billions of edges. Based on this data structure we realized Aggregation Queries, i.e. the aggregated connectivity from, to or between brain areas allows experts to compare the multimodal networks residing at different scales, or levels of hierarchically organized anatomical atlases. Executed on-demand on volumetric gene expression and connectivity data, they allow an interactive dissection of networks in real-time and based on their spatial context. The data structure is optimized in order to be accessible directly from the hard disk, since connectivity of large-scale networks typically exceeds the memory size of current consumer level PCs. This allows experts to embed and explore their own experimental data in the framework of public data resources without the need for their own large-scale infrastructure. Our data structure outperforms state-of-the-art graph engines in retrieving connectivity of arbitrary user defined local brain areas. We demonstrate the feasibility of our approach by analyzing fear-related functional neuroanatomy in mice. Further, we show its versatility by comparing multimodal brain networks linked to autism. Importantly, we achieve cross-species congruence in retrieving human psychiatric traits networks, which facilitates the selection of neural substrates to be further studied in mouse models.
Keywords: Aggregation queries; Big data; Brain networks; Functional connectivity; Hierarchical Parcellation; Interactive data mining; Large networks; Spatial data structures; Structural connectivity.