Opening the Black Box: Interpretable Machine Learning for Geneticists

Christina B Azodi; Jiliang Tang; Shin-Han Shiu

doi:10.1016/j.tig.2020.03.005

Trends Genet. 2020 Jun;36(6):442-455. doi: 10.1016/j.tig.2020.03.005. Epub 2020 Apr 17.

Authors

Christina B Azodi¹, Jiliang Tang², Shin-Han Shiu³

Affiliations

¹ Department of Plant Biology, Michigan State University, East Lansing, MI, USA; Bioinformatics and Cellular Genomics, St. Vincent's Institute of Medical Research, Fitzroy, Victoria, Australia. Electronic address: [email protected].
² Department of Computer Science and Engineering, Michigan State University, East Lansing, MI, USA.
³ Department of Plant Biology, Michigan State University, East Lansing, MI, USA; Department of Computational Mathematics, Science, and Engineering, Michigan State University, East Lansing, MI, USA. Electronic address: [email protected].

PMID: 32396837

Abstract

Because of its ability to find complex patterns in high-dimensional and heterogeneous data, machine learning (ML) has emerged as a critical tool for making sense of the growing amount of genetic and genomic data available. While the complexity of ML models is what makes them powerful, it also makes them difficult to interpret. Fortunately, efforts to develop approaches that make the inner workings of ML models understandable to humans have improved our ability to gain novel biological insights. Here, we discuss the importance of interpretable ML, different strategies for interpreting ML models, and examples of how these strategies have been applied. Finally, we identify challenges and promising future directions for interpretable ML in genetics and genomics.

Keywords: deep learning; interpretable machine learning; predictive biology.

Publication types

Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Review

MeSH terms

Computational Biology / methods*
Genetics, Medical*
Genetics, Population*
Genome, Human*
Humans
Machine Learning*