TCRpred: incorporating T-cell receptor repertoire for clinical outcome prediction

Front Genet. 2024 Mar 13:15:1345559. doi: 10.3389/fgene.2024.1345559. eCollection 2024.

Abstract

The T-cell receptor (TCR) plays a critical role in recognizing antigen peptides and mediating the adaptive immune response against disease. High-throughput technologies have enabled sequencing of the TCR repertoire at single-nucleotide resolution, allowing researchers to characterize TCR sequences in fine detail. TCR sequences provide important information about a patient's adaptive immune system and have the potential to improve clinical outcome prediction. However, incorporating TCR repertoire data into prediction is challenging because the data are unstructured and highly complex, and because TCR sequences vary widely in composition and abundance across individuals. We introduce TCRpred, an analytic tool for incorporating the TCR repertoire into clinical outcome prediction. TCRpred can utilize features that can be extracted from the TCR amino acid sequences, as well as features that are hidden in those sequences and are hard to extract. Simulation studies show that the proposed approach performs well in predicting clinical outcome and tends to be more powerful than potential alternative approaches. We apply TCRpred to real cancer datasets and demonstrate its practical utility in clinical outcome prediction.
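
To make the idea of features "extracted from the TCR amino acid sequences" concrete, the following is a minimal sketch of how one patient's repertoire might be summarized as a fixed set of numbers. It is not TCRpred's method, which the abstract does not specify. The function name `repertoire_features`, the input layout (a list of CDR3 sequence and read count pairs), and the choice of features (richness, Shannon entropy, clonality, abundance-weighted k-mer frequencies) are all assumptions made for illustration.

```python
from collections import Counter
import math


def repertoire_features(clones, k=3):
    """Summarize one repertoire (hypothetical helper, not part of TCRpred).

    clones: list of (cdr3_amino_acid_sequence, read_count) pairs.
    k: length of the amino acid motifs (k-mers) to count.
    """
    total = sum(count for _, count in clones)
    if total <= 0:
        raise ValueError("repertoire has no reads")

    # Relative abundance of each clone with at least one read.
    freqs = [count / total for _, count in clones if count > 0]
    richness = len(freqs)

    # Shannon entropy of the clone abundance distribution (natural log).
    entropy = -sum(p * math.log(p) for p in freqs)

    # Clonality = 1 - normalized entropy; a single-clone repertoire is
    # treated as fully clonal by convention.
    clonality = 1.0 - entropy / math.log(richness) if richness > 1 else 1.0

    # Abundance-weighted k-mer counts across all CDR3 sequences.
    kmers = Counter()
    for seq, count in clones:
        for i in range(len(seq) - k + 1):
            kmers[seq[i:i + k]] += count
    kmer_total = sum(kmers.values())
    kmer_freq = {m: c / kmer_total for m, c in kmers.items()} if kmer_total else {}

    return {
        "richness": richness,
        "entropy": entropy,
        "clonality": clonality,
        "kmer_freq": kmer_freq,
    }


# Toy repertoire with two equally abundant clones (made-up sequences).
features = repertoire_features([("CASSL", 2), ("CASSQ", 2)])
```

Summaries like these give every patient a vector of the same length regardless of how many distinct clones were sequenced, which is one common way to handle the wide variation in repertoire composition across individuals.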

Keywords: CDR3; T-cell receptor; TCR repertoire; clinical outcome prediction; high dimensions; high throughput sequencing.

Grants and funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by National Institutes of Health grants R01CA223498, R01CA189532, R01GM105785, and P30CA015704.