6533b85ffe1ef96bd12c2750
RESEARCH PRODUCT
Prediction of Specific TCR-Peptide Binding From Large Dictionaries of TCR-Peptide Pairs
Shirit DvorkinHanan BesserIdo SpringerYoram LouzounNili Tickotsky-moskovitzsubject
lcsh:Immunologic diseases. AllergyComputer scienceevaluation methodsT-LymphocytesT cellImmunologyReceptors Antigen T-CellEpitopes T-LymphocyteTarget peptidePeptide bindingPeptidechemical and pharmacologic phenomenaComputational biologyLigandsSoftware implementationautoencoder (AE)AntigenEvaluation methodsmedicineImmunology and AllergyHumansProtein Interaction Domains and MotifsEpitope specificityAntigensDatabases ProteinOriginal Researchchemistry.chemical_classificationBinding SitesT cell repertoireChemistryRepertoirelong short-term memory (LSTM)T-cell receptorepitope specificitydeep learninghemic and immune systemsmedicine.anatomical_structuremachine learningPeptidesSequence motiflcsh:RC581-607SoftwareProtein BindingSignal TransductionTCR repertoire analysisdescription
Abstract The T cell repertoire is composed of T cell receptors (TCR) selected by their cognate MHC-peptides and naive TCR that do not bind known peptides. While the task of distinguishing a peptide-binding TCR from a naive TCR unlikely to bind any peptide can be performed using sequence motifs, distinguishing between TCRs binding different peptides requires more advanced methods. Such a prediction is the key for using TCR repertoires as disease-specific biomarkers. We here used large scale TCR-peptide dictionaries with state-of-the-art natural language processing (NLP) methods to produce ERGO (pEptide tcR matchinG predictiOn), a highly specific classifier to predict which TCR binds to which peptide. We successfully employed ERGO for two related tasks: discrimination between peptide binding and naive TCRs and the more complicated task of distinguishing between TCRs that bind different peptides. We show that ERGO significantly outperforms all current methods for classification of TCRs that bind peptides, but more importantly can distinguish the specific target of a TCR among a large set of peptides. The software implementation and data sets are available at: https://github.com/IdoSpringer/ERGO One Sentence Summary The combination of advanced tools from natural language processing and large-scale dictionaries of T cell receptors and their target peptide precisely predicts whether a T cell would bind a specific target.
| year | journal | country | edition | language |
|---|---|---|---|---|
| 2019-05-26 | Frontiers in Immunology |