Update README.md
Browse files
README.md
CHANGED
|
@@ -5,8 +5,42 @@
|
|
| 5 |
Targeting specific functional motifs, whether conserved viral epitopes, intrinsically disordered regions (IDRs), or fusion breakpoints, is essential for modulating protein function and protein-protein interactions (PPIs). Current design methods, however, depend on stable tertiary structures, limiting their utility for disordered or dynamic targets. Here, we present a motif-specific PPI targeting algorithm (moPPIt), a framework for the de novo generation of motif-specific peptide binders derived solely from target sequence data. The core of this approach is BindEvaluator, a transformer architecture that interpolates protein language model embeddings to predict peptide-protein binding site interactions with high accuracy (AUC = 0.97). We integrate this predictor into a novel Multi-Objective-Guided Discrete Flow Matching (MOG-DFM) framework, which steers generative trajectories toward peptides that simultaneously maximize binding affinity and motif specificity. After comprehensive in silico validation of binding and motif-specific targeting, we validate moPPIt in vitro by generating binders that strictly discriminate between the FN3 and IgG domains of NCAM1, confirming domain-level specificity, and further demonstrate precise targeting of IDRs by generating binders specific to the N-terminal disordered domain of β-catenin. In functional, disease-relevant assays, moPPIt-designed peptides targeting the GM-CSF receptor effectively block macrophage polarization. Finally, we demonstrate therapeutic utility in cell engineering, where binders directed against the tumor antigen AGR2 drive specific CAR T regulatory cell activation. In total, moPPIt serves as a purely sequence-based paradigm for controllably targeting the "undruggable" and disordered proteome.
|
| 6 |
|
| 7 |
## Google Colab Notebooks
|
| 8 |
-
We provide
|
| 9 |
|
| 10 |
-
**Colab
|
| 11 |
|
| 12 |
-
**Colab
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 5 |
Targeting specific functional motifs, whether conserved viral epitopes, intrinsically disordered regions (IDRs), or fusion breakpoints, is essential for modulating protein function and protein-protein interactions (PPIs). Current design methods, however, depend on stable tertiary structures, limiting their utility for disordered or dynamic targets. Here, we present a motif-specific PPI targeting algorithm (moPPIt), a framework for the de novo generation of motif-specific peptide binders derived solely from target sequence data. The core of this approach is BindEvaluator, a transformer architecture that interpolates protein language model embeddings to predict peptide-protein binding site interactions with high accuracy (AUC = 0.97). We integrate this predictor into a novel Multi-Objective-Guided Discrete Flow Matching (MOG-DFM) framework, which steers generative trajectories toward peptides that simultaneously maximize binding affinity and motif specificity. After comprehensive in silico validation of binding and motif-specific targeting, we validate moPPIt in vitro by generating binders that strictly discriminate between the FN3 and IgG domains of NCAM1, confirming domain-level specificity, and further demonstrate precise targeting of IDRs by generating binders specific to the N-terminal disordered domain of β-catenin. In functional, disease-relevant assays, moPPIt-designed peptides targeting the GM-CSF receptor effectively block macrophage polarization. Finally, we demonstrate therapeutic utility in cell engineering, where binders directed against the tumor antigen AGR2 drive specific CAR T regulatory cell activation. In total, moPPIt serves as a purely sequence-based paradigm for controllably targeting the "undruggable" and disordered proteome.
|
| 6 |
|
| 7 |
## Google Colab Notebooks
|
| 8 |
+
We provide two Google Colab notebooks to help you run and evaluate moPPIt without any local setup:
|
| 9 |
|
| 10 |
+
- **moPPIt Colab** (Generating motif-specific binders while optimizing other therapeutic-related properties): [Link](https://colab.research.google.com/drive/16n8PIwKwAiG-oDLm171BWvv-lQH0dHMg?usp=sharing)
|
| 11 |
|
| 12 |
+
- **PeptiDerive Colab** (Computing Relative Interaction Scores (RIS) for residues on the target protein): [Link](https://colab.research.google.com/drive/1aCODZ-WRwhxr-u8nEB6ZrdrhIOTz7-UF?usp=sharing)
|
| 13 |
+
|
| 14 |
+
## Command-line Usage
|
| 15 |
+
You can also run **moPPIt** and **BindEvaluator** from the command line.
|
| 16 |
+
|
| 17 |
+
### Run moPPIt
|
| 18 |
+
|
| 19 |
+
Example command:
|
| 20 |
+
```
|
| 21 |
+
python -u moo.py \
|
| 22 |
+
--output_file './samples.csv' \
|
| 23 |
+
--length 10 \
|
| 24 |
+
--n_batches 600 \
|
| 25 |
+
--weights 1 1 1 4 4 2 \
|
| 26 |
+
--motifs '16-31,62-79' \
|
| 27 |
+
--motif_penalty \
|
| 28 |
+
--objectives Hemolysis Non-Fouling Half-Life Affinity Motif Specificity \
|
| 29 |
+
--target_protein MHVPSGAQLGLRPDLLARRRLKRCPSRWLCLSAAWSFVQVFSEPDGFTVIFSGLGNNAGGTMHWNDTRPAHFRILKVVLREAVAECLMDSYSLDVHGGRRTAAG
|
| 30 |
+
```
|
| 31 |
+
|
| 32 |
+
### Run BindEvaluator
|
| 33 |
+
BindEvaluator predicts the binding sites on the target protein, given a target protein seqeunce and a binder sequence.
|
| 34 |
+
|
| 35 |
+
Example command:
|
| 36 |
+
```
|
| 37 |
+
python -u bindevaluator.py \
|
| 38 |
+
-target MHVPSGAQLGLRPDLLARRRLKRCPSRWLCLSAAWSFVQVFSEPDGFTVIFSGLGNNAGGTMHWNDTRPAHFRILKVVLREAVAECLMDSYSLDVHGGRRTAAG \
|
| 39 |
+
-binder YVEICRCVVC \
|
| 40 |
+
-sm ./classifier_ckpt/finetuned_BindEvaluator.ckpt \
|
| 41 |
+
-n_layers 8 \
|
| 42 |
+
-d_model 128 \
|
| 43 |
+
-d_hidden 128 \
|
| 44 |
+
-n_head 8 \
|
| 45 |
+
-d_inner 64
|
| 46 |
+
```
|