AlienChen commited on
Commit
ebcb28d
·
verified ·
1 Parent(s): 2895fab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +37 -3
README.md CHANGED
@@ -5,8 +5,42 @@
5
  Targeting specific functional motifs, whether conserved viral epitopes, intrinsically disordered regions (IDRs), or fusion breakpoints, is essential for modulating protein function and protein-protein interactions (PPIs). Current design methods, however, depend on stable tertiary structures, limiting their utility for disordered or dynamic targets. Here, we present a motif-specific PPI targeting algorithm (moPPIt), a framework for the de novo generation of motif-specific peptide binders derived solely from target sequence data. The core of this approach is BindEvaluator, a transformer architecture that interpolates protein language model embeddings to predict peptide-protein binding site interactions with high accuracy (AUC = 0.97). We integrate this predictor into a novel Multi-Objective-Guided Discrete Flow Matching (MOG-DFM) framework, which steers generative trajectories toward peptides that simultaneously maximize binding affinity and motif specificity. After comprehensive in silico validation of binding and motif-specific targeting, we validate moPPIt in vitro by generating binders that strictly discriminate between the FN3 and IgG domains of NCAM1, confirming domain-level specificity, and further demonstrate precise targeting of IDRs by generating binders specific to the N-terminal disordered domain of β-catenin. In functional, disease-relevant assays, moPPIt-designed peptides targeting the GM-CSF receptor effectively block macrophage polarization. Finally, we demonstrate therapeutic utility in cell engineering, where binders directed against the tumor antigen AGR2 drive specific CAR T regulatory cell activation. In total, moPPIt serves as a purely sequence-based paradigm for controllably targeting the "undruggable" and disordered proteome.
6
 
7
  ## Google Colab Notebooks
8
- We provide three Google Colab Notebooks so that you can use and evaluate moPPIt easily.
9
 
10
- **Colab Notebook for moPPIt**: [Link](https://colab.research.google.com/drive/16n8PIwKwAiG-oDLm171BWvv-lQH0dHMg?usp=sharing)
11
 
12
- **Colab Notebook for PeptiDerive**: [Link](https://colab.research.google.com/drive/1aCODZ-WRwhxr-u8nEB6ZrdrhIOTz7-UF?usp=sharing)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
  Targeting specific functional motifs, whether conserved viral epitopes, intrinsically disordered regions (IDRs), or fusion breakpoints, is essential for modulating protein function and protein-protein interactions (PPIs). Current design methods, however, depend on stable tertiary structures, limiting their utility for disordered or dynamic targets. Here, we present a motif-specific PPI targeting algorithm (moPPIt), a framework for the de novo generation of motif-specific peptide binders derived solely from target sequence data. The core of this approach is BindEvaluator, a transformer architecture that interpolates protein language model embeddings to predict peptide-protein binding site interactions with high accuracy (AUC = 0.97). We integrate this predictor into a novel Multi-Objective-Guided Discrete Flow Matching (MOG-DFM) framework, which steers generative trajectories toward peptides that simultaneously maximize binding affinity and motif specificity. After comprehensive in silico validation of binding and motif-specific targeting, we validate moPPIt in vitro by generating binders that strictly discriminate between the FN3 and IgG domains of NCAM1, confirming domain-level specificity, and further demonstrate precise targeting of IDRs by generating binders specific to the N-terminal disordered domain of β-catenin. In functional, disease-relevant assays, moPPIt-designed peptides targeting the GM-CSF receptor effectively block macrophage polarization. Finally, we demonstrate therapeutic utility in cell engineering, where binders directed against the tumor antigen AGR2 drive specific CAR T regulatory cell activation. In total, moPPIt serves as a purely sequence-based paradigm for controllably targeting the "undruggable" and disordered proteome.
6
 
7
  ## Google Colab Notebooks
8
+ We provide two Google Colab notebooks to help you run and evaluate moPPIt without any local setup:
9
 
10
+ - **moPPIt Colab** (Generating motif-specific binders while optimizing other therapeutic-related properties): [Link](https://colab.research.google.com/drive/16n8PIwKwAiG-oDLm171BWvv-lQH0dHMg?usp=sharing)
11
 
12
+ - **PeptiDerive Colab** (Computing Relative Interaction Scores (RIS) for residues on the target protein): [Link](https://colab.research.google.com/drive/1aCODZ-WRwhxr-u8nEB6ZrdrhIOTz7-UF?usp=sharing)
13
+
14
+ ## Command-line Usage
15
+ You can also run **moPPIt** and **BindEvaluator** from the command line.
16
+
17
+ ### Run moPPIt
18
+
19
+ Example command:
20
+ ```
21
+ python -u moo.py \
22
+ --output_file './samples.csv' \
23
+ --length 10 \
24
+ --n_batches 600 \
25
+ --weights 1 1 1 4 4 2 \
26
+ --motifs '16-31,62-79' \
27
+ --motif_penalty \
28
+ --objectives Hemolysis Non-Fouling Half-Life Affinity Motif Specificity \
29
+ --target_protein MHVPSGAQLGLRPDLLARRRLKRCPSRWLCLSAAWSFVQVFSEPDGFTVIFSGLGNNAGGTMHWNDTRPAHFRILKVVLREAVAECLMDSYSLDVHGGRRTAAG
30
+ ```
31
+
32
+ ### Run BindEvaluator
33
+ BindEvaluator predicts the binding sites on the target protein, given a target protein seqeunce and a binder sequence.
34
+
35
+ Example command:
36
+ ```
37
+ python -u bindevaluator.py \
38
+ -target MHVPSGAQLGLRPDLLARRRLKRCPSRWLCLSAAWSFVQVFSEPDGFTVIFSGLGNNAGGTMHWNDTRPAHFRILKVVLREAVAECLMDSYSLDVHGGRRTAAG \
39
+ -binder YVEICRCVVC \
40
+ -sm ./classifier_ckpt/finetuned_BindEvaluator.ckpt \
41
+ -n_layers 8 \
42
+ -d_model 128 \
43
+ -d_hidden 128 \
44
+ -n_head 8 \
45
+ -d_inner 64
46
+ ```