| # Model documentation & parameters | |
| **Algorithm Version**: Which model version to use. | |
| **Primer SMILES**: The SMILES string for priming the generation (**has** to be provided). | |
| **Maximal sequence length**: The maximal number of SMILES tokens in the generated molecule. | |
| **Sampling uniquely**: Generate unique sample sequences if set to true. | |
| **Number of samples**: How many samples should be generated (between 1 and 50). | |
| # Model card -- REINVENT | |
| **Model Details**: *REINVENT* is a collection of tools for *de novo* drug design. Here, we showcase the SMILES-based generative model. For details, see [Blaschke et al. (2020); *J. Chem. Inf. Model.*](https://pubs.acs.org/doi/10.1021/acs.jcim.0c00915). | |
| **Developers**: Thomas Blaschke and colleagues from AstraZeneca. | |
| **Distributors**: Original authors' code integrated into GT4SD. | |
| **Model date**: 2020, see the [REINVENT 2.0 paper](https://pubs.acs.org/doi/pdf/10.1021/acs.jcim.0c00915) | |
| **Model version**: N.A. | |
| **Model type**: A sequence-based molecular generator from the REINVENT toolbox. | |
| **Information about training algorithms, parameters, fairness constraints or other applied approaches, and features**: | |
| The code for getting unique sample sequences, randomizing scaffolds, and generation of the dataset as well as the dataloader was taken from the original implementation of [Molecular Reinvent](https://github.com/MolecularAI/Reinvent). Our implementation does not include [BaseAction](https://github.com/MolecularAI/Reinvent/blob/982b26dd6cfeb8aa84b6d7e4a8c2a7edde2bad36/running_modes/lib_invent/rl_actions/sample_model.py#:~:text=class%20BaseAction(abc.ABC)%3A) as a parent class for the [ReinventBase](/gt4sd/algorithms/conditional_generation/reinvent/reinvent_core/core.py) where we have added all the functions of [Molecular Reinvent](https://github.com/MolecularAI/Reinvent). | |
| **Paper or other resource for more information**: | |
| [REINVENT 2.0 -- Blaschke et al. (2020); *J. Chem. Inf. Model.*](https://pubs.acs.org/doi/10.1021/acs.jcim.0c00915). | |
| **License**: MIT | |
| **Where to send questions or comments about the model**: Open an issue on [GT4SD-REINVENT-repository](https://github.com/GT4SD/reinvent-models). | |
| **Intended Use. Use cases that were envisioned during development**: Chemical research, in particular drug discovery. | |
| **Primary intended uses/users**: Researchers and computational chemists using the model for model comparison or research exploration purposes. | |
| **Out-of-scope use cases**: Production-level inference, producing molecules with harmful properties. | |
| **Metrics**: N.A. | |
| **Datasets**: N.A. | |
| **Ethical Considerations**: Unclear, please consult with original authors in case of questions. | |
| **Caveats and Recommendations**: Unclear, please consult with original authors in case of questions. | |
| Model card prototype inspired by [Mitchell et al. (2019)](https://dl.acm.org/doi/abs/10.1145/3287560.3287596?casa_token=XD4eHiE2cRUAAAAA:NL11gMa1hGPOUKTAbtXnbVQBDBbjxwcjGECF_i-WC_3g1aBgU1Hbz_f2b4kI_m1in-w__1ztGeHnwHs) | |
| ## Citation | |
| ```bib | |
| @article{blaschke2020reinvent, | |
| title={REINVENT 2.0: an AI tool for de novo drug design}, | |
| author={Blaschke, Thomas and Ar{\'u}s-Pous, Josep and Chen, Hongming and Margreitter, Christian and Tyrchan, Christian and Engkvist, Ola and Papadopoulos, Kostas and Patronov, Atanas}, | |
| journal={Journal of chemical information and modeling}, | |
| volume={60}, | |
| number={12}, | |
| pages={5918--5922}, | |
| year={2020}, | |
| publisher={ACS Publications} | |
| } | |
| ``` |