|
| 1 | +# OpenFF Iodine Fragment Opt v1.0 |
| 2 | + |
| 3 | +## Description |
| 4 | + |
| 5 | +A dataset containing fragments of molecules from the Zinc, Enamine 10240, Enamine 50240, and CHEMBL datasets, optimized at the B3LYP-D3BJ/DZVP level of theory. Molecules containing I were fragmented into small fragments. |
| 6 | +Fragments containing I then had protomers enumerated. |
| 7 | + |
| 8 | +For each resulting molecule, a set of up to 5 conformers were generated by: |
| 9 | + * generating a set of up to 1000 conformers with a RMS cutoff of 0.5 Å using the OpenEye backend of the OpenFF toolkit |
| 10 | + * applying ELF conformer selection (max 5 conformers) using OpenEye |
| 11 | + |
| 12 | + |
| 13 | +## General information |
| 14 | + |
| 15 | +* Date: 2024-09-10 |
| 16 | +* Class: OpenFF OptimizationDataset |
| 17 | +* Purpose: Geometry optimization for eventual ESP calculation |
| 18 | +* Name: OpenFF Iodine Fragment Opt v1.0 |
| 19 | +* Number of unique molecules: 526 |
| 20 | +* Number of conformers: 531 |
| 21 | +* Number of conformers per molecule (min, mean, max): 1, 1.01, 5 |
| 22 | +* Molecular weight (min, mean, max): (155.97, 242.52, 316.94) |
| 23 | +* Charges: [-1.0, 0.0, 1.0, 2.0] |
| 24 | +* Dataset submitter: Alexandra McIsaac |
| 25 | +* Dataset generator: Alexandra McIsaac |
| 26 | + |
| 27 | + |
| 28 | +## QCSubmit generation pipeline |
| 29 | + |
| 30 | +* `generate-dataset.ipynb`: This notebook shows how the dataset was prepared from the input files. |
| 31 | + * Conformers were generated and selected using OpenEye |
| 32 | + |
| 33 | + |
| 34 | +## QCSubmit Manifest |
| 35 | + |
| 36 | +* `dataset.json.bz2`: Compressed dataset ready for submission |
| 37 | +* `dataset.pdf`: Visualization of dataset molecules |
| 38 | +* `dataset.smi`: Smiles strings for dataset molecules |
| 39 | +* `generate-dataset.ipynb`: Notebook describing dataset generation and submission |
| 40 | +* `enumerated.smi`: Input SMILES |
| 41 | +* `input-environment.yaml`: Environment file used to create Python environment for the notebook |
| 42 | +* `full-environment.yaml`: Fully-resolved environment used to execute the notebook. |
| 43 | + |
| 44 | + |
| 45 | +## Metadata |
| 46 | + |
| 47 | +* Elements: {C, I, Br, Cl, S, N, H, F, O} |
| 48 | +* unique molecules: 526 |
| 49 | +* Spec: default |
| 50 | + * basis: DZVP |
| 51 | + * implicit_solvent: None |
| 52 | + * keywords: {} |
| 53 | + * maxiter: 200 |
| 54 | + * method: B3LYP-D3BJ |
| 55 | + * program: psi4 |
| 56 | + * SCF properties: |
| 57 | + * dipole |
| 58 | + * quadrupole |
| 59 | + * wiberg_lowdin_indices |
| 60 | + * mayer_indices |
0 commit comments