Skip to content

Commit 07247d2

Browse files
amcisaaclilyminium
andauthored
Iodine frag opt (#373)
* adding dataset * Create README.md * correctly name notebook and update spec * adding full env * Update README.md * changed to default spec * Update README.md * Update README.md * Update submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0/README.md Co-authored-by: Lily Wang <[email protected]> * Update submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0/README.md Co-authored-by: Lily Wang <[email protected]> * Update submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0/README.md Co-authored-by: Lily Wang <[email protected]> * Update submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0/README.md Co-authored-by: Lily Wang <[email protected]> * updated dataset description, correctly exported full env * submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0/dataset.json.bz2: convert to Git LFS --------- Co-authored-by: Lily Wang <[email protected]> Co-authored-by: Lily Wang <[email protected]>
1 parent 62a8290 commit 07247d2

File tree

9 files changed

+2208
-0
lines changed

9 files changed

+2208
-0
lines changed

README.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -252,6 +252,7 @@ These are currently used to find a minimum energy conformation of a molecule.
252252
| `OpenFF Torsion Benchmark Supplement Optimization Dataset v1.0` | [2024-04-18-OpenFF-Torsion-Benchmark-Supplement-Optimization-Dataset-v1.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-04-18-OpenFF-Torsion-Benchmark-Supplement-Optimization-Dataset-v1.0) | Additional optimizations for benchmarking Sage 2.2.0 proper torsions and new parameters from the torsion multiplicity work | H, C, N, O, F, P, S, Cl, Br | |
253253
| `OpenFF Torsion Multiplicity Optimization Training Coverage Supplement v1.0` | [2024-06-20-OpenFF-Torsion-Multiplicity-Optimization-Training-Coverage-Supplement-v1.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-06-20-OpenFF-Torsion-Multiplicity-Optimization-Training-Coverage-Supplement-v1.0) | Additional optimization training data for Sage 2.2.0 proper torsions and new parameters from the torsion multiplicity work | C, Cl, S, O, H, P, N, Br | |
254254
| `OpenFF Torsion Multiplicity Optimization Benchmarking Coverage Supplement v1.0` | [2024-06-24-OpenFF-Torsion-Multiplicity-Optimization-Benchmarking-Coverage-Supplement-v1.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-06-24-OpenFF-Torsion-Multiplicity-Optimization-Benchmarking-Coverage-Supplement-v1.0) | Additional optimization benchmarking data for Sage 2.2.0 proper torsions and new parameters from the torsion multiplicity work | Cl, H, I, S, O, N, Br, C, P | |
255+
|`OpenFF Iodine Fragment Opt v1.0` | [2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0](https://github.com/openforcefield/qca-dataset-submission/tree/master/submissions/2024-09-10-OpenFF-Iodine-Fragment-Opt-v1.0) | B3LYP-D3BJ/DZVP optimized conformers for a variety of I-containing fragment molecules | C, O, I, S, F, Br, Cl, N, H ||
255256
256257
257258
# TorsionDrive Datasets
Lines changed: 60 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,60 @@
1+
# OpenFF Iodine Fragment Opt v1.0
2+
3+
## Description
4+
5+
A dataset containing fragments of molecules from the Zinc, Enamine 10240, Enamine 50240, and CHEMBL datasets, optimized at the B3LYP-D3BJ/DZVP level of theory. Molecules containing I were fragmented into small fragments.
6+
Fragments containing I then had protomers enumerated.
7+
8+
For each resulting molecule, a set of up to 5 conformers were generated by:
9+
* generating a set of up to 1000 conformers with a RMS cutoff of 0.5 Å using the OpenEye backend of the OpenFF toolkit
10+
* applying ELF conformer selection (max 5 conformers) using OpenEye
11+
12+
13+
## General information
14+
15+
* Date: 2024-09-10
16+
* Class: OpenFF OptimizationDataset
17+
* Purpose: Geometry optimization for eventual ESP calculation
18+
* Name: OpenFF Iodine Fragment Opt v1.0
19+
* Number of unique molecules: 526
20+
* Number of conformers: 531
21+
* Number of conformers per molecule (min, mean, max): 1, 1.01, 5
22+
* Molecular weight (min, mean, max): (155.97, 242.52, 316.94)
23+
* Charges: [-1.0, 0.0, 1.0, 2.0]
24+
* Dataset submitter: Alexandra McIsaac
25+
* Dataset generator: Alexandra McIsaac
26+
27+
28+
## QCSubmit generation pipeline
29+
30+
* `generate-dataset.ipynb`: This notebook shows how the dataset was prepared from the input files.
31+
* Conformers were generated and selected using OpenEye
32+
33+
34+
## QCSubmit Manifest
35+
36+
* `dataset.json.bz2`: Compressed dataset ready for submission
37+
* `dataset.pdf`: Visualization of dataset molecules
38+
* `dataset.smi`: Smiles strings for dataset molecules
39+
* `generate-dataset.ipynb`: Notebook describing dataset generation and submission
40+
* `enumerated.smi`: Input SMILES
41+
* `input-environment.yaml`: Environment file used to create Python environment for the notebook
42+
* `full-environment.yaml`: Fully-resolved environment used to execute the notebook.
43+
44+
45+
## Metadata
46+
47+
* Elements: {C, I, Br, Cl, S, N, H, F, O}
48+
* unique molecules: 526
49+
* Spec: default
50+
* basis: DZVP
51+
* implicit_solvent: None
52+
* keywords: {}
53+
* maxiter: 200
54+
* method: B3LYP-D3BJ
55+
* program: psi4
56+
* SCF properties:
57+
* dipole
58+
* quadrupole
59+
* wiberg_lowdin_indices
60+
* mayer_indices
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
version https://git-lfs.github.com/spec/v1
2+
oid sha256:d324e5dbf3865fbe68e37c2c314ffe423b4c25c2669828bbd0243713920b8e1a
3+
size 134075
Binary file not shown.

0 commit comments

Comments
 (0)