Data Sources

Here, we describe the data sources for the crystal structures in the QMOF Database.

Synthesized MOFs

CSD

Most materials in the QMOF Database are taken from the Cambridge Structural Database (CSD) with their free solvents removed. To determine which materials in the CSD are sufficiently MOF-like, we used the lists of materials in the CSD MOF Subset and the 2019 CoRE MOF Database. Unless otherwise stated, these structures were taken directly from the CSD so that we could use the CSD metadata to choose which materials to discard in our pre-screening process. Because the CSD MOF Subset defines a MOF loosely, some of these materials may be better classified as coordination polymers. If you are specifically interested in porous materials, you may wish to filter the materials by void fraction or pore size.

All materials taken from the CSD have _FSR at the end of their name. The source flag is set to 'CSD'.
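As a minimal sketch of the filtering suggested above, the snippet below selects CSD-sourced entries and keeps only those that look porous. The record layout, the field names (`source`, `pld`, `void_fraction`), the entry names, and the cutoff values are all assumptions made for illustration; check the actual database files for the real keys and choose thresholds that suit your application.

```python
# Hypothetical records: names, keys, and values are placeholders,
# not entries or field names taken from the QMOF Database itself.
records = [
    {"name": "REFCODEA_FSR", "source": "CSD", "pld": 1.2, "void_fraction": 0.02},
    {"name": "REFCODEB_FSR", "source": "CSD", "pld": 6.5, "void_fraction": 0.45},
    {"name": "gmof_example", "source": "GMOF", "pld": 8.0, "void_fraction": 0.60},
]


def porous_csd_entries(records, min_pld=3.0, min_void_fraction=0.1):
    """Keep CSD-sourced entries whose pore-limiting diameter (assumed to be
    in angstroms) and void fraction both meet the (arbitrary) cutoffs."""
    return [
        r
        for r in records
        if r["source"] == "CSD"
        and r["pld"] >= min_pld
        and r["void_fraction"] >= min_void_fraction
    ]


selected = porous_csd_entries(records)
print([r["name"] for r in selected])
```

Filtering on the source flag first keeps the porosity cutoffs from silently mixing synthesized and hypothetical structures.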

CoRE 2019

Since v8 of the QMOF Database, some MOFs have been taken directly from the 2019 CoRE MOF Database uploaded to Zenodo. This was done in one of two ways, each relying on a curated subset of CoRE MOFs to reduce the likelihood of obtaining erroneous crystal structures. The first approach, uploaded with v8, adopted the curated subset discussed in Kancharlapalli et al. The second approach, uploaded with v9, adopted the curated subset presented in Chen and Manz after removing entries with CSD-identified charge-balancing ions. In both cases, only the FSR (free solvent removed) version of the CoRE MOF Database was used.

All materials taken directly from the CoRE MOF Database have core at the start of their name. The source flag is set to 'CoRE'. Note that some materials that appear in the CoRE MOF Database were downloaded from the CSD (as described above) rather than taken from the CoRE MOF Database itself. These structures do not have core in their names. If you are specifically looking for all CoRE MOFs, cross-check the refcodes in the QMOF Database against the list of structures in the parent CoRE MOF Database.
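The cross-check described above might be sketched as follows. This assumes that the refcode of a CSD-sourced entry can be recovered by stripping the trailing `_FSR` from its name, and that you have already extracted a set of refcodes from the parent CoRE MOF Database. Both the helper names and the example entries are hypothetical.

```python
def refcode_from_name(name):
    """Strip a trailing '_FSR' to recover the refcode (assumed naming pattern)."""
    suffix = "_FSR"
    return name[: -len(suffix)] if name.endswith(suffix) else name


def find_core_mofs(qmof_entries, core_refcodes):
    """Return names of entries that are CoRE MOFs, either because their
    source flag is 'CoRE' or because they are CSD-sourced entries whose
    refcode also appears in the CoRE MOF Database."""
    matches = []
    for entry in qmof_entries:
        if entry["source"] == "CoRE":
            matches.append(entry["name"])
        elif entry["source"] == "CSD":
            if refcode_from_name(entry["name"]) in core_refcodes:
                matches.append(entry["name"])
    return matches


# Placeholder data, not real database entries.
qmof_entries = [
    {"name": "REFCODEA_FSR", "source": "CSD"},
    {"name": "REFCODEB_FSR", "source": "CSD"},
    {"name": "core_REFCODEC_FSR", "source": "CoRE"},
    {"name": "gmof_example", "source": "GMOF"},
]
core_refcodes = {"REFCODEA", "REFCODEC"}

core_mofs = find_core_mofs(qmof_entries, core_refcodes)
print(core_mofs)
```

Relying on the source flag alone would miss `REFCODEA_FSR` in this example, which is why the refcode comparison is needed.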

Pyrene MOFs

A few pyrene-containing MOFs were taken from prior work using the data uploaded to the Materials Cloud.

All materials taken from this dataset have pyrene in their name. The source flag is set to 'Pyrene'.

Hypothetical MOFs

Since v6, the QMOF Database has included hypothetical MOFs from various sources.

ToBaCCo

Several of the MOFs in the database were constructed using the ToBaCCo code, described in a series of prior studies here and here. The ToBaCCo-derived datasets are described below.

Anderson and Gómez-Gualdrón

A database of ToBaCCo-constructed Zr-MOFs was adopted from prior work by Anderson and Gómez-Gualdrón. See here for the dataset. We also exchanged the Zr species for Hf to include hypothetical Hf-MOFs.

All materials taken from this dataset have tobacco and _SR_ in their names. The source flag is set to 'Anderson'.

Colón, Gómez-Gualdrón, and Snurr

A database of Cu triangle MOFs was adopted from prior work by Colón, Gómez-Gualdrón, and Snurr. These structures had H atoms in the center of the Cu triangles, which were removed before relaxing their structures with DFT. See here for the dataset.

All materials taken from this dataset have tobacco in their names. The source flag is set to 'ToBaCCo'.

Boyd and Woo

Several of the MOFs in the database were constructed using the TOBASCCO code as described in prior work by Boyd and Woo. These were obtained from prior work by Boyd et al. using the Materials Cloud dataset here. We made modifications to several of these MOFs prior to structure relaxation to diversify our dataset. For instance, we occasionally exchanged the metals in the inorganic node, and we constructed Al rod MOFs by exchanging the metals in the pre-existing V rod MOFs and protonating the bridging oxo ligands.

All materials taken from this dataset have boydwoo in their names. The source flag is set to 'BoydWoo'.

GMOFs

Hypothetical MOFs were also taken from the genomic MOF (GMOF) database here. All materials taken from this dataset have gmof in their names. The source flag is set to 'GMOF'.

Haranczyk Datasets

Several of the MOFs in the database were obtained from prior work by Maciej Haranczyk and coworkers.

MOF-5

Hypothetical MOF-5 analogues were obtained from prior work by Haranczyk and colleagues. See here for the dataset.

All materials taken from this dataset have MOF-5 in their names. The source flag is set to 'Haranczyk_MOF5'.

MOF-74

Hypothetical MOF-74 analogues were obtained from prior work by Haranczyk and colleagues.

All materials taken from this dataset have mof74 in their names. The source flag is set to 'Haranczyk_MOF74'.
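The naming conventions listed throughout this document can be summarized in a small lookup function. This is only a sketch: the function name is hypothetical, the order of the checks is an assumption (for example, `_SR_` is tested before falling back to the general ToBaCCo case, and the `core` prefix is tested before the `_FSR` suffix), and the matching is case-sensitive. The source flag stored in the database remains the authoritative value.

```python
def infer_source(name):
    """Guess the source flag from a material name using the substrings
    documented for each dataset. Returns None if nothing matches."""
    if name.startswith("core"):
        return "CoRE"
    if "pyrene" in name:
        return "Pyrene"
    if "boydwoo" in name:
        return "BoydWoo"
    if "gmof" in name:
        return "GMOF"
    if "tobacco" in name:
        # Anderson and Gómez-Gualdrón structures carry both markers.
        return "Anderson" if "_SR_" in name else "ToBaCCo"
    if "MOF-5" in name:
        return "Haranczyk_MOF5"
    if "mof74" in name:
        return "Haranczyk_MOF74"
    if name.endswith("_FSR"):
        return "CSD"
    return None


# Placeholder names that only mimic the documented patterns.
for example in ["REFCODEA_FSR", "core_REFCODEC_FSR", "tobacco_x_SR_y", "tobacco_x"]:
    print(example, infer_source(example))
```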