Open the structure and the volume map, open the Interactive
tab, click Fit
.
The whole UI design is with the hope to allow users to be able to use DiffFit without training. This is especially the case if the user is familiar with ChimeraX's Fit in Map tool or fitmap command. We provide a detailed Doc below for different usage scenarios.
We suggest using DiffFit as an initial fitting tool and then using the ChimeraX built-in "Fit in Map" to refine DiffFit's results. Because we observed in most cases, the "Fit in Map" refined results' RMSD is slightly better than DiffFit's.
A CUDA-compatible GPU will give the best performance.
But you can also run DiffFit on the CPU.
If you run on the CPU,
you may click Fast
in the Options
button to get the results faster.
You may even decrease # shifts
and/or # quaternions
if you are really in a hurry.
However, the chance that you find a plausible fit is lower.
- Open the source structure and the target map in ChimeraX. Let's use
PDB-6WTI
and its associated EM MapEMD-21897
as an example.- Option 1, via ChimeraX command line. Run the following command in the command line at the very bottom of the ChimeraX windows.
open 6WTI
open 21897 from emdb
- Option 2, via download the files and drag and drop.
- You can find the official RCSB webpage for a PDB ID by either Googling the ID or composing a URL: https://www.rcsb.org/structure/6WTI. Change
6WTI
to another ID if needed. Download Files > PDBx/mmCIF Format
- Click the first
EMDB
, afterEM Map EMD-ID
to access its EM map webpage Download > 3D volume (map.gz)
; unzip the downloaded map file.- Drag and drop both files (
6wti.cif
andemd_21897.map
) into the ChimeraX window.
- You can find the official RCSB webpage for a PDB ID by either Googling the ID or composing a URL: https://www.rcsb.org/structure/6WTI. Change
- Option 1, via ChimeraX command line. Run the following command in the command line at the very bottom of the ChimeraX windows.
- Change the iso-surface threshold level of the map.
- The Volume Viewer is usually located at the bottom-right of the ChimeraX window. Move the slide to change.
- DiffFit is very robust against this parameter, so very often, you don't have to change it.
- To get the best performance, change the level to a value where you can see some secondary structures (alpha-helices or beta sheets).
- In the DiffFit panel, go to the
Interactive
tab. ClickFit
. - After computing, DiffFit will automatically go to the
View
tab and select the top fit. You may change the threshold values. Usually, the defaults work fine. - You can now click on the rows to go through the fitting results. You may sort the table by a different metric by clicking the header (by default, the table is sorted by
Density
). - Once you find a plausible fit, you may use ChimeraX's Fit in Map tool (or a command similar to
fit #1 in #2
) to refine the placement. - If you have a ground truth structure to compare with, you may open that structure and use a command similar to
rmsd #1 to #3
to calculate the RMSD. Check the RMSD doc for more. - Save a molecule by clicking
Structure
if desired; and zero the density occupied by the current molecule:- Click
Simulate volume
- Change the surface level threshold for the simulated volume if necessary
- Click
Zero density
- Click
- Repeat the last step (Save, Simulate, Zero) for other plausible fits
- Save the last
working volume
byFile > Save > Files of type: MRC > Map: working volume
and use it as the input for another round if needed. - In the
Interactive
tab, you may clickOptions
to change the fitting parameters. - In the
Settings
tab, you may change the global settings of DiffFit. The benchmark table in our paper usesFit atoms: All atoms
on an Nvidia RTX 4090 GPU. - You can find the computing time in the ChimeraX log window, usually at the right. Look for
DiffFit total time elapsed:
. The first run is slightly slower as there are some global initialization processes. - If you are interested in comparing DiffFit with the ChimeraX Fit in Map command, you may run a command similar to
fit #1 in #2 search 1000
. Check the fitmap doc for more.
The logic of compositing is you fit multiple structures in one run to one Cryo-EM volume map.
Apart from the target volume file, you will also need to prepare all the individual structures
that you want to fit and simulate a map for each structure.
You will run this functionality in Disk
mode by specify the path to the involved files or folders.
As a demo, let's composite the individual chains from PDB-8SMK into EMD-40589.
-
Open
8SMK
in ChimeraX -
Go to the
Utilities
tab. Under theSplit a structure into individual chains
section:- Set an
Output Folder
. The default is a new folder calledsplit_out
in your working directory, which usually is your desktop. - Select
8smk.cif
as theStructure
. - Press
Split
. - Optional: Go to the output folder, delete chain D, E, and F. Because they are the same as chain A, B, and C.
- Set an
-
Go to the
Utilities
tab. Under theSimulate a map for each structure in the folder
section:- If your individual chains are in
split_out
. Then you may directly clickSimulate
. By default, the simulated maps are in a new folder calledsim_out
in your working directory, which usually is your desktop.
- If your individual chains are in
-
Go to the
Disk
tab, set all the file and folder paths, set theTarget Surface Threshold
(you may open the map in the same ChimeraX window to decide a good surface level value). -
Click
Run!
. -
Use the same way as in
Scenario 1: Fit a single structure
to view the results. -
Save the working volume and go for another round of fitting if needed.
The whole procedure is similar to Scenario 2: Composite multiple structures, only that the input files are different.
All the necessary files are at: https://github.com/nanovis/DiffFit/tree/main/dev_data/input/domain_fit_demo_3domains.
Use the volume map from the above link.
Use the structure files from https://osf.io/download/vgk7f/.
The computing time for searching the whole candidate library on one RTX 4090 is about 7 minutes.