Chromatin is a highly organised structure and the changes in its condensation level reflect many cellular processes, including cell division and cell death. Recently, advances in machine learning have enabled researchers to automatically classify chromatin morphology in fluorescence microscopy images. In this protocol, we develop user friendly tools to perform this task. We provide an open-source annotation tool, and a cloud-based computational framework to train and utilise a convolutional neural network to automatically classify chromatin morphology. Using cloud compute enables users without significant resources or computational experience to use a machine learning approach to analyse their own microscopy data.
- At the terminal make sure you are in the
cnn-annotator
directory and typejupyter notebook
, you will then be redirected to a jupyter notebook environment in your default browser. - Open
A_CNN_Annotation_with_Napari.ipynb
by clicking on the filename. A separate window will open. - In the notebook, modify the file path to your image data (default:
../data/MDCK_H2B_GFP_movie.tif
). If you are using your own classification labels, create those labels in the notebook using theCellState
structure. - Run the notebook and a separate Napari instance with a GUI window will pop up.
- Inside Napari, select a label corresponding to a particular chromatin morphology and annotate the image data by placing a small marker at the centre of each cell with the appropriate morphology.
- Aim to annotate a few hundred examples of each state that you wish to classify.
- Export the data using the
export
button. This writes a single zip file containing all of the image patches as well as the locations in T(Z)YX and ground truth labels. This zip file can be used as a source of training data for the CNN. - You can close the Napari instance when done. Repeat steps 5-8 to create additional annotation zip files for validation and future inference on new images.
- (Optional) If you wish to assess the class balance and visualise your annotations, return to the jupyter notebook interface, then open and run
B_CNN_Data_Preview_Images.ipynb
.
- In the github repository, click on the
Open in Colab
notebook for Training. - Follow the instructions in the notebook, and upload the training and validation datasets to the Colab
/content/train/
and/content/validation/
subfolders. Make sure that you set aside some data for validation. - Set the hyperparameters required for training.
- Run the Colab notebook to train the model.
- Monitor the training performance using TensorBoard.
- Save out and download the trained model (
model.h5
) for future use.
- Calculate the confusion matrix using the notebook.
- Calculate other performance metrics, including the F1-score, precision and recall.
- Using the CNN for chromatin morphology classification on unseen data.
- Once the CNN has been successfully trained, the goal is to use it to explore unseen data.
- In the github repository, click on the
Open in Colab
notebook for Inference. - Upload the test dataset(s) to the Colab
/content/test/
subfolder. - Upload the trained CNN model (
model.h5
) to the notebook. Run the notebook to perform predictions on real data. - Perform UMAP dimensionality reduction along with a manifold projection of single-cell image patches to visualise the network’s classification in 2D.