Skip to content

Commit

Permalink
Update 01 - Finetune Virtual EVE.ipynb
Browse files Browse the repository at this point in the history
Update language
  • Loading branch information
aespaldi authored Oct 21, 2024
1 parent b17066e commit 38f2b06
Showing 1 changed file with 10 additions and 10 deletions.
20 changes: 10 additions & 10 deletions notebooks/camera_ready/virtual_eve/01 - Finetune Virtual EVE.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -14,21 +14,22 @@
"### Background\n",
"This notebook provides two examples of finetuning SDOFM for scientific use cases. First we finetune a virtual eve virtual instrument starting from a SDOFM pretrained foundation model, accomplishing a production ready model much faster and resource efficient than training from scratch. As a second example, we finetune train a missing data data generator\n",
"\n",
"There are two further examples shared in this directory:\n",
"There are three further examples shared in this directory:\n",
" - `02 - Finetune Missing Data`\n",
" - `03 - Finetune Instrument Degradation`\n",
" - `04 - Embeddings to F10.7`\n",
" \n",
"However, this notebook has been annotated as it contains multiple aspects and examples of our foundation model. \n",
"This notebook also contains ample annotation and multiple aspects and examples of our foundation model. \n",
"\n",
"#### Foundation Models\n",
"In principle, one may train an AI model from scratch for a variety of SDO data-driven use cases. This is a wasteful process however as all models have a common foundational training on the same SDO dataset. The aim of producing a foundation model for SDO use cases is to avoid this inefficient process; we first run a process called pretraining to generate a base model that one can think of as a compressed representation of the dataset it was trained on, in this case SDO, and start from there. A loose analogy would be that to building a space elevator for launching new space vessels. \n",
"It is possible to train an AI model from scratch for a variety of SDO data-driven use cases using the same SDO dataset. However, the aim of producing a foundation model for SDO use cases is to avoid this inefficient process. The first step is to run a process called pretraining to generate a base model that one can think of as a compressed representation of the dataset it was trained on, in this case SDO. \n",
"\n",
"This process of training a common foundation model from which many others can be adapted is akin to that of transfer learning, a method that has been around for about 15 years or so. For an extensive treatment of transfer learning please see [review paper](https://arxiv.org/abs/1811.08883). The creation of a foundation model takes this theoretical underpinning further by conducting the foundational model training on a much larger dataset, with a much larger model, which is thought to have the expressivity capable of understanding deep underlying dynamics in the data. This could be considered the holy grail of scientific foundation models; to train a massive neural network on a massive dataset such as that from CERN's Large Hadron Collider, and somehow have the model express back to us the Standard Model of Particle Physics.\n",
"This process of training a common foundation model from which many others can be adapted similar to transfer learning [review paper](https://arxiv.org/abs/1811.08883). The foundation model training is conducted on a much larger dataset, with a much larger model, which is thought to have the expressivity capable of understanding deep underlying dynamics in the data. \n",
"\n",
"##### Typical Foundation Model Architectures\n",
"Over the course of the past few years AI researchers have began to converge on architectures that lend themselves well to the foundation model archetype. Typically the pretrained portion of the neural network that is later then repurposed for specific use cases is called the \"Head\". The Head can be thought of a feature extractor; that is, a function $f(\\cdot)$ that maps input data, for example AIA images, to a compressed latent space representation which underlines the important dynamics present in this sample. Though this latent space representation is encoded in a digital language unbeknownst to us, this latent space \"language\" representation underlines the important aspects of this sample such as if it may contain active regions, or if it is part of a calm solar cycle. This latent space representation is learned from the data itself with no other input; understanding how to further encode our current physics understanding relevant to various foundation models is an open area of research†.\n",
"\n",
"Only the head is trained during model pretraining, the resulting trained model being what we call a foundation model. Once created, the foundation model can be utilized by the scientific community at large for their own particular studies without needing the burden of the expensive and resource requirements to train a large foundation model. This is for example what Meta does by sharing the pretrained model weights of it's llama family of models so researchers and practitioners do not have to begin from scratch every time. Reproducing this approach for specific scientific domains is the main aim and motivation of the SDOFM collaboration.\n",
"Only the Head is trained during model pretraining, the resulting trained model being what we call a foundation model. Once created, the foundation model can be utilized by the scientific community at large without needing to duplicate the original training. \n",
"\n",
"†[Opportunities for Machine Learning in Physics - Max Welling](https://www.youtube.com/watch?v=DmoVHzMbZGI)\n",
"\n",
Expand All @@ -40,15 +41,15 @@
"metadata": {},
"source": [
"### Setting up the environment [skip if on running sdofm.org!]\n",
"For this section, please be sure to be located in the project root directory before executing any commands. None of the cells in this section are meant to be ran from the notebook IDE, but rather your terminal.\n",
"For this section, please be sure to be located in the project root directory before executing any commands. None of the cells in this section are meant to be run from the notebook IDE, but rather your terminal.\n",
"\n",
"#### System Requirements\n",
"This tutorial assumes that you have conda or miniconda installed and are on a linux or macos machine. It's advisable to install miniconda if you have to decide between the two (smaller install), however, if you already have conda installed, you can skip on to the next step.\n",
"This tutorial assumes that you have conda or miniconda installed and are on a linux or macos machine. We recomment miniconda (smaller install).\n",
"\n",
"For instructions on installing miniconda, please see [Miniconda Installation](https://docs.anaconda.com/miniconda/miniconda-install/).\n",
"\n",
"##### Python environment setup\n",
"After you're sure you have conda installed on your system, please run the following command from the project root directory to install a new conda environment\n",
"After installing conda or miniconda, run the following command from the project root directory to install a new conda environment\n",
"\n",
"```bash\n",
"conda env create -f notebooks/camera_ready/virtual_eve/conda_env.yml\n",
Expand All @@ -63,8 +64,7 @@
"pip install -e .\n",
"```\n",
"Lastly, make sure to select the correct python kernel associated with this environment, (likely located in `${CONDA_PREFIX_1}/envs/virtual-eve-finetuning/bin/python`)\n",
"\n",
"Nice, you should now be all set to go!"
"\n"
]
},
{
Expand Down

0 comments on commit 38f2b06

Please sign in to comment.