Infrastructure for MLGO - a Machine Learning Guided Compiler Optimizations Framework.

MLGO is a framework for systematically integrating ML techniques in LLVM. It replaces human-crafted optimization heuristics in LLVM with machine-learned models. The MLGO framework currently supports two optimizations:

inlining-for-size (LLVM RFC);
register-allocation-for-performance (LLVM RFC).

The compiler components are both available in the main LLVM repository. This repository contains the training infrastructure and related tools for MLGO.

We currently use two ML algorithms to train policies: Policy Gradient and Evolution Strategies. This repository currently supports only Policy Gradient training. The release of Evolution Strategies training is on our roadmap.

Check out this demo for an end-to-end demonstration of how to train your own inlining-for-size policy from scratch with Policy Gradient, or check out this demo for a demonstration of how to train your own regalloc-for-performance policy.

For more details about MLGO, please refer to our paper MLGO: a Machine Learning Guided Compiler Optimizations Framework.

For more details about how to contribute to the project, please refer to contributions.

Pretrained models

We occasionally release pretrained models that may be used as-is with LLVM. Models are published as GitHub releases and are named [task]-[major-version].[minor-version]. The versions are semantic: the major version corresponds to breaking changes on the LLVM/compiler side, and the minor version corresponds to model updates that are independent of the compiler.
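The naming scheme above can be taken apart mechanically. The following is a minimal sketch, assuming release names follow the `[task]-[major-version].[minor-version]` pattern literally; `ModelRelease`, `parse_release_name`, and `is_compatible` are hypothetical helpers for illustration and are not part of this repository. The compatibility rule (equal major versions) is our reading of the versioning description, not a documented check.

```python
from typing import NamedTuple


class ModelRelease(NamedTuple):
    """Parts of a release name (hypothetical helper type)."""
    task: str
    major: int
    minor: int


def parse_release_name(name: str) -> ModelRelease:
    """Split '[task]-[major-version].[minor-version]' into its parts."""
    # The task may itself contain hyphens, so split on the last hyphen only.
    task, sep, version = name.rpartition("-")
    if not sep or not task:
        raise ValueError(f"expected '[task]-[major].[minor]', got {name!r}")
    major_text, dot, minor_text = version.partition(".")
    if not dot or not major_text.isdecimal() or not minor_text.isdecimal():
        raise ValueError(f"expected a '[major].[minor]' version, got {version!r}")
    return ModelRelease(task, int(major_text), int(minor_text))


def is_compatible(release: ModelRelease, compiler_major: int) -> bool:
    """Assumption: a model matches a compiler when the major versions agree,
    since the major version tracks breaking changes on the compiler side."""
    return release.major == compiler_major
```

For a made-up name such as `inlining-for-size-1.2`, `parse_release_name` yields the task `inlining-for-size`, major version 1, and minor version 2. A newer minor version with the same major version would then be treated as a drop-in model update.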

When building LLVM, you may set the flag -DLLVM_INLINER_MODEL_PATH to the path to your inlining model. If the flag is set to download, CMake will download the most recent compatible model from GitHub. Possible values for the flag are:

# Model is in /tmp/model, i.e. there is a file /tmp/model/saved_model.pb along
# with the rest of the tensorflow saved_model files produced from training.
-DLLVM_INLINER_MODEL_PATH=/tmp/model

# Download the most recent compatible model
-DLLVM_INLINER_MODEL_PATH=download

Prerequisites

Currently, the assumptions for the system are:

A recent Ubuntu distro, e.g. 20.04.
Python 3.8.x/3.9.x/3.10.x.
For local training, which is currently the only supported mode, we recommend a high-performance workstation (e.g. 96 hardware threads).
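The assumptions listed above can be checked before starting a training run. The following is a minimal sketch; `python_supported` and `hardware_threads` are hypothetical helpers, and the set of accepted versions simply mirrors the list above. It does not check the Ubuntu release.

```python
import os
import sys

# Python versions named in the prerequisites, as (major, minor) pairs.
SUPPORTED_PYTHONS = {(3, 8), (3, 9), (3, 10)}


def python_supported(version_info=sys.version_info) -> bool:
    """True if the interpreter's major.minor version is one of those listed."""
    return (version_info[0], version_info[1]) in SUPPORTED_PYTHONS


def hardware_threads() -> int:
    """Number of logical CPUs visible to Python (at least 1)."""
    # os.cpu_count() may return None when the count cannot be determined.
    return os.cpu_count() or 1


if __name__ == "__main__":
    print("Python version supported:", python_supported())
    print("Hardware threads:", hardware_threads())
```

The thread count is informational only: the 96 hardware threads mentioned above are a recommendation, not a hard requirement.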

Training assumes a clang build with ML 'development-mode'. Please refer to:

LLVM documentation
the build bot script

The prerequisites specific to model training are:

Pipenv:

pip3 install pipenv

The actual dependencies:

pipenv sync --system --categories "packages dev-packages ci"

Note that the above command will only work from the root of the repository since it needs to have Pipfile.lock in the working directory at the time of execution.

The above command will also install all the packages, including development packages (the dev-packages category), and packages only needed in CI (the ci category). If you do not need those, you can omit them from the categories option.
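The two notes above (run from the repository root, optionally drop categories) can be captured in a small wrapper. The following is a minimal sketch; `pipenv_sync_command` is a hypothetical helper, and it only assembles the command shown above from the categories named in the text.

```python
from pathlib import Path
from typing import List, Sequence


def pipenv_sync_command(repo_root: str,
                        extra_categories: Sequence[str] = ()) -> List[str]:
    """Build the `pipenv sync` command for the chosen package categories.

    `extra_categories` may contain 'dev-packages' and/or 'ci'; the base
    'packages' category is always included.
    """
    # pipenv needs Pipfile.lock in the working directory, so verify it first.
    if not (Path(repo_root) / "Pipfile.lock").is_file():
        raise FileNotFoundError(f"Pipfile.lock not found in {repo_root}")
    categories = " ".join(["packages", *extra_categories])
    return ["pipenv", "sync", "--system", "--categories", categories]
```

The resulting list can be passed to `subprocess.run(cmd, cwd=repo_root, check=True)` so that the command executes in the directory that holds Pipfile.lock. Calling the helper with no extra categories installs only the runtime packages.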

Optionally, to run tests (run_tests.sh), you also need:

sudo apt-get install virtualenv

Note that the same TensorFlow package is also needed for building the 'release' mode for LLVM.

Docs

An end-to-end demo using Fuchsia as a codebase from which we extract a corpus and train a model.

How to add a feature guide.

Extensibility model.
