Skip to content

marcinwrochna/qisabelle

Repository files navigation

QIsabelle

QIsabelle aims to give a simple, reproducible environment for evaluating machine-learning models with the Isabelle proof assistant.

This is a mini version of PISA, a Python interface to the Isabelle proof assistant by Albert Qiaochu Jiang, Wenda Li, Jesse Michael Han, and Yuhuai Wu. Both PISA and QIsabelle rely on scala-isabelle by Dominique Unruh.

Usage

QIsabelle contains:

  • a server (written in Scala) that spawns an Isabelle process and provides an HTTP API to interact with it,
  • a Python client library for calling the HTTP API (session.py), with examples in main.py.

Example

    with QIsabelleSession(session_name="HOL", session_roots=[]) as session:
        # Initialize a new theory with imports from HOL, store as "state0".
        session.new_theory(
            theory_name="Test",
            new_state_name="state0",
            imports=["Complex_Main", "HOL-Computational_Algebra.Primes"],
            only_import_from_session_heap=False,
        )
        print(session.describe_state("state0"))

        # Execute a lemma statement, store result as "state1".
        lemma = 'lemma foo: "prime p \\<Longrightarrow> p > (1::nat)"'
        is_proof_done, proof_goals = session.execute("state0", lemma, "state1")
        assert not is_proof_done
        print(indent(proof_goals))  # "proof (prove) goal (1 subgoal):"...

        # Execute a proof and check that it proved the lemma.
        proof = "using prime_gt_1_nat by simp"
        is_proof_done, proof_goals = session.execute("state1", proof, "state2")
        assert is_proof_done and not proof_goals

        # Find an alternative proof with Sledgehammer.
        proof = session.hammer("state1", deleted_facts=["prime_gt_1_nat"])
        print(indent(proof))  # "by (simp add: prime_nat_iff)"
        is_proof_done, proof_goals = session.execute("state1", proof, "state3")
        assert is_proof_done and not proof_goals

HTTP API

The server provides a HTTP API defined and documented in server/src/QIsabelleServer.scala. It uses JSON objects (dicts) as inputs and outputs. It should be easy to use from any language, see client/session.py for a Python wrapper and client/main.py for more usage examples.

Setup

0. Requirements

Python ≥3.10, docker or podman, curl, brotli (for decompressing, install it with your system's package manager).
You do not need to install Isabelle, scala, or Java, as they are included in the container.

1. Clone the repo

git clone [email protected]:marcinwrochna/qisabelle.git

or (if you don't have SSH keys set up with GitHub):

git clone https://github.com/marcinwrochna/qisabelle.git

2. Download heaps

A heap is a saved memory state of the Isabelle/ML process, usually after fully executing an Isabelle session. They are too large to be included in a docker image, so pre-built heaps of all of AFP are available for download. These take 40GB after decompression (and 7GB more is temporarily needed for the compressed download).

By default, QIsabelle uses heaps from the main 2024 (May) AFP release (2024_361b8b643a1d). (Alternatively, you can choose a different one from this page and modify the .env file accordingly, or build a heap yourself from any AFP version: see Building your own heap below).

    cd qisabelle
    source .env
    echo $AFP_ID
    # Download the AFP release (just .thy files, including files generated during heap building).
    curl -u u363828-sub1:7K5XEQ7RDqvbjY8v https://u363828-sub1.your-storagebox.de/afp_$AFP_ID.tar.gz -O
    tar -xf afp_$AFP_ID.tar.gz
    rm afp_$AFP_ID.tar.gz
    mkdir dockerheaps
    cd dockerheaps
    # Download an decompress heaps.
    curl -u u363828-sub1:7K5XEQ7RDqvbjY8v https://u363828-sub1.your-storagebox.de/Isabelle2024_afp_$AFP_ID.tar.br -O
    tar --use-compress-program=brotli -xf Isabelle2024_afp_$AFP_ID.tar.br
    rm Isabelle2024_afp_$AFP_ID.tar.br
    cd ..

Afterwards you should have at least the following directories:

qisabelle
├── afp_$AFP_ID
│   └── thys
├── client
├── dockerheaps
│   └── Isabelle2024_afp_$AFP_ID
│       └── polyml-5.9_x86_64_32-linux
└── server

3. Start the server and client

On port 17000 (change docker-compose.yaml to change the port or to add more replicas):

    docker-compose up

or with podman:

    podman-compose --podman-build-args='--format docker' up

To start the Python client, in another console, run:

    python -um client.main

In case of permission errors, use chown -R 1000:1000 on heaps or chmod -R a+rwX on AFP. If you use podman, do podman build -t qisabelle-server -f ServerDockerfile --format docker . before.

Caveats

  • Initializing Isabelle (API call openIsabelleSession) can take a dozen seconds on a powerful server. And you need to do it every time you change the loaded Isabelle session (so every time you want a different set of pre-built in-heap theories available).
  • When Sledgehammer is used, timeouts make it hard to get reproducible results, success depends on server load, computing power and just random factors.

Heaps – details

Pre-built heaps for QIsabelle are mounted read-only (for reproducibility), as system heaps (at /home/isabelle/Isabelle/heaps/ inside the docker container), in order to keep user heaps writable (at /home/isabelle/.isabelle/heaps/).

Note that heaps include absolute paths, unfortunately, so they cannot be moved around. This means:

  • Heaps downloaded from here can be placed anywhere as long as you mount them as /home/isabelle/Isabelle/heaps/ in a docker container.
  • If you want to use downloaded heaps without docker, you will need to place them at /home/isabelle/Isabelle/heaps/.
  • Heaps you built yourself (if you use Isabelle) cannot be used with QIsabelle, unless you built them at /home/isabelle/Isabelle/heaps/.

Building your own heap

  1. Clone the latest version of AFP (~700MB temporarily) and take just the theory files (~300MB). You can also select a specific tag, branch, or revision (see here) using hg clone -r Isabelle2024.
    hg clone https://foss.heptapod.net/isa-afp/afp-devel
    cd afp-devel
    export AFP_ID=$(hg log -l 1 --template '{date|shortdate}_{node|short}\n' -r .)
    echo $AFP_ID
    hg archive -I "thys/" -I "etc/" ../afp_$AFP_ID/
    cd ..
    rm -r afp-devel
  1. Build all of AFP as system heaps. This takes ~5h on a powerful server and produces ~30-40GB. Timeout errors are normal, just repeat the command to retry failed sessions. You can Ctrl+C and restart to continue at any time. Note this will modify the AFP thys directory (some theories generate code); if you mount it as read-only, a few theories will fail (which would be OK). The -j option specifies the number of parallel workers, more than 30 is probably waste.
    mkdir -p dockerheaps/Isabelle2024_afp_$AFP_ID
    chmod a+rwx dockerheaps/Isabelle2024_afp_$AFP_ID
    chmod -R a+rwX afp_$AFP_ID/
    docker run -it --rm \
        -v $(pwd)/afp_$AFP_ID:/afp:z \
        -v $(pwd)/dockerheaps/Isabelle2024_afp_$AFP_ID:/home/isabelle/Isabelle/heaps:z \
        qisabelle-server \
        isabelle build -b \
        -o system_heaps=true \
        -j 4 -o timeout_scale=3 \
        -D /afp/thys

You can use -D /afp/thys/Hello_World for testing (~7 min, 370MB of heaps).

  1. Optionally, compress and upload the heaps (and modified theories). This takes a few hours.
tar --gzip -cf afp_$AFP_ID.tar.gz afp_$AFP_ID/
cd dockerheaps
tar -cf Isabelle2024_afp_$AFP_ID.tar Isabelle2024_afp_$AFP_ID/
brotli -q 5 --rm Isabelle2024_afp_$AFP_ID.tar
cd ..
scp afp_$AFP_ID.tar.gz dockerheaps/Isabelle2024_afp_$AFP_ID.tar.br hetzner:isabelle_heaps/
rm afp_$AFP_ID.tar.gz dockerheaps/Isabelle2024_afp_$AFP_ID.tar.br

Development

Client requirements

The Python client only uses standard libraries with Python ≥3.10.

It is recommended to use mypy and ruff (or black, isort, flake8) for development.

Building the server docker image

    docker-compose build

To run tests inside it:

    docker-compose -f docker-tests.yaml up

Scala-Isabelle development version

At the moment to work locally (without docker) you will need to install JDK ≥17, SBT, and use the git version of scala-isabelle:

    cd ..
    curl -L https://isabelle.in.tum.de/dist/Isabelle2024_linux.tar.gz -O
    tar -xf Isabelle2024_linux.tar.gz
    # Editing .env should be enough, but you could also make symlinks:
    # ln -sT Isabelle2024 Isabelle
    # ln -sT /home/isabelle `pwd`
    # ln -sT qisabelle/dockerheapds/Isabelle2024_afp_$AFP_ID  /home/isabelle/Isabelle/heaps
    # ln -sT /afp qisabelle/afp_$AFP_ID
    git clone -b test2024 --single-branch https://github.com/marcinwrochna/scala-isabelle.git
    # Or if upstream is updated: git clone https://github.com/dominique-unruh/scala-isabelle.git
    cd scala-isabelle
    sbt publishLocal
    cd ../qisabelle

VS Code

Most of the server development can easily be done with VS Code with the "Scala (Metals)" extension installed. You may need to open build.sc to trigger project building and then server/test/src/IsabelleSessionTests.scala to make the test appear in the test explorer.

With you own version of scala-isabelle

You can clone scala-isabelle, modify it and built it locally using sbt publishLocal. Then change the version in QIsabelle's build.sc to scala-isabelle:master-SNAPSHOT. You will also need to modify ServerDockerfile if you want to build it with a modified scala-isabelle.

About

Query the Isabelle proof assistant with python

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published