Isa Proof Shell

A minimal language eliminates human-oriented complexities in Isabelle, making it particularly suitable for AI agents.

Any suggestions, feature requests, and concerns through GitHub issues are higly welcome! Looking for feedbacks!

Key Features

Communication: Text-based standard I/O with JSON format
Minimized Interface: Minimal number of commands integrating powerful automation tools like Sledgehammer
Concurrency: Efficient concurrent proof sessions powered by Isabelle's built-in thread scheduler
State Management: Support for restore historical proof states.
Bidirectional Portal: That also eases Isabelle to call AI agents in future.
- A shell script that successfully proves a goal, also produces an Isabelle tactic script able to reproduce the proof without using the shell.

Motivation

Isabelle/HOL has evolved into a sophisticated system with numerous features optimized for human interaction. While powerful, many of these features—such as the human-readable Isar language—are unnecessary overhead when working with AI agents. This shell aims to:

Strip away non-essential elements and provide a minimal, agent-friendly interface
Integrate existing automation mechanisms

Example of Simplification

The current Isabelle system includes:

Numerous Isar commands (have, moreover, ultimately, obtain, hence, thus, show, shows, assume, assumes, lemma, theorem, corollary, by, apply, done, qed, proof, etc.)
Multiple subsystems (attributes, tactics, documentation, modules)
Various tactics (this, rule, erule, drule, subst, simp, auto, linarith, blast, fast, fastforce, force, smt, metis, meson)

Many of these commands and tactics have subtle differences and overlapping functionality. Our shell reduces this complexity to fewer than 10 essential proof commands (see Language Reference for details).

As an example, take the proof for "the square root of two is irrational" given in the Wikipedia page of Isabelle

theorem sqrt2_not_rational:
  "sqrt 2 ∉ ℚ"
proof
  let ?x = "sqrt 2"
  assume "?x ∈ ℚ"
  then obtain m n :: nat where
    sqrt_rat: "¦?x¦ = m / n" and lowest_terms: "coprime m n"
    by (rule Rats_abs_nat_div_natE)
  hence "m^2 = ?x^2 * n^2" by (auto simp add: power2_eq_square)
  hence eq: "m^2 = 2 * n^2" using of_nat_eq_iff power2_eq_square by fastforce
  hence "2 dvd m^2" by simp
  hence "2 dvd m" by simp
  have "2 dvd n" proof -
    from ‹2 dvd m› obtain k where "m = 2 * k" ..
    with eq have "2 * n^2 = 2^2 * k^2" by simp
    hence "2 dvd n^2" by simp
    thus "2 dvd n" by simp
  qed
  with ‹2 dvd m› have "2 dvd gcd m n" by (rule gcd_greatest)
  with lowest_terms have "2 dvd 1" by simp
  thus False using odd_one by blast
qed

This proof is translated into our language as follows

GOAL "sqrt 2 ∉ ℚ"
    CRUSH
    LET ?x = "sqrt 2"
    OBTAIN m n :: nat where "¦?x¦ = m / n" and "coprime m n"; END
    HAVE "m^2 = ?x^2 * n^2"; END
    HAVE "m^2 = 2 * n^2"; END
    HAVE "2 dvd m^2"; END
    HAVE "2 dvd m"; END
    HAVE "2 dvd n"
        OBTAIN k where "m = 2 * k"; END
        HAVE "2 * n^2 = 2^2 * k^2"; END
    END
    HAVE "2 dvd gcd m n"; END
    HAVE "2 dvd 1"; END
    HAVE "False"; END
END

Our language is more concise because we use Sledgehammer to automatically generate every small proof script used in the original code. The idea is to let AI agents focus on the macroscopic proof route planning, and leave all the trivial small stuff to existing proof automation mechanisms like Sledgehammer.

Clients just need to start up our shell, and send the above text through the standard input pipeline (stdin), and that is all required to prove lemmas using our shell. If the proof works, our shell shall return an Isabelle script allowing anyone to reproduce the proof.

Schedule (Planned)

Due Date	Phase
Week 1, November	✅ Proposal Draft Release
Week 4, November	✅ Design Specification Freeze
Week 2, January	✅ The language is almost done. Example Gallery
Week 4, January	⏳ Translating AFP to this lang.

Design Specification (Draft)

Language and Protocol

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
doc		doc
library		library
.gitignore		.gitignore
COPYING		COPYING
COPYING.LESSER		COPYING.LESSER
COPYRIGHT		COPYRIGHT
MS_Test.thy		MS_Test.thy
MS_Test_BicategoryOfSpans.thy		MS_Test_BicategoryOfSpans.thy
MS_Test_Divisibility.thy		MS_Test_Divisibility.thy
MS_Test_Sublist.thy		MS_Test_Sublist.thy
MS_Test_Word.thy		MS_Test_Word.thy
MS_Translator.thy		MS_Translator.thy
MinLang_Translator.thy		MinLang_Translator.thy
Proof_Shell.thy		Proof_Shell.thy
Readme.md		Readme.md
Translation_Test.thy		Translation_Test.thy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

Isa Proof Shell

Key Features

Motivation

Example of Simplification

Schedule (Planned)

Design Specification (Draft)

About

Licenses found

Releases

Packages

Languages

License

Licenses found

xqyww123/Isa-Proof-Shell

Folders and files

Latest commit

History

Repository files navigation

Isa Proof Shell

Key Features

Motivation

Example of Simplification

Schedule (Planned)

Design Specification (Draft)

About

Resources

License

Licenses found

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages