Theory Micro_Rust_Examples.Crush_Examples

(* Copyright Amazon.com, Inc. or its affiliates. All Rights Reserved.
   SPDX-License-Identifier: MIT *)

theory Crush_Examples
  imports Micro_Rust_Std_Lib.StdLib_All
begin

section‹▩‹crush› guide›

text‹This file is a brief guide to usage and configuration of the ▩‹crush› family of tactics.›

subsection‹Overview›

text‹▩‹crush› is a high-level tactic for dealing with goals in classical and
separation logic. ▩‹crush› is never trying to be 'clever'. Its goal, instead,
is to do the (seemingly) obvious thing reliably, and to discharge hundreds or thousands
of elementary reasoning steps without human intervention. The proof author is then left with
the tasks of calling ▩‹crush› suitably -- which is what the bulk of this guide is about -- and
filling in the steps that ▩‹crush› cannot solve (such as providing loop invariants, for example).

The birds'-eye view of ▩‹crush› is that of a long disjunction of different branches.
That is, try branch A; if it fails, try branch B; if it fails, try branch C, and so on.
Care has to be taken in choosing the order of branches correctly, both for correctness and
for performance: Regarding correctness, for example, introducing schematic variables must
be handled with a lower priority than operations introducing free variables -- otherwise, the
schematic variables may not instantiate correctly. Regarding performance, powerful tactics
such as a plain ▩‹simp› or ▩‹clarsimp› have to be called with low priority, or otherwise ▩‹crush›
will be too slow for the highly complex goals that we use it for. Even more powerful tactics
such as ▩‹force›, ▩‹blast› or ▩‹auto› are not considered at all. Instead, branches aim to implement
their solution strategy through a mixture of standard tactics ▩‹simp only›, ▩‹elim›, and ▩‹intro›,
or custom ML.

▩‹crush› is highly configurable and shares part of its configuration mechanism with the classical
reasonders such as ▩‹clarsimp›, ▩‹force› or ▩‹auto›: That is, simplification rules can be added/removed
via ▩‹simp add/del: ...›, introduction, split and elimination rules can be added via ▩‹intro: ...›,
▩‹elim: ...› or ▩‹split: ...›. However, there are various other configuration options that the user
needs to be aware of, some specific to separation logic, some refining the classical reasoning.
Those configuration options will be described and exemplified below. It is essential that the users
familiarizes themselves with the configurations, as a bare ▩‹apply crush› is rarely getting far.

▩‹crush› comes in three variants:

- ▩‹crush_base_step›: A single iteration of the ▩‹crush› loop
- ▩‹fastcrush_base›: Run the `crush` loop on the current goal and all emerging goals, until no
   progress can be made anymore.
- ▩‹crush_base›: Run the ▩‹crush› loop on all goals and all emerging goals, until no progress
   can be made anymore.

Since, like ▩‹auto› for example, ▩‹crush_base› can affect the entire goal set, using it in the middle
of proofs can make them harder to maintain. Relying on ▩‹fastcrush_base› usually leads to more robust
proofs. Finally, ▩‹crush_base_step› is mostly useful for debugging purposes; complex proofs require
hundreds of iterations of the ▩‹crush› loop, so manual invocation of ▩‹crush_base_step› is typically
not useful.
›

subsection‹▩‹crush› arguments and configurations›

text‹In this section, we go through various arguments and configuration options of ▩‹crush›
and explain and exemplify their use.

All options apply to all of ▩‹crush›, ▩‹fastcrush› and ▩‹crush_step›, but we illustrate them for
▩‹crush› only.›

subsubsection‹Classical reasoning options›

text‹▩‹crush› supports all arguments that classical reasoning tactics like ▩‹clarsimp›,
▩‹auto›, ▩‹fastforce› or ▩‹force› do. Some examples:›

paragraph‹Simplification rules›

notepad
begin
  fix a b :: bool
  assume h: ‹a ≡ b›
     and b: ‹b›
  have ‹a›
  apply (clarsimp simp add: h)
  apply (rule b)
  done
next
  fix a b :: bool
  assume h: ‹a ≡ b›
     and b: ‹b›
  have ‹a›
  ―‹Same behavior for ▩‹crush››
  apply (crush_base simp add: h)
  apply (rule b)
  done
end

paragraph‹Elimination rules›

notepad
begin
  fix a b c :: bool
  assume h: ‹⋀R. a ⟹ (b ⟹ R) ⟹ R›
     and j: ‹b ⟹ c›
  have ‹a ⟹ c›
  apply (clarsimp elim!: h) ―‹Eliminates ▩‹a› using ▩‹h››
  apply (rule j, force)
  done
next
  fix a b c :: bool
  assume h: ‹⋀R. a ⟹ (b ⟹ R) ⟹ R›
     and j: ‹b ⟹ c›
  have ‹a ⟹ c›
  ―‹Same behavior for ▩‹crush››
  apply (crush_base elim!: h) ―‹Eliminates ▩‹a› using ▩‹h››
  apply (rule j, force)
  done
end

paragraph‹Split rules›

notepad
begin
     fix a b c :: bool
  assume 1: ‹a ⟹ b›
     and 2: ‹¬ a ⟹ c›
  have ‹if a then b else c›
  apply (clarsimp split!: if_splits) ―‹Case split along ▩‹a››
  using 1 2 apply auto
  done
next
     fix a b c :: bool
  assume 1: ‹a ⟹ b›
     and 2: ‹¬ a ⟹ c›
  have ‹if a then b else c›
  ―‹Same behavior for ▩‹crush››
  apply (crush_base split!: if_splits)
  using 1 2 apply auto
  done
end

text‹Note, however, that ▩‹clarsimp› is attempted by ▩‹crush› with very low priority. This can not
only be an issue for performance, but also for correctness, as the following example shows. It is
a bit artifical, but instances of this pattern do emerge in practice:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (crush_base simp add: Some_Ex_def)
    ―‹Leaving us with ▩‹ ⋀x. P x ⤏ Q ?x› which we cannot solve because ▩‹?x› was introduced
    too early and cannot depend on ▩‹x›.›
    oops

  text‹We can confirm that the simplification happened too late by single-stepping through the proof:›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (crush_base_step simp add: Some_Ex_def) ―‹Introducing existential for ▩‹Q ?x››
    apply (crush_base_step simp add: Some_Ex_def)
    apply (crush_base_step simp add: Some_Ex_def) ―‹Unfolding ▩‹Some_Ex› -- too late!›
    oops
end

text‹The next section shows how to fix the situation through 'eager' simplification.›

subsubsection‹Premise/Conclusion simplifications›

text‹The previous section explained that simplifications registered via ▩‹... simp add: ...›
are unfolded with very low priority, which can lead to ▩‹crush› introducing unsolvable schematics
too early. To remedy, ∗‹premises› can be unfolded with higher priority by using the
▩‹simp prems add: ...› parameter. It appplies to both pure and spatial premises.›

text‹Let's see how the previous example can be fixed using ▩‹simp prems add: ...›:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (crush_base simp prems add: Some_Ex_def)
    ―‹Now the goal is ▩‹⋀x. P x ⤏ Q (?x1 x)›, which can actually be solved by setting
    ▩‹?x1 x ≡ x››
    apply (rule PQ)
    done

  text‹We can confirm that the simplification happened earlier than in the previous example
  by single-stepping through the proof:›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (crush_base_step simp prems add: Some_Ex_def)
    apply (crush_base_step simp prems add: Some_Ex_def)  ―‹Premise unfold!›
    apply (crush_base_step simp prems add: Some_Ex_def)  ―‹Introducing existential for Q ?x›
    apply (rule PQ)
    done

  text‹▩‹simp prems/concls› should automatically eta expand definitions:›
  definition not_eta_expanded where ‹not_eta_expanded ≡ λx f. f x›  
  lemma ‹⋀foo. not_eta_expanded f t ⤏ foo›
    apply (crush_base simp prems add: not_eta_expanded_def) 
    oops

end

text‹There is also ▩‹simp concls add: ...› for unfolding pure or spatial ∗‹conclusions›. In contrast
to ▩‹simp prems add: ...›, this is less about correctness than about performance: Unfolding conclusions
late should not lead to bad schematics, but it can lead to performance issues because ▩‹simp add: ...›
is costly and (therefore) only attempted late withihn ▩‹crush›.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex T ≡ ⨆x. T x›

  lemma ‹Some_Ex P ⤏ Some_Ex Q›
    apply (crush_base simp prems add: Some_Ex_def simp concls add: Some_Ex_def)
    ―‹Now the goal is ▩‹⋀x. P x ⤏ Q (?x1 x)›, which can actually be solved by setting
    ▩‹?x1 x ≡ x››
    apply (rule PQ)
    done

  text‹We can confirm that ▩‹simp prems add: ...› takes precedence over ▩‹simp concls add: ...›,
  that is, premises are unfolded with higher priority than conclusions:›

  lemma ‹Some_Ex P ⤏ Some_Ex Q›
    apply (crush_base_step simp prems add: Some_Ex_def simp concls add: Some_Ex_def)
    apply (crush_base_step simp prems add: Some_Ex_def simp concls add: Some_Ex_def)
    apply (crush_base_step simp prems add: Some_Ex_def simp concls add: Some_Ex_def)
    apply (crush_base_step simp prems add: Some_Ex_def simp concls add: Some_Ex_def)
    apply (rule PQ)
    done
end

text‹You can also combine ▩‹simp prems› and ▩‹simp concls› by providing a list of
simpsets which should be modified. The 'normal' simpset used for the ▩‹[clar]simp›
branch in ▩‹crush› is identified by ▩‹generic›:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (crush_base simp [prems, concls, generic] add: Some_Ex_def)
    ―‹Now the goal is ▩‹⋀x. P x ⤏ Q (?x1 x)›, which can actually be solved by setting
    ▩‹?x1 x ≡ x››
    apply (rule PQ)
    done

end

text‹If you write ▩‹simp add: ...›, the default set of simpsets is determined by the
configuration of ▩‹Crush_Config.crush_simp_general_implies_prems› and
▩‹Crush_Config.crush_simp_general_implies_concls›. Depending on whether they are set/unset,
▩‹simp add: ...› will by default include ▩‹prems/concls›; it will always include ▩‹generic›.
Currently, ▩‹Crush_Config.crush_simp_general_implies_prems› and
▩‹Crush_Config.crush_simp_general_implies_concls› are both globally ▩‹false› by default.›

text‹Finally, there is also ▩‹simp early add/del: ...› which are simplifications which are attempted
at highest priority in the first branch of ▩‹crush›. The underlying named theorem list is ▩‹crush_early_simps›.

WARNING: Be careful with adding to ▩‹crush_early_simps› globally -- it may not only break existing proofs
but also cause a significant slowdown since early simplification is attempted at high priority.›

subsubsection‹High-priority introduction rules›

text‹In the previous sections, we discussed how ▩‹crush› includes a low priority branch for  ▩‹clarsimp›
and its configuration via ▩‹simp add/del: ...›, but that for performance and correctness dedicated
parameters ▩‹simp prems/concls/early add/del: ...› should be used.

The situation for introduction rules is similar: They can be registered via ▩‹intro[!]: ...›, in which
case they will be attempted as part of ▩‹clarsimp›, but only with very low priority.

To apply safe introduction rules at high priority, you can instead use ▩‹intro add: ...›, which will
make ▩‹crush› attempt the respective rules in a high priority branch. The underlying named theorem list
is ▩‹crush_intros›, which can also be populated locally/globally via ▩‹declare/notes› as usual.

Beware of the minor syntactic difference between ▩‹crush intro: foo›, which affects only the low-priority
▩‹clarsimp› branch, and ▩‹crush intro add: foo›, which adds to high-priority introduction rules.

Note that ▩‹intro add: ...› registers classical introduction rules. Introduction rules for
separation logic are discussed next.›

subsubsection‹Separation logic introduction and destruction rules›

text‹Separation logic introduction rules can be registered with ▩‹seplog rule add: ...›. Under the
hood, this applies the ▩‹aentails_rule› tactic that we already discussed in 🗏‹../Crush/Examples.thy›.›

text‹Using ▩‹seplog drule add: ...›, the previous example can be subsumed into a single ▩‹crush›
call.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)

text‹Similarly, spatial assertions can be destructed using ▩‹seplog drule add: ...›, which wraps
▩‹aentails_drule›.›
  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    by (crush_base simp prems add: Some_Ex_def seplog drule add: PQ)
end

text‹Separation logic introduction and destruction rules can have pure premises as well, in which
case those premises will be introduced as new subgoals:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹R ⟹ Some_Ex ⤏ (⨆x. Q x)›
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)

  lemma ‹R ⟹ Some_Ex ⤏ (⨆x. Q x)›
    by (crush_base simp prems add: Some_Ex_def seplog drule add: PQ)
end

subsubsection‹Generic simplifications›

text‹We have already seen that ▩‹crush› applies simplification rules for separation logic
automatically. For example, the previous example requires ▩‹crush› to use

 ▩‹
lemma aexists_entailsL':
  assumes ‹⋀x. x ∈ S ⟹ φ x ⤏ ψ›
    shows ‹(⨆x ∈ S. φ x) ⤏ ψ›

lemma aexists_entailsR:
  assumes ‹φ ⤏ ψ x›
    shows ‹φ ⤏ (⨆x. ψ x)›
›

from 🗏‹../Shallow_Separation_Logic/Assertion_Language.thy›, to destruct the existential premise
and to introduce a schematic for the existentially quantified variable in the goal.›

text‹There are many more separation logic simplification and introduction steps that are routinely
applied by ▩‹crush›, such as associativity normalization for ▩‹⋆›, distributivity of ▩‹⋆› and ▩‹⨆›,
cancellation, or hoisting out of pure premises. Here is another example:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)

  text‹This example would already be rather tedious to prove by hand. Indeed, already the number of
  ▩‹crush› steps has increased notably:›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    done
end

text‹Usually, ▩‹crush›'s simplifications are robust and do not need tweaking, and in the rare case
where simplifications need to be watched closely, one can either call individual ▩‹crush› steps
or fall back to lower level tactics and rules.

One notable exception is the omission of rules introducting schematics, which may sometimes need to
be withheld to avoid unsolvable schematics (even though ▩‹simp prems add: ...› and guards -- described below --
already help to avoid those situations).
If you want to prevent ▩‹crush› from introducing schematics for existentially quantified goals,
use ▩‹no_schematics›.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›
  ―‹Helper command for automatically proving ▩‹ucincl› assertions and registering them
  with ▩‹crush›.›
  ucincl_auto Some_Ex

  lemma ‹Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ γ›
    ―‹This would not have worked without ▩‹no_schematics› because ▩‹Some_Ex› needs early unfolding.
    Alternatively to ▩‹simp prems add: ..›, we can also avoid introducing schematics for now, and this
    way ensure that ▩‹Some_Ex› gets unfolded before schematics are introduced:›
    apply (crush_base simp add: Some_Ex_def no_schematics)
    apply (crush_base seplog rule add: PQ)
    done
end

subsection‹Disabling case splits›

text‹By default ▩‹crush› uses ▩‹safe› to split disjunctive premises, and to split conjunctive
subgoals. While safe, this can sometimes lead to an explosion in the number of subgoals, which
is problematic from a performance perspective, esp. if the various cases still have reasoning in
common. ▩‹crush› itself applies ▩‹safe› with a low priority to do as much common reasoning on the
cases as possible, but if such common reasoning happens outside of ▩‹crush›, ▩‹crush› must be forced
not to use ▩‹safe› at all. This is done by passing the ▩‹no_safe› argument.›

experiment
  fixes A B C D :: bool
begin
  lemma ‹A ∨ B ⟹ C ∧ D›
    apply crush_base ―‹Now we have 4 subgoals›
    oops

  text‹Trying again without ▩‹safe›:›

  lemma ‹A ∨ B ⟹ C ∧ D›
    apply ((crush_base no_safe)?) ―‹No progress!›
    oops
end

text‹WARNING: As it stands, ▩‹crush› will unconditionally split ▩‹spatial› goals, and ∗‹never›
split spatial disjunctions:›

experiment
  fixes P Q R S :: ‹'s::sepalg assert›
    and A B C D :: bool
begin
  lemma ‹A ∨ B ⟹ (P ⊔ Q ⤏ R ⊓ S)›
    ―‹TODO: I was expecting ▩‹P ⊔ Q› would be split by default, but that's not the case...›
    apply crush_base ―‹Now we have 4 subgoals›
    oops

  text‹Trying again without ▩‹safe›:›

  lemma ‹A ∨ B ⟹ (P ⊔ Q ⤏ R ⊓ S)›
    apply (crush_base no_safe) ―‹Spatial conjunction still split...›
    oops
end

subsubsection‹Adding branches›

text‹As mentioned, ▩‹crush› never tries to be 'clever' and deliberately relies on foundational tactics
only. If you find that within the realm of some proof(s) you repeatedly need to switch between ▩‹crush›
and another context-specific proof step, you can locally register this step as another branch of ▩‹crush›
using ▩‹branch add: ...›, which takes a ∗‹method› as an argument.›

experiment
  fixes A B C D :: bool
  assumes AB_CD: ‹A ⟹ C› ‹A ⟹ D› ‹B ⟹ C› ‹B ⟹ D›
begin
  lemma ‹A ∨ B ⟹ C ∧ D›
    by (crush_base branch add: ‹intro AB_CD; assumption›)
end

subsubsection‹Debugging ▩‹crush››

text‹If ▩‹crush› behaves surprisingly and you find yourself in need of inspecting what is happening,
there are multiple options.

A useful first step is typically to reproduce the issue with ▩‹fastcrush› rather than ▩‹crush›.
The former should be easier to debug since it applies only to the current goal, while ▩‹crush› applies
to ∗‹all› goals. Also, you should try to minimize the number of arguments passed to ▩‹crush›:
Can you ascribe the unexpected behavior to the addition/deletion of a certain argument?

When that's done, consider one of the following steps:›

paragraph‹Single-stepping›

text‹You can replace ▩‹crush/fastcrush› by ▩‹crush_[base_]step› to trace what ▩‹crush› is doing.
Note, however, that for complex ▩‹crush› calls this may require you to copy-paste the respective
▩‹crush› call dozens or hundreds of times, which is not terribly user friendly.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)

  text‹This example would already be rather tedious to prove by hand. Indeed, already the number of
  ▩‹crush› steps has increased notably:›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    apply (crush_base_step simp prems add: Some_Ex_def seplog rule add: PQ)
    done
end

paragraph‹Use gas›

text‹You can limit the maximum number of ▩‹crush› iterations by providing the numeric ▩‹gas› argument
in conjunction with ▩‹bigstep: false›. This allows you to conduct a binary search to look out for
where ▩‹crush›'s behavior starts to diverge from your expectation.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)

  ―‹How many steps did we actually need?›
  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ bigstep: false gas: 10)
    done ―‹10 was enough!›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ bigstep: false gas: 5)
    oops ―‹5 not enough›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ bigstep: false gas: 8)
    done ―‹8 was enough!›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ bigstep: false gas: 7)
    oops ―‹7 not enough›

  ―‹So we need exactly 8 steps. In this example, we could of course have found out more quickly
  by the 'copy-paste-method', but it does not scale as well.›

end

paragraph‹Use single-stepping›

text‹You can also run ▩‹crush› in single-stepping mode by passing ▩‹stepwise›.
Then, the ▩‹step› command can be used to run the ▩‹crush› once a time. Multiple steps can be taken
using ▩‹step n›, and the entire tactic can be run via ▩‹step *›. You can also run until one of the
goals matches a certain pattern using ▩‹step to goal pattern "PATTERN"›, or until one of the premises
of one of the goals satisfies a certain pattern, using ▩‹step to premise pattern "PATTERN"›. In both
cases, the pattern is expected to be of type ▩‹bool› (not ▩‹prop›). To stop at the occurrence of a
specific schematic, use ▩‹step to schematic "name" index›, where ▩‹?name'idx› is the target schematic.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    using [[crush_log_toplevel]]
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ stepwise)
    step to goal pattern "_ ⤏ (⨆_. _)"
    step
    step 2
    step *
    done

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    using [[crush_log_toplevel]]
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ stepwise)
    step to premise pattern "R"
    step *
    done

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    using [[crush_log_toplevel]]
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ stepwise)
    step to schematic "x" 7
    step *
    done
end

paragraph‹Use backtracking›

text‹You can also use backtracking to retroactively inspect the path ▩‹crush› has taken. For this,
you set the  ▩‹history› argument and use Isabelle's normal backtracking command
▩‹back› after the tactic invocation.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    ―‹TODO: This should be configurable through the ▩‹crush› command line›
    apply (crush_base simp prems add: Some_Ex_def seplog rule add: PQ history)
    back back back back
    back back back
    ―‹Back at the original goal!›
    oops
end

text‹Using backtracking is often more effective than ▩‹gas› as one can easily paste a very large
number of ▩‹back back back ...› blocks and do a binary search by moving the cursor through them,
which will show the intermediate states of ▩‹crush› in the output panel.›

text‹WARNING: You should ∗‹not› use backtracking to manually revert ▩‹crush› to the last good intermediate
goal and continue manually from there: Instead, it should only be used temporarily to identify what
went wrong where, and fix the proof. Manual backtracking would lead to proofs that are very hard to
maintain in the face of changes to ▩‹crush›. For the same reason, ▩‹gas› should only be used as a
debugging mechanism, not as a way to control ▩‹crush›.›

paragraph‹Aborting ▩‹crush››

text‹Another mechanism by which ▩‹crush› can be debugged is by stopping it once the goal matches
a given criterion. The motivation, here, is to facilitate identifying the ▩‹crush› step at which
a certain goal has emerged.

There are two variants: First, you can specify a pattern against which the meta-conclusion of the
intermediate goals should be matched, stopping ▩‹crush› if so. More generally, via ▩‹abort at filter: ...›
you can specify an arbitrary abort filter of type ▩‹Proof.context -> term -> bool› which goal
meta-conclusions will be fed through.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›
  ucincl_auto Some_Ex

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    ―‹Stopping once the existential quantification on the right has reached the top-level›
    apply (crush_base bigstep:false abort at pattern: "_ ⤏ (⨆_. _)")
    oops

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    ―‹Stopping on the first non-separation logic goal›
    apply (crush_base bigstep:false simp prems add: Some_Ex_def seplog rule add: PQ abort at filter: ‹
      fn (ctxt : Proof.context) => fn (goal : term) =>
        (* Double-check that the filter is run on all intermediate goals *)
        let val _ = Pretty.block [Pretty.str "Current goal: ",
                                  goal |> Syntax.pretty_term ctxt] |> Pretty.writeln
        in not (Separation_Logic_Tactics.is_entailment goal) end›)
    oops
end

paragraph‹Case splitting›

experiment
begin
lemma ‹(case y of Some t ⇒ True | None ⇒ True) ⟹ α ⤏ 𝒲𝒫 Γ ⟦ match x { Some(y) ⇒ f(), None ⇒ g() } ⟧ β γ δ›
  ―‹You could use the following, but that would aggressively split ▩‹option› everywhere.›
  apply (crush_base split!: option.splits)
  ―‹▩‹ 1. ⋀x2 x2a. x = Some x2 ⟹ y = Some x2a ⟹ α ⤏ 𝒲𝒫 Γ (call f) β γ δ
        2. ⋀x2. x = Some x2 ⟹ y = None ⟹ α ⤏ 𝒲𝒫 Γ (call f) β γ δ
        3. ⋀x2. x = None ⟹ y = Some x2 ⟹ α ⤏ 𝒲𝒫 Γ (call g) β γ δ
        4. x = None ⟹ y = None ⟹ α ⤏ 𝒲𝒫 Γ (call g) β γ δ››
  oops

lemma ‹(case y of Some t ⇒ True | None ⇒ True) ⟹ α ⤏ 𝒲𝒫 Γ ⟦ match x { None ⇒ f(), Some(y) ⇒ g() } ⟧ β γ δ›
  ―‹If you instead want to split only micro rust expressions, use ▩‹wp split: ...›› 
  apply (crush_base wp split: option.splits)
  ―‹▩‹ 1. ⋀x2. case y of None ⇒ True | _ ⇒ True ⟹ x = Some x2 ⟹ α ⤏ 𝒲𝒫 Γ (call g) β γ δ
        2. case y of None ⇒ True | _ ⇒ True ⟹ x = None ⟹ α ⤏ 𝒲𝒫 Γ (call f) β γ δ››
  oops
end

paragraph‹Logging ▩‹crush››

text‹You can get a trace of the steps that ▩‹crush› has run by adding ▩‹crush_log_toplevel›.
If you want to see all intermediate goals, add ▩‹crush_log_goal›:›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    using [[crush_log_toplevel]]
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)
    (* crush_branch_aentails_cancel_tac success (0.000091s)
       crush_branch_unfold_prems_tac success (0.000019s)
       crush_branch_aentails_core_tac success (0.000321s)
       crush_branch_schematics_tac success (0.000021s)
       crush_branch_aentails_cancel_tac success (0.000063s)
       crush_branch_aentails_rule_tac success (0.000080s)
       crush_branch_base_simps_tac success (0.000015s)
       crush_branch_aentails_cancel_tac success (0.000049s) *)

  ―‹The same, but now also showing all intermediate goals:›
  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    using [[crush_log_toplevel, crush_log_goal]]
    by (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)
end

text‹There are various other logging options which can be used to trace individual branches.
See 🗏‹../Crush/config.ML› for a complete list.›

paragraph‹Comparing two ▩‹crush› calls›

text‹It sometimes happens that one notices one ▩‹crush› call succeeding and another failing, but it
not being clear where the divergence is emerging. In this case, you can use the ▩‹divergence›
tactical, which takes two methods as arguments and repeats them in lock-step to find the point
of divergence. It then stops and uses backtracking to allow the user to inspect the diverging states.

Note that the tactics passed to ▩‹divergence› have to be repeatable, to for use with ▩‹crush›
one needs to pass ▩‹crush_base_step›.

Here is a simple example demonstrating the higher priority of ▩‹simp prems ...› vs. ▩‹simp›:›


experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
  assumes PQ: ‹⋀x. P x ⤏ Q x›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹Some_Ex ⤏ (⨆x. Q x)›
    apply (divergence
       ‹crush_base_step simp prems add: Some_Ex_def›
       ‹crush_base_step simp add: Some_Ex_def›
    )
    ―‹Use backtracking to inspect diverging states›
    back ―‹State when applying LHS ▩‹crush_base_step simp prems add: Some_Ex_def››
    back ―‹State when applying RHS ▩‹crush_base_step simp add: Some_Ex_def››
    oops
end

subsubsection‹Profiling ▩‹crush››

text‹Despite all efforts to keep ▩‹crush› fast, some ▩‹crush› invocations can take a long time.
In this case, the general profiling variant ▩‹applyτ› can be used in conjunction with some or all
of the configuration options ▩‹crush_time_...› to get a timing profile for ▩‹crush›.

NB: Unfortunately, timing profiles are proof local, so you cannot currently aggregate performance
statistics along multiple proofs.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    ―‹By default, the timing mechanism ignores runtimes < 2ms, which is too high to observe
    the runtime of the tactics in this trivial example. We reduce the threshold to 5ns.›
    using [[crush_time_toplevel, crush_timing_threshold=5]]
    applyτ (crush_base simp prems add: Some_Ex_def seplog rule add: PQ)
    show_timelogs
(* Top ten time sinks
- crush_branch_aentails_core_tac: 0.000374s (0.000073s failing, 0.000301s succeeding)
- crush_branch_aentails_cancel_tac: 0.000301s (0.000118s failing, 0.000183s succeeding)
- crush_branch_base_simps_tac: 0.000261s (0.000250s failing, 0.000011s succeeding)
- crush_branch_aentails_rule_tac: 0.000079s (0.000000s failing, 0.000079s succeeding)
- crush_branch_unfold_prems_tac: 0.000041s (0.000028s failing, 0.000013s succeeding)
- crush_branch_focus_tac: 0.000026s (0.000026s failing, 0.000000s succeeding)
- crush_branch_schematics_tac: 0.000022s (0.000000s failing, 0.000022s succeeding)
- crush_branch_unfold_concls_tac: 0.000017s (0.000017s failing, 0.000000s succeeding)
Timing statistics for: crush_branch_aentails_core_tac
- Total time: 0.000374s
- SUCCESSES
  * Number of time reports: 1
  * total 0.000301s, average 0.000301s, median 0.000301s
  * Percentiles:  0.000301s 0.000301s 0.000301s 0.000301s 0.000301s 0.000301s ...
- FAILURES
  * Number of time reports: 2
  * total 0.000073s, average 0.000036s, median 0.000039s
  * Percentiles:  0.000034s 0.000034s 0.000034s 0.000034s 0.000034s 0.000039s ...
Timing statistics for: crush_branch_aentails_cancel_tac
- Total time: 0.000301s
- SUCCESSES
  * Number of time reports: 3
  * total 0.000183s, average 0.000061s, median 0.000064s
  * Percentiles:  0.000044s 0.000044s 0.000044s 0.000044s 0.000064s 0.000064s ...
- FAILURES
  * Number of time reports: 4
  * total 0.000118s, average 0.000029s, median 0.000047s
  * Percentiles:  0.000006s 0.000006s 0.000006s 0.000007s 0.000007s 0.000047s ...
Timing statistics for: crush_branch_base_simps_tac
- Total time: 0.000261s
- SUCCESSES
  * Number of time reports: 1
  * total 0.000011s, average 0.000011s, median 0.000011s
  * Percentiles:  0.000011s 0.000011s 0.000011s 0.000011s 0.000011s 0.000011s ...
- FAILURES
  * Number of time reports: 7
  * total 0.000250s, average 0.000035s, median 0.000038s
  * Percentiles:  0.000026s 0.000026s 0.000027s 0.000031s 0.000031s 0.000038s ...
Timing statistics for: crush_branch_aentails_rule_tac
- Total time: 0.000079s
- SUCCESSES
  * Number of time reports: 1
  * total 0.000079s, average 0.000079s, median 0.000079s
  * Percentiles:  0.000079s 0.000079s 0.000079s 0.000079s 0.000079s 0.000079s ...
- FAILURES
  * Number of time reports: 0
  * total 0.000000s, average 0.000000s, median 0.000000s
  * Percentiles:
Timing statistics for: crush_branch_unfold_prems_tac
- Total time: 0.000041s
- SUCCESSES
  * Number of time reports: 1
  * total 0.000013s, average 0.000013s, median 0.000013s
  * Percentiles:  0.000013s 0.000013s 0.000013s 0.000013s 0.000013s 0.000013s ...
- FAILURES
  * Number of time reports: 3
  * total 0.000028s, average 0.000009s, median 0.000010s
  * Percentiles:  0.000006s 0.000006s 0.000006s 0.000006s 0.000010s 0.000010s ...
Timing statistics for: crush_branch_focus_tac
- Total time: 0.000026s
- SUCCESSES
  * Number of time reports: 0
  * total 0.000000s, average 0.000000s, median 0.000000s
  * Percentiles:
- FAILURES
  * Number of time reports: 1
  * total 0.000026s, average 0.000026s, median 0.000026s
  * Percentiles:  0.000026s 0.000026s 0.000026s 0.000026s 0.000026s 0.000026s ...
Timing statistics for: crush_branch_schematics_tac
- Total time: 0.000022s
- SUCCESSES
  * Number of time reports: 1
  * total 0.000022s, average 0.000022s, median 0.000022s
  * Percentiles:  0.000022s 0.000022s 0.000022s 0.000022s 0.000022s 0.000022s ...
- FAILURES
  * Number of time reports: 0
  * total 0.000000s, average 0.000000s, median 0.000000s
  * Percentiles:
Timing statistics for: crush_branch_unfold_concls_tac
- Total time: 0.000017s
- SUCCESSES
  * Number of time reports: 0
  * total 0.000000s, average 0.000000s, median 0.000000s
  * Percentiles:
- FAILURES
  * Number of time reports: 1
  * total 0.000017s, average 0.000017s, median 0.000017s
  * Percentiles:  0.000017s 0.000017s 0.000017s 0.000017s 0.000017s 0.000017s ...
Top ten time sinks *)
  done
end

text‹If you suspect that poor performance of ▩‹crush› is due to a particular branch running slowly,
you can instruct ▩‹crush› to stop when the duration of single step surpasses a configurable threshold.
For example, ▩‹[clar]simp› invocations on highly complex goals can sometimes take seconds, and in this
case the abort mechanism is a good way to identify and debug them.›

experiment
  fixes P Q :: ‹'a ⇒ 's::sepalg assert›
    and α β γ δ :: ‹'s assert›
    and R :: bool
  assumes PQ: ‹⋀x. R ⟹ P x ⋆ β ⤏ Q x ⋆ γ›
     and P_ucincl[ucincl_intros]: ‹⋀x. ucincl (P x)›
begin
  definition ‹Some_Ex ≡ ⨆x. P x›

  lemma ‹δ ⋆ Some_Ex ⋆ (α ⋆ β) ⋆ ⟨R⟩ ⤏ α ⋆ (⨆x. Q x) ⋆ (γ ⋆ δ)›
    ―‹Stop if ▩‹crush› hits a branch that takes more than 1s›
    using [[crush_time_steps, crush_time_step_bound_ms=1000]]
    apply (crush_base simp prems add: Some_Ex_def bigstep: false branch add: ‹sleep 2›)
    (* step time bound exceeded: 2.001638s
       inner tactic raised stop exception at step 5 *)
    oops
end

subsubsection‹Reasoning in the weakest precondition calculus›

text‹In addition to classical and generic separation logic reasoning, ▩‹crush› supports reasoning
in the weakest precondition (WP) calculus for μRust. ▩‹crush› is aware of WP-rules for various μRust
program constructs and discharges them automatically, thereby effectively stepping through programs.›

text‹We consider a simple example of a μRust expression swapping the values of two references:›

context reference
begin
adhoc_overloading store_update_const ⇌
  update_fun

definition swap_ref :: ‹('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ ('s, unit, unit, 'abort, 'i prompt, 'o prompt_output) expression› where
  ‹swap_ref rA rB ≡ ⟦
     let oldA = *rA;
     let oldB = *rB;
     rA = oldB;
     rB = oldA;
  ⟧›

text‹Let's prove that this indeed swaps the contents of the references ▩‹rA› and ▩‹rB›:›

lemma swap_ref_correct:
  fixes Γ :: ‹('s, 'abort, 'i, 'o) striple_context›
    and gA gB :: 'b
    and vA vB :: 'v
    and rA rB :: ‹('a, 'b, 'v) ref›
  shows ‹Γ ;   ―‹Initial reference contents›
              rA ↦ ⟨⊤⟩ gA↓vA ⋆ rB ↦⟨⊤⟩ gB↓vB
            ⊢ swap_ref rA rB
              ―‹Updated reference contents›
            ⊣ (λ_.  (⨆gA' gB'. rA ↦ ⟨⊤⟩ gA'↓vB ⋆ rB ↦⟨⊤⟩ gB'↓vA))
                ⨝ ⊥ ―‹No ▩‹return›› ⨝ ⊥ ―‹No abort›
  ›
  ―‹Move to WP calculus -- this is not usually done manually but part of the automation for
  function specifications, which is discussed further below.›
  apply (simp add: wp_sstriple_iff)
  using [[crush_log_toplevel]]
  apply (crush_base simp add: swap_ref_def)
  done

text‹NB: It is instructive to see how many low-level proof steps are already necessary for code
as simple as this:›

lemma
  fixes Γ :: ‹('s, 'abort, 'i, 'o) striple_context›
    and gA gB :: 'b
    and vA vB :: 'v
    and rA rB :: ‹('a, 'b, 'v) ref›
  shows ‹Γ ;   ―‹Initial reference contents›
              rA ↦ ⟨⊤⟩ gA↓vA ⋆ rB ↦⟨⊤⟩ gB↓vB
            ⊢ swap_ref rA rB
              ―‹Updated reference contents›
            ⊣ (λ_.  (⨆gA' gB'. rA ↦ ⟨⊤⟩ gA'↓vB ⋆ rB ↦⟨⊤⟩ gB'↓vA))
                ⨝ ⊥ ―‹No ▩‹return›› ⨝ ⊥ ―‹No abort›
  ›
  ―‹Move to WP calculus -- this is not usually done manually but part of the automation for
  function specifications, which is discussed further below.›
  apply (simp add: wp_sstriple_iff)
  using [[crush_log_toplevel]]
  apply (crush_base simp add: swap_ref_def stepwise)
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  done

subsubsection‹Function contracts and specifications›

text‹Normally, we would not reason about individual μRust expressions, but functions, and use
specifications and contracts to do so. We illustrate this in the example of the above
reference-swapping code wrapped into a function:›

definition swap_ref_fun :: ‹('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ ('s, unit, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹swap_ref_fun rA rB ≡ FunctionBody ⟦
     let oldA = *rA;
     let oldB = *rB;
     rA = oldB;
     rB = oldA;
  ⟧›

text‹The contract for a uRust function is usually captured in a separate definition of type
▩‹function_contract›. In this case:›

definition swap_ref_fun_contract ::
   ‹('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ 'b ⇒ 'v ⇒ 'b ⇒ 'v ⇒ ('s, unit, 'abort) function_contract›
  where ‹swap_ref_fun_contract rA rB gA vA gB vB ≡
     let pre = rA ↦ ⟨⊤⟩ gA↓vA ⋆ rB ↦⟨⊤⟩ gB↓vB in
     let post = λ_. (⨆gA' gB'. rA ↦ ⟨⊤⟩ gA'↓vB ⋆ rB ↦⟨⊤⟩ gB'↓vA) in
      make_function_contract pre post›

text‹Note how we are force to make all contextual parameters of the function specification arguments
to the function contract definition.›

text‹As with all assertions, we need to prove that pre and post conditions are upwards closed.
Luckily, there is custom automation to do this:›
ucincl_auto swap_ref_fun_contract

text‹If the proof isn't trivial, you can use ▩‹ucincl_proof› instead to open up a proof context
for the required ▩‹ucincl› assertions.›

text‹Next, we state and prove the function specification:›

lemma swap_ref_fun_spec:
  shows ‹Γ; swap_ref_fun rA rB ⊨⇩_F swap_ref_fun_contract rA rB gA vA gB vB›
  ―‹There is a separate 'boot' tactic for specification proofs›
  apply (crush_boot f: swap_ref_fun_def contract: swap_ref_fun_contract_def)
  apply crush_base
  done

text‹Next, imagine we write a higher-level function which relies on ▩‹swap_ref_fun›.›

definition rotate_ref3 :: ‹('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ ('s, unit, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹rotate_ref3 rA rB rC ≡ FunctionBody ⟦
     swap_ref_fun (rA, rB);
     swap_ref_fun (rB, rC);
  ⟧›

text‹Next, we write the contract for ▩‹rotate_ref3›:›

definition rotate_ref3_contract ::
   ‹('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ ('a, 'b, 'v) ref ⇒ 'b ⇒ 'v ⇒ 'b ⇒ 'v ⇒ 'b ⇒ 'v ⇒ ('s, unit, 'abort) function_contract›
  where ‹rotate_ref3_contract rA rB rC gA vA gB vB gC vC ≡
     let pre = rA ↦ ⟨⊤⟩ gA↓vA ⋆ rB ↦⟨⊤⟩ gB↓vB ⋆ rC ↦⟨⊤⟩ gC↓vC in
     let post = λ_. (⨆gA' gB' gC'. rA ↦ ⟨⊤⟩ gA'↓vB ⋆ rB ↦⟨⊤⟩ gB'↓vC ⋆ rC ↦⟨⊤⟩ gC'↓vA) in
      make_function_contract pre post›
ucincl_auto rotate_ref3_contract

text‹To prove the specification for ▩‹rotate_ref3›, we could just unfold/inline the definition of
▩‹swap_ref_fun›:›

lemma rotate_ref3_spec:
  shows ‹Γ; rotate_ref3 rA rB rC ⊨⇩_F rotate_ref3_contract rA rB rC gA vA gB vB gC vC›
  ―‹There is a separate 'boot' tactic for specification proofs›
  apply (crush_boot f: rotate_ref3_def contract: rotate_ref3_contract_def)
  apply (crush_base simp add: swap_ref_fun_def)
  done

text‹This ∗‹can› work, but may lead to issues with SSA normalization. The more robust way to inline
a function is to explicitly mark it via the ▩‹inline: ...› configuration option:›

lemma rotate_ref3_spec':
  shows ‹Γ; rotate_ref3 rA rB rC ⊨⇩_F rotate_ref3_contract rA rB rC gA vA gB vB gC vC›
  ―‹There is a separate 'boot' tactic for specification proofs›
  apply (crush_boot f: rotate_ref3_def contract: rotate_ref3_contract_def)
  apply (crush_base inline: swap_ref_fun_def)
  done

text‹This looks like a fine proof -- certainly without much work -- but we effectively need to reprove
the spec for ▩‹swap_ref_fun› with every invocation, which is neither scalable nor modular.

Instead, the proof should refer to the already-proven specification of ▩‹swap_ref_fun›. This is done
by registering spec and proof via ▩‹contracts add: ...› and ▩‹spec add: ...›, respectively:›

lemma
  ―‹TODO: There is some issue with simplification of `f ` N` expressions which causes
  bad schematics unless we remove ▩‹image_cong› as a congruence rule. To be investigated.›
  notes image_cong[cong del]
  shows ‹Γ; rotate_ref3 rA rB rC ⊨⇩_F rotate_ref3_contract rA rB rC gA vA gB vB gC vC›
  ―‹There is a separate 'boot' tactic for specification proofs›
  apply (crush_boot f: rotate_ref3_def contract: rotate_ref3_contract_def)
  using [[crush_log_toplevel]]
  apply (crush_base specs add: swap_ref_fun_spec contracts add: swap_ref_fun_contract_def)
  done

text‹To avoid explicitly listing all specifications and contracts needed by a proof, one can
alternatively globally or locally populate the underlying named theorem lists ▩‹crush_specs›
and ▩‹crush_contracts›. This is the prevalent style used in the code base, where we mark
contracts/specs as ▩‹crush_contracts/crush_specs› at time of definition/proofs.›

declare swap_ref_fun_spec[crush_specs, crush_specs_eager]
declare swap_ref_fun_contract_def[crush_contracts]

lemma
  shows ‹Γ; rotate_ref3 rA rB rC ⊨⇩_F rotate_ref3_contract rA rB rC gA vA gB vB gC vC›
  ―‹There is a separate 'boot' tactic for specification proofs›
  apply (crush_boot f: rotate_ref3_def contract: rotate_ref3_contract_def)
  by crush_base

no_adhoc_overloading store_update_const ⇌
  update_fun

end

subsubsection‹Reasoning about references and structures›

text‹The next examples demonstrate how ▩‹crush› can reason about structure and array manipulation.
They are also useful as vehicles to understand and improve the underlying focus and lens mechanics.›

text‹The examples will operate on the following structures:›

datatype_record test_record =
  foo :: int
  bar :: int
micro_rust_record test_record

datatype_record test_record2 =
  f0 :: int
  f1 :: int
  f2 :: int
  f3 :: int
  f4 :: int
  f5 :: int
  f6 :: int
  f7 :: int
  f8 :: int
  f9 :: int
  f10 :: int
  f11 :: int
  f12 :: int
  f13 :: int
  f14 :: int
  f15 :: int
  f16 :: int
  f17 :: int
  f18 :: int
  f19 :: int
  f20 :: int
micro_rust_record test_record2

datatype_record test_record3 =
  data :: ‹(int, 20) array›
  rest :: int
micro_rust_record test_record3

context reference
begin
adhoc_overloading store_update_const ⇌
  update_fun

text‹Overwrite one structure field, return the other:›

definition write_foo_read_bar :: ‹('a, 'b, test_record) ref ⇒ ('s, int, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹write_foo_read_bar ptr ≡ FunctionBody ⟦
     ptr.foo = 42;
     *(ptr.bar)
  ⟧›

definition write_foo_read_bar_contract ::
   ‹('a, 'b, test_record) ref ⇒ 'b ⇒ test_record ⇒ ('s, int, 'abort) function_contract›
  where ‹write_foo_read_bar_contract r g v ≡
     let pre = r ↦ ⟨⊤⟩ g↓v in
     let post = λt. ⟨t = test_record.bar v⟩ ⋆ (⨆g'. r ↦ ⟨⊤⟩ g'↓(test_record.update_foo (λ_. 42) v)) in
      make_function_contract pre post›
ucincl_auto write_foo_read_bar_contract

lemma write_foo_read_bar_spec:
  shows ‹Γ; write_foo_read_bar ptr ⊨⇩_F write_foo_read_bar_contract ptr g v›
  apply (crush_boot f: write_foo_read_bar_def contract: write_foo_read_bar_contract_def)
  using [[crush_log_toplevel]]
  apply crush_base
  done

text‹Let's prove the same in single-stepping mode:›

lemma write_foo_read_bar_spec':
  shows ‹Γ; write_foo_read_bar ptr ⊨⇩_F write_foo_read_bar_contract ptr g v›
  apply (crush_boot f: write_foo_read_bar_def contract: write_foo_read_bar_contract_def)
  using [[crush_log_toplevel]]
  apply (crush_base stepwise)
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  step
  done

text‹Clear many structure fields, return another:›

definition test_record2_zeroize :: ‹('a, 'b, test_record2) ref ⇒ ('s, int, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹test_record2_zeroize ptr ≡ FunctionBody ⟦
     ptr.f0 = 0;
     ptr.f1 = 0;
     ptr.f2 = 0;
     ptr.f3 = 0;
     ptr.f4 = 0;
     ptr.f5 = 0;
     ptr.f6 = 0;
     ptr.f7 = 0;
     ptr.f8 = 0;
     ptr.f9 = 0;
     ptr.f10 = 0;
     ptr.f11 = 0;
     ptr.f12 = 0;
     ptr.f13 = 0;
     ptr.f14 = 0;
     ptr.f15 = 0;
     ptr.f16 = 0;
     ptr.f17 = 0;
     ptr.f18 = 0;
     ptr.f19 = 0;
     *(ptr.f20)
  ⟧›

definition test_record2_zeroize_contract ::
   ‹('a, 'b, test_record2) ref ⇒ 'b ⇒ test_record2 ⇒ ('s, int, 'abort) function_contract›
  where ‹test_record2_zeroize_contract r g v ≡
     let pre = r ↦ ⟨⊤⟩ g↓v in
     let post = λt. ⟨t = test_record2.f20 v⟩ ⋆ (⨆g'. r ↦ ⟨⊤⟩ g'↓(
          test_record2.update_f0 (λ_. 0) (
          test_record2.update_f1 (λ_. 0) (
          test_record2.update_f2 (λ_. 0) (
          test_record2.update_f3 (λ_. 0) (
          test_record2.update_f4 (λ_. 0) (
          test_record2.update_f5 (λ_. 0) (
          test_record2.update_f6 (λ_. 0) (
          test_record2.update_f7 (λ_. 0) (
          test_record2.update_f8 (λ_. 0) (
          test_record2.update_f9 (λ_. 0) (
          test_record2.update_f10 (λ_. 0) (
          test_record2.update_f11 (λ_. 0) (
          test_record2.update_f12 (λ_. 0) (
          test_record2.update_f13 (λ_. 0) (
          test_record2.update_f14 (λ_. 0) (
          test_record2.update_f15 (λ_. 0) (
          test_record2.update_f16 (λ_. 0) (
          test_record2.update_f17 (λ_. 0) (
          test_record2.update_f18 (λ_. 0) (
          test_record2.update_f19 (λ_. 0) (
            v
          )))))))))))))))))))))) in
      make_function_contract pre post›
ucincl_auto test_record2_zeroize_contract

lemma test_record2_zeroize_contract_spec:
  shows ‹Γ; test_record2_zeroize ptr ⊨⇩_F test_record2_zeroize_contract ptr g v›
  apply (crush_boot f: test_record2_zeroize_def contract: test_record2_zeroize_contract_def)
  using [[crush_time_steps]]
  applyτ (crush_base)
  show_timelogs
  done

text‹Another similar stress test, but this time using array accesses behind a function wrapper.›

definition test_record3_zero_field :: ‹('a, 'b, test_record3) ref ⇒ 64 word ⇒ ('s, unit, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹test_record3_zero_field r i ≡ FunctionBody ⟦
     r.data[i] = 0
  ⟧›

definition test_record3_zero_field_contract ::
   ‹('a, 'b, test_record3) ref ⇒ 64 word ⇒ 'b ⇒ test_record3 ⇒ ('s, unit, 'abort) function_contract›
  where [crush_contracts]: ‹test_record3_zero_field_contract r i g v ≡
     let pre = r ↦ ⟨⊤⟩ g↓v ⋆ ⟨i < 20⟩ in
     let zero_ith_pure :: test_record3 ⇒ test_record3 = (λt. t ⦇ data := array_update (data t) (unat i) 0 ⦈ ) in
     let post = λ_. (⨆g'. r ↦ ⟨⊤⟩ g'↓(zero_ith_pure v)) in
      make_function_contract pre post›
ucincl_auto test_record3_zero_field_contract

text‹For many non-trivial examples it is useful to conduct some Isar-style reasoning prior to
starting the ▩‹apply›-style ▩‹crush› proof. The following proof demonstrates this pattern:›

lemma test_record3_zero_field_spec[crush_specs]:
  shows ‹Γ; test_record3_zero_field ptr i ⊨⇩_F test_record3_zero_field_contract ptr i g v›
proof (crush_boot f: test_record3_zero_field_def contract: test_record3_zero_field_contract_def; goal_cases)
  case 1 note assms = this
  from assms have ‹unat i < 20›
    by (simp add: unat_less_helper)
  then show ?case
    by crush_base
qed

definition test_record3_zeroize :: ‹('a, 'b, test_record3) ref ⇒ ('s, int, 'abort, 'i prompt, 'o prompt_output) function_body› where
  ‹test_record3_zeroize ptr ≡ FunctionBody ⟦
     ptr.test_record3_zero_field(0);
     ptr.test_record3_zero_field(1);
     ptr.test_record3_zero_field(2);
     ptr.test_record3_zero_field(3);
     ptr.test_record3_zero_field(4);
     ptr.test_record3_zero_field(5);
     ptr.test_record3_zero_field(6);
     ptr.test_record3_zero_field(7);
     ptr.test_record3_zero_field(8);
     ptr.test_record3_zero_field(9);
     ptr.test_record3_zero_field(10);
     ptr.test_record3_zero_field(11);
     ptr.test_record3_zero_field(12);
     ptr.test_record3_zero_field(13);
     ptr.test_record3_zero_field(14);
     ptr.test_record3_zero_field(15);
     ptr.test_record3_zero_field(16);
     ptr.test_record3_zero_field(17);
     ptr.test_record3_zero_field(18);
     ptr.test_record3_zero_field(19);
     *(ptr.rest)
  ⟧›

definition test_record3_zeroize_contract ::
   ‹('a, 'b, test_record3) ref ⇒ 'b ⇒ test_record3 ⇒ ('s, int, 'abort) function_contract›
  where ‹test_record3_zeroize_contract r g v ≡
     let pre = r ↦ ⟨⊤⟩ g↓v in
     let zero_data_pure :: test_record3 ⇒ test_record3 = (λt. t ⦇ data := array_constant 0 ⦈ ) in
     let post = λret. ⟨ret = rest v⟩ ⋆ (⨆g'. r ↦ ⟨⊤⟩ g'↓(zero_data_pure v)) in
      make_function_contract pre post›
ucincl_auto test_record3_zeroize_contract

lemma test_record3_zeroize_spec:
  ―‹TODO: This can go away once specs are eager by default›
  notes test_record3_zero_field_spec[crush_specs_eager]
  shows ‹Γ; test_record3_zeroize ptr ⊨⇩_F test_record3_zeroize_contract ptr g v›
―‹Again, we hoist out some pure lemma into an initial Isar-style proof block. We could
inline the argument into the ▩‹crush› call, but that would slow it down significantly because
of a large number of irrelevant assumptions.›
proof (crush_boot f: test_record3_zeroize_def contract: test_record3_zeroize_contract_def, goal_cases)
  case 1
  { fix x :: ‹(int, 20) array›
    let ?x = x
    let ?x0 = ‹array_update ?x  0 0›
    let ?x1 = ‹array_update ?x0 1 0›
    let ?x2 = ‹array_update ?x1 2 0›
    let ?x3 = ‹array_update ?x2 3 0›
    let ?x4 = ‹array_update ?x3 4 0›
    let ?x5 = ‹array_update ?x4 5 0›
    let ?x6 = ‹array_update ?x5 6 0›
    let ?x7 = ‹array_update ?x6 7 0›
    let ?x8 = ‹array_update ?x7 8 0›
    let ?x9 = ‹array_update ?x8 9 0›
    let ?x10 = ‹array_update ?x9 10 0›
    let ?x11 = ‹array_update ?x10 11 0›
    let ?x12 = ‹array_update ?x11 12 0›
    let ?x13 = ‹array_update ?x12 13 0›
    let ?x14 = ‹array_update ?x13 14 0›
    let ?x15 = ‹array_update ?x14 15 0›
    let ?x16 = ‹array_update ?x15 16 0›
    let ?x17 = ‹array_update ?x16 17 0›
    let ?x18 = ‹array_update ?x17 18 0›
    let ?x19 = ‹array_update ?x18 19 0›
    have ‹?x19 = array_constant 0›
      by (auto intro!: array_extI simp add: less_Suc_eq numeral_Bit0 numeral_Bit1)
  }
  note eq = this[simplified]
  show ?case
  ―‹We see a linear built-up of premises here, which may cause performance problems in larger examples.›
    using [[crush_time_instantiation, crush_time_steps, crush_time_log_instantiation, crush_log_toplevel,
    crush_timing_threshold=1, crush_time_base_simps]]
  applyτ (time force "crush" ‹crush_base›)
  show_timelogs
  apply (simp add: eq)
  done
qed

end

end