fix: update Reparen golden file for inferInstanceAs docstring change

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
doc: update inferInstanceAs docstring and rename normalizeInstance to wrapInstance
2026-03-25 14:24:08 +00:00 · 2026-03-25 12:52:20 +11:00 · 2026-03-25 12:33:58 +11:00 · 2026-03-24 21:14:36 +00:00 · 2026-03-24 19:05:56 +00:00 · 2026-03-24 17:51:06 +00:00
4359 changed files with 45162 additions and 13641 deletions
--- a/.claude/CLAUDE.md
+++ b/.claude/CLAUDE.md
@@ -1,7 +1,12 @@
 (In the following, use `sysctl -n hw.logicalcpu` instead of `nproc` on macOS)

+## Building
+
 To build Lean you should use `make -j$(nproc) -C build/release`.

+The build uses `ccache`, and in a sandbox `ccache` may complain about read-only file systems.
+Use `CCACHE_READONLY` and `CCACHE_TEMPDIR` instead of disabling ccache completely.
+
 ## Running Tests

 See `tests/README.md` for full documentation. Quick reference:
@@ -11,18 +16,46 @@ See `tests/README.md` for full documentation. Quick reference:
 CTEST_PARALLEL_LEVEL="$(nproc)" CTEST_OUTPUT_ON_FAILURE=1 \
 make -C build/release -j "$(nproc)" test

-# Specific test by name (supports regex via ctest -R)
+# Specific test by name (supports regex via ctest -R; double-quote special chars like |)
 CTEST_PARALLEL_LEVEL="$(nproc)" CTEST_OUTPUT_ON_FAILURE=1 \
-make -C build/release -j "$(nproc)" test ARGS='-R grind_ematch'
+make -C build/release -j "$(nproc)" test ARGS="-R 'grind_ematch'"
+
+# Multiple tests matching a pattern
+CTEST_PARALLEL_LEVEL="$(nproc)" CTEST_OUTPUT_ON_FAILURE=1 \
+make -C build/release -j "$(nproc)" test ARGS="-R 'treemap|phashmap'"

 # Rerun only previously failed tests
 CTEST_PARALLEL_LEVEL="$(nproc)" CTEST_OUTPUT_ON_FAILURE=1 \
 make -C build/release -j "$(nproc)" test ARGS='--rerun-failed'

-# Single test from tests/foo/bar/ (quick check during development)
-cd tests/foo/bar && ./run_test example_test.lean
+# Run a test manually without ctest (test pile: pass filename relative to the pile dir)
+tests/with_stage1_test_env.sh tests/elab_bench/run_bench.sh cbv_decide.lean
+tests/with_stage1_test_env.sh tests/elab/run_test.sh grind_indexmap.lean
 ```

+## Benchmark vs Test Problem Sizes
+
+Benchmarks are also run as tests. Use the `TEST_BENCH` environment variable (unset in tests, set to `1` in benchmarks) to scale problem sizes:
+
+- In `compile_bench` `.init.sh` files: check `$TEST_BENCH` and set `TEST_ARGS` accordingly
+- In `elab_bench` Lean files: use `(← IO.getEnv "TEST_BENCH") == some "1"` to switch between small (test) and large (bench) inputs
+
+See `tests/README.md` for the full benchmark writing guide.
+
+## Testing stage 2
+
+When requested to test stage 2, build it as follows:
+```
+make -C build/release stage2 -j$(nproc)
+```
+Stage 2 is *not* automatically invalidated by changes to `src/`, which allows for faster iteration
+when fixing a specific file in the stage 2 build. However, to invalidate files that already passed
+the stage 2 build, and for final validation,
+```
+make -C build/release/stage2 clean-stdlib
+```
+must be run manually before building.
+
 ## New features

 When asked to implement new features:
@@ -40,6 +73,10 @@ When asked to implement new features:
 - ONLY use the project's documented build command: `make -j$(nproc) -C build/release`
 - If a build is broken, ask the user before attempting any manual cleanup

+## stage0 Is a Copy of src
+
+**Never manually edit files under `stage0/`.** The `stage0/` directory is a snapshot of `src/` produced by `make update-stage0`. To change anything in stage0 (CMakeLists.txt, C++ source, etc.), edit the corresponding file in `src/` and let `update-stage0` propagate it.
+
 ## LSP and IDE Diagnostics

 After rebuilding, LSP diagnostics may be stale until the user interacts with files. Trust command-line test results over IDE diagnostics.
--- a/.claude/commands/release.md
+++ b/.claude/commands/release.md
@@ -121,6 +121,42 @@ The nightly build system uses branches and tags across two repositories:

 When a nightly succeeds with mathlib, all three should point to the same commit. Don't confuse these: branches are in the main lean4 repo, dated tags are in lean4-nightly.

+## CI Failures: Investigate Immediately
+
+**CRITICAL: If the checklist reports `❌ CI: X check(s) failing` for any PR, investigate immediately.**
+
+Do NOT:
+- Report it as "CI in progress" or "some checks pending"
+- Wait for the remaining checks to finish before investigating
+- Assume it's a transient failure without checking
+
+DO:
+1. Run `gh pr checks <number> --repo <owner>/<repo>` to see which specific check failed
+2. Run `gh run view <run-id> --repo <owner>/<repo> --log-failed` to see the failure output
+3. Diagnose the failure and report clearly to the user: what failed and why
+4. Propose a fix if one is obvious (e.g., subverso version mismatch, transient elan install error)
+
+The checklist now distinguishes `❌ X check(s) failing, Y still in progress` from `🔄 Y check(s) in progress`.
+Any `❌` in CI status requires immediate investigation — do not move on.
+
+## Waiting for CI or Merges
+
+Use `gh pr checks --watch` to block until a PR's CI checks complete (no polling needed).
+Run these as background bash commands so you get notified when they finish:
+
+```bash
+# Watch CI, then check merge state
+gh pr checks <number> --repo <owner>/<repo> --watch && gh pr view <number> --repo <owner>/<repo> --json state --jq '.state'
+```
+
+For multiple PRs, launch one background command per PR in parallel. When each completes,
+you'll be notified automatically via a task-notification. Do NOT use sleep-based polling
+loops: `--watch` handles the polling itself and exits as soon as checks finish.
+
+Note: `gh pr checks --watch` exits as soon as ALL checks complete (pass or fail). If some checks
+fail while others are still running, `--watch` will continue until everything settles, then exit
+with a non-zero code. So when a background `--watch` finishes, all checks are done; check which ones failed.
+
 ## Error Handling

 **CRITICAL**: If something goes wrong or a command fails:
--- a/.claude/skills/profiling/SKILL.md
+++ b/.claude/skills/profiling/SKILL.md
@@ -0,0 +1,26 @@
+---
+name: profiling
+description: Profile Lean programs with demangled names using samply and Firefox Profiler. Use when the user asks to profile a Lean binary or investigate performance.
+allowed-tools: Bash, Read, Glob, Grep
+---
+
+# Profiling Lean Programs
+
+Full documentation: `script/PROFILER_README.md`.
+
+## Quick Start
+
+```bash
+script/lean_profile.sh ./build/release/stage1/bin/lean some_file.lean
+```
+
+Requires `samply` (`cargo install samply`) and `python3`.
+
+## Agent Notes
+
+- The pipeline is interactive (serves to browser at the end). When running non-interactively, run the steps manually instead of using the wrapper script.
+- The three steps are: `samply record --save-only`, `symbolicate_profile.py`, then `serve_profile.py`.
+- `lean_demangle.py` works standalone as a stdin filter (like `c++filt`) for quick name lookups.
+- The `--raw` flag on `lean_demangle.py` gives exact demangled names without postprocessing (keeps `._redArg`, `._lam_0` suffixes as-is).
+- Use `PROFILE_KEEP=1` to keep the temp directory for later inspection.
+- The demangled profile is a standard Firefox Profiler JSON. Function names live in `threads[i].stringArray`, indexed by `threads[i].funcTable.name`.
--- a/.gitattributes
+++ b/.gitattributes
@@ -5,9 +5,3 @@ stage0/** binary linguist-generated
 # The following file is often manually edited, so do show it in diffs
 stage0/src/stdlib_flags.h -binary -linguist-generated
 doc/std/grove/GroveStdlib/Generated/** linguist-generated
-# These files should not have line endings translated on Windows, because
-# it throws off parser tests. Later lines override earlier ones, so the
-# runner code is still treated as ordinary text.
-tests/lean/docparse/* eol=lf
-tests/lean/docparse/*.lean eol=auto
-tests/lean/docparse/*.sh eol=auto
--- a/.github/workflows/awaiting-manual.yml
+++ b/.github/workflows/awaiting-manual.yml
@@ -2,16 +2,19 @@ name: Check awaiting-manual label

 on:
  merge_group:
-  pull_request:
+  pull_request_target:
    types: [opened, synchronize, reopened, labeled, unlabeled]

+permissions:
+  pull-requests: read
+
 jobs:
  check-awaiting-manual:
    runs-on: ubuntu-latest
    steps:
      - name: Check awaiting-manual label
        id: check-awaiting-manual-label
-        if: github.event_name == 'pull_request'
+        if: github.event_name == 'pull_request_target'
        uses: actions/github-script@v8
        with:
          script: |
@@ -28,7 +31,7 @@ jobs:
            }
      
      - name: Wait for manual compatibility
-        if: github.event_name == 'pull_request' && steps.check-awaiting-manual-label.outputs.awaiting == 'true'
+        if: github.event_name == 'pull_request_target' && steps.check-awaiting-manual-label.outputs.awaiting == 'true'
        run: |
          echo "::notice title=Awaiting manual::PR is marked 'awaiting-manual' but neither 'breaks-manual' nor 'builds-manual' labels are present."
          echo "This check will remain in progress until the PR is updated with appropriate manual compatibility labels."
--- a/.github/workflows/awaiting-mathlib.yml
+++ b/.github/workflows/awaiting-mathlib.yml
@@ -2,16 +2,19 @@ name: Check awaiting-mathlib label

 on:
  merge_group:
-  pull_request:
+  pull_request_target:
    types: [opened, synchronize, reopened, labeled, unlabeled]

+permissions:
+  pull-requests: read
+
 jobs:
  check-awaiting-mathlib:
    runs-on: ubuntu-latest
    steps:
      - name: Check awaiting-mathlib label
        id: check-awaiting-mathlib-label
-        if: github.event_name == 'pull_request'
+        if: github.event_name == 'pull_request_target'
        uses: actions/github-script@v8
        with:
          script: |
@@ -28,7 +31,7 @@ jobs:
            }
      
      - name: Wait for mathlib compatibility
-        if: github.event_name == 'pull_request' && steps.check-awaiting-mathlib-label.outputs.awaiting == 'true'
+        if: github.event_name == 'pull_request_target' && steps.check-awaiting-mathlib-label.outputs.awaiting == 'true'
        run: |
          echo "::notice title=Awaiting mathlib::PR is marked 'awaiting-mathlib' but neither 'breaks-mathlib' nor 'builds-mathlib' labels are present."
          echo "This check will remain in progress until the PR is updated with appropriate mathlib compatibility labels."
--- a/.github/workflows/build-template.yml
+++ b/.github/workflows/build-template.yml
@@ -33,7 +33,7 @@ jobs:
        include: ${{fromJson(inputs.config)}}
      # complete all jobs
      fail-fast: false
-    runs-on: ${{ endsWith(matrix.os, '-with-cache') && fromJSON(format('["{0}", "nscloud-git-mirror-1gb"]', matrix.os)) || matrix.os }}
+    runs-on: ${{ endsWith(matrix.os, '-with-cache') && fromJSON(format('["{0}", "nscloud-git-mirror-5gb"]', matrix.os)) || matrix.os }}
    defaults:
      run:
        shell: ${{ matrix.shell || 'nix develop -c bash -euxo pipefail {0}' }}
@@ -66,16 +66,10 @@ jobs:
          brew install ccache tree zstd coreutils gmp libuv
        if: runner.os == 'macOS'
      - name: Checkout
-        if: (!endsWith(matrix.os, '-with-cache'))
        uses: actions/checkout@v6
        with:
          # the default is to use a virtual merge commit between the PR and master: just use the PR
          ref: ${{ github.event.pull_request.head.sha }}
-      - name: Namespace Checkout
-        if: endsWith(matrix.os, '-with-cache')
-        uses: namespacelabs/nscloud-checkout-action@v8
-        with:
-          ref: ${{ github.event.pull_request.head.sha }}
      - name: Open Nix shell once
        run: true
        if: runner.os == 'Linux'
@@ -84,7 +78,7 @@ jobs:
      # (needs to be after "Install *" to use the right shell)
      - name: CI Merge Checkout
        run: |
-          git fetch --depth=1 origin ${{ github.sha }}
+          git fetch --depth=${{ matrix.name == 'Linux Lake (Cached)' && '10' || '1' }} origin ${{ github.sha }}
          git checkout FETCH_HEAD flake.nix flake.lock script/prepare-* tests/elab/importStructure.lean
        if: github.event_name == 'pull_request'
      # (needs to be after "Checkout" so files don't get overridden)
@@ -131,7 +125,7 @@ jobs:
          else
            echo "TARGET_STAGE=stage1" >> $GITHUB_ENV
          fi
-      - name: Build
+      - name: Configure Build
        run: |
          ulimit -c unlimited  # coredumps
          [ -d build ] || mkdir build
@@ -168,7 +162,21 @@ jobs:
          fi
          # contortion to support empty OPTIONS with old macOS bash
          cmake .. --preset ${{ matrix.CMAKE_PRESET || 'release' }} -B . ${{ matrix.CMAKE_OPTIONS }} ${OPTIONS[@]+"${OPTIONS[@]}"} -DLEAN_INSTALL_PREFIX=$PWD/..
-          time make $TARGET_STAGE -j$NPROC
+      - name: Build Stage 0 & Configure Stage 1
+        run: |
+          ulimit -c unlimited  # coredumps
+          time make -C build stage1-configure -j$NPROC
+      - name: Download Lake Cache
+        if: matrix.name == 'Linux Lake (Cached)'
+        run: |
+          cd src
+          ../build/stage0/bin/lake cache get --repo=${{ github.repository }}
+        timeout-minutes: 20 # prevent excessive hanging from network issues
+        continue-on-error: true
+      - name: Build Target Stage
+        run: |
+          ulimit -c unlimited  # coredumps
+          time make -C build $TARGET_STAGE -j$NPROC
      # Should be done as early as possible and in particular *before* "Check rebootstrap" which
      # changes the state of stage1/
      - name: Save Cache
@@ -187,6 +195,21 @@ jobs:
            build/stage1/**/*.c
            build/stage1/**/*.c.o*' || '' }}
          key: ${{ steps.restore-cache.outputs.cache-primary-key }}
+      - name: Upload Lake Cache
+        # Caching on cancellation created some mysterious issues perhaps related to improper build
+        # shutdown. Also, since this needs access to secrets, it cannot be run on forks.
+        if: matrix.name == 'Linux Lake' && !cancelled() && (github.event_name != 'pull_request' || github.event.pull_request.head.repo.full_name == github.repository)
+        run: |
+          curl --version
+          cd src
+          time ../build/stage0/bin/lake build -o ../build/lake-mappings.jsonl
+          time ../build/stage0/bin/lake cache put ../build/lake-mappings.jsonl --repo=${{ github.repository }}
+        env:
+          LAKE_CACHE_KEY: ${{ secrets.LAKE_CACHE_KEY }}
+          LAKE_CACHE_ARTIFACT_ENDPOINT: ${{ vars.LAKE_CACHE_ENDPOINT }}/a1
+          LAKE_CACHE_REVISION_ENDPOINT: ${{ vars.LAKE_CACHE_ENDPOINT }}/r1
+        timeout-minutes: 20 # prevent excessive hanging from network issues
+        continue-on-error: true
      - name: Install
        run: |
          make -C build/$TARGET_STAGE install
@@ -240,14 +263,16 @@ jobs:
      - name: Build Stage 2
        run: |
          make -C build -j$NPROC stage2
-        if: matrix.test-speedcenter
+        if: matrix.test-bench
      - name: Check Stage 3
        run: |
          make -C build -j$NPROC check-stage3
        if: matrix.check-stage3
-      - name: Test Speedcenter Benchmarks
-        run: nix shell github:Kha/lakeprof -c make -C build -j$NPROC bench
-        if: matrix.test-speedcenter
+      - name: Test Benchmarks
+        run: |
+          cd tests
+          nix develop -c make -C ../build -j$NPROC bench
+        if: matrix.test-bench
      - name: Check rebootstrap
        run: |
          set -e
--- a/.github/workflows/check-empty-pr.yml
+++ b/.github/workflows/check-empty-pr.yml
@@ -0,0 +1,29 @@
+name: Check for empty PR
+
+on:
+  merge_group:
+  pull_request:
+
+jobs:
+  check-empty-pr:
+    runs-on: ubuntu-latest
+    steps:
+    - uses: actions/checkout@v6
+      with:
+        ref: ${{ github.event_name == 'pull_request' && github.event.pull_request.head.sha || github.sha }}
+        fetch-depth: 0
+        filter: tree:0
+
+    - name: Check for empty diff
+      run: |
+        if [[ "${{ github.event_name }}" == "pull_request" ]]; then
+          base=$(git merge-base "origin/${{ github.base_ref }}" HEAD)
+        else
+          base=$(git rev-parse HEAD^1)
+        fi
+        if git diff --quiet "$base" HEAD --; then
+          echo "This PR introduces no changes compared to its base branch." | tee "$GITHUB_STEP_SUMMARY"
+          echo "It may be a duplicate of an already-merged PR." | tee -a "$GITHUB_STEP_SUMMARY"
+          exit 1
+        fi
+      shell: bash
--- a/.github/workflows/check-stdlib-flags.yml
+++ b/.github/workflows/check-stdlib-flags.yml
@@ -1,9 +1,12 @@
 name: Check stdlib_flags.h modifications

 on:
-  pull_request:
+  pull_request_target:
    types: [opened, synchronize, reopened, labeled, unlabeled]

+permissions:
+  pull-requests: read
+
 jobs:
  check-stdlib-flags:
    runs-on: ubuntu-latest
--- a/.github/workflows/ci.yml
+++ b/.github/workflows/ci.yml
@@ -61,15 +61,19 @@ jobs:
            git remote add nightly https://foo:'${{ secrets.PUSH_NIGHTLY_TOKEN }}'@github.com/${{ github.repository_owner }}/lean4-nightly.git
            git fetch nightly --tags
            if [[ '${{ github.event_name }}' == 'workflow_dispatch' ]]; then
-              # Manual re-release: create a revision of the most recent nightly
-              BASE_NIGHTLY=$(git tag -l 'nightly-*' | sort -rV | head -1)
-              # Strip any existing -revK suffix to get the base date tag
-              BASE_NIGHTLY="${BASE_NIGHTLY%%-rev*}"
-              REV=1
-              while git rev-parse "refs/tags/${BASE_NIGHTLY}-rev${REV}" >/dev/null 2>&1; do
-                REV=$((REV + 1))
-              done
-              LEAN_VERSION_STRING="${BASE_NIGHTLY}-rev${REV}"
+              # Manual re-release: retry today's nightly, or create a revision if it already exists
+              TODAY_NIGHTLY="nightly-$(date -u +%F)"
+              if git rev-parse "refs/tags/${TODAY_NIGHTLY}" >/dev/null 2>&1; then
+                # Today's nightly already exists, create a revision
+                REV=1
+                while git rev-parse "refs/tags/${TODAY_NIGHTLY}-rev${REV}" >/dev/null 2>&1; do
+                  REV=$((REV + 1))
+                done
+                LEAN_VERSION_STRING="${TODAY_NIGHTLY}-rev${REV}"
+              else
+                # Today's nightly doesn't exist yet (e.g. scheduled run failed), create it
+                LEAN_VERSION_STRING="${TODAY_NIGHTLY}"
+              fi
              echo "nightly=$LEAN_VERSION_STRING" >> "$GITHUB_OUTPUT"
            else
              # Scheduled: do nothing if commit already has a different tag
@@ -166,7 +170,7 @@ jobs:
      # 0: PRs without special label
      # 1: PRs with `merge-ci` label, merge queue checks, master commits
      # 2: nightlies
-      # 3: PRs with `release-ci` label, full releases
+      # 3: PRs with `release-ci` or `lake-ci` label, full releases
      - name: Set check level
        id: set-level
        # We do not use github.event.pull_request.labels.*.name here because
@@ -175,6 +179,7 @@ jobs:
        run: |
          check_level=0
          fast=false
+          lake_ci=false

          if [[ -n "${{ steps.set-release.outputs.RELEASE_TAG }}" || -n "${{ steps.set-release-custom.outputs.RELEASE_TAG }}" ]]; then
            check_level=3
@@ -189,13 +194,19 @@ jobs:
            elif echo "$labels" | grep -q "merge-ci"; then
              check_level=1
            fi
+            if echo "$labels" | grep -q "lake-ci"; then
+              lake_ci=true
+            fi
            if echo "$labels" | grep -q "fast-ci"; then
              fast=true
            fi
          fi

-          echo "check-level=$check_level" >> "$GITHUB_OUTPUT"
-          echo "fast=$fast" >> "$GITHUB_OUTPUT"
+          {
+            echo "check-level=$check_level"
+            echo "fast=$fast"
+            echo "lake-ci=$lake_ci"
+          } >> "$GITHUB_OUTPUT"
        env:
          GH_TOKEN: ${{ github.token }}

@@ -206,6 +217,7 @@ jobs:
          script: |
            const level = ${{ steps.set-level.outputs.check-level }};
            const fast = ${{ steps.set-level.outputs.fast }};
+            const lakeCi = "${{ steps.set-level.outputs.lake-ci }}" == "true";
            console.log(`level: ${level}, fast: ${fast}`);
            // use large runners where available (original repo)
            let large = ${{ github.repository == 'leanprover/lean4' }};
@@ -232,7 +244,7 @@ jobs:
                // portable release build: use channel with older glibc (2.26)
                "name": "Linux release",
                // usually not a bottleneck so make exclusive to `fast-ci`
-                "os": large && fast ? "nscloud-ubuntu-22.04-amd64-8x16-with-cache" : "ubuntu-latest",
+                "os": large && fast ? "nscloud-ubuntu-24.04-amd64-8x16-with-cache" : "ubuntu-latest",
                "release": true,
                // Special handling for release jobs. We want:
                // 1. To run it in PRs so developers get PR toolchains (so secondary without tests is sufficient)
@@ -253,15 +265,27 @@ jobs:
              },
              {
                "name": "Linux Lake",
-                "os": large ? "nscloud-ubuntu-22.04-amd64-8x16-with-cache" : "ubuntu-latest",
+                "os": large ? "nscloud-ubuntu-24.04-amd64-8x16-with-cache" : "ubuntu-latest",
                "enabled": true,
                "check-rebootstrap": level >= 1,
                "check-stage3": level >= 2,
                "test": true,
-                // NOTE: `test-speedcenter` currently seems to be broken on `ubuntu-latest`
-                "test-speedcenter": large && level >= 2,
+                // NOTE: `test-bench` currently seems to be broken on `ubuntu-latest`
+                "test-bench": large && level >= 2,
                // We are not warning-free yet on all platforms, start here
-                "CMAKE_OPTIONS": "-DLEAN_EXTRA_CXX_FLAGS=-Werror",
+                "CMAKE_OPTIONS": "-DLEAN_EXTRA_CXX_FLAGS=-Werror -DUSE_LAKE_CACHE=ON",
+              },
+              {
+                "name": "Linux Lake (Cached)",
+                "os": large ? "nscloud-ubuntu-24.04-amd64-8x16-with-cache" : "ubuntu-latest",
+                "enabled": true,
+                "check-rebootstrap": level >= 1,
+                "check-stage3": level >= 2,
+                "test": true,
+                "secondary": true,
+                // NOTE: `test-bench` currently seems to be broken on `ubuntu-latest`
+                "test-bench": large && level >= 2,
+                "CMAKE_OPTIONS": "-DLEAN_EXTRA_CXX_FLAGS=-Werror -DUSE_LAKE_CACHE=ON",
              },
              {
                "name": "Linux Reldebug",
@@ -269,11 +293,13 @@ jobs:
                "enabled": level >= 2,
                "test": true,
                "CMAKE_PRESET": "reldebug",
+                // * `elab_bench/big_do` crashes with exit code 134
+                "CTEST_OPTIONS": "-E 'elab_bench/big_do'",
              },
              {
                "name": "Linux fsanitize",
                // Always run on large if available, more reliable regarding timeouts
-                "os": large ? "nscloud-ubuntu-22.04-amd64-16x32-with-cache" : "ubuntu-latest",
+                "os": large ? "nscloud-ubuntu-24.04-amd64-16x32-with-cache" : "ubuntu-latest",
                "enabled": level >= 2,
                // do not fail nightlies on this for now
                "secondary": level <= 2,
@@ -377,6 +403,11 @@ jobs:
                job["CMAKE_OPTIONS"] = (job["CMAKE_OPTIONS"] ? job["CMAKE_OPTIONS"] + " " : "") + "-DUSE_LAKE=OFF";
              }
            }
+            if (lakeCi) {
+              for (const job of matrix) {
+                job["CMAKE_OPTIONS"] = (job["CMAKE_OPTIONS"] ? job["CMAKE_OPTIONS"] + " " : "") + "-DLAKE_CI=ON";
+              }
+            }
            console.log(`matrix:\n${JSON.stringify(matrix, null, 2)}`);
            matrix = matrix.filter((job) => job["enabled"]);
            core.setOutput('matrix', matrix.filter((job) => !job["secondary"]));
--- a/.github/workflows/labels-from-comments.yml
+++ b/.github/workflows/labels-from-comments.yml
@@ -1,5 +1,5 @@
 # This workflow allows any user to add one of the `awaiting-review`, `awaiting-author`, `WIP`,
-# `release-ci`, or a `changelog-XXX` label by commenting on the PR or issue.
+# `release-ci`, `lake-ci`, or a `changelog-XXX` label by commenting on the PR or issue.
 # If any labels from the set {`awaiting-review`, `awaiting-author`, `WIP`} are added, other labels
 # from that set are removed automatically at the same time.
 # Similarly, if any `changelog-XXX` label is added, other `changelog-YYY` labels are removed.
@@ -12,7 +12,7 @@ on:

 jobs:
  update-label:
-    if: github.event.issue.pull_request != null && (contains(github.event.comment.body, 'awaiting-review') || contains(github.event.comment.body, 'awaiting-author') || contains(github.event.comment.body, 'WIP') || contains(github.event.comment.body, 'release-ci') || contains(github.event.comment.body, 'changelog-'))
+    if: github.event.issue.pull_request != null && (contains(github.event.comment.body, 'awaiting-review') || contains(github.event.comment.body, 'awaiting-author') || contains(github.event.comment.body, 'WIP') || contains(github.event.comment.body, 'release-ci') || contains(github.event.comment.body, 'lake-ci') || contains(github.event.comment.body, 'changelog-'))
    runs-on: ubuntu-latest

    steps:
@@ -28,6 +28,7 @@ jobs:
          const awaitingAuthor = commentLines.includes('awaiting-author');
          const wip = commentLines.includes('WIP');
          const releaseCI = commentLines.includes('release-ci');
+          const lakeCI = commentLines.includes('lake-ci');
          const changelogMatch = commentLines.find(line => line.startsWith('changelog-'));

          if (awaitingReview || awaitingAuthor || wip) {
@@ -49,6 +50,9 @@ jobs:
          if (releaseCI) {
            await github.rest.issues.addLabels({ owner, repo, issue_number, labels: ['release-ci'] });
          }
+          if (lakeCI) {
+            await github.rest.issues.addLabels({ owner, repo, issue_number, labels: ['lake-ci'] });
+          }

          if (changelogMatch) {
            const changelogLabel = changelogMatch.trim();
--- a/.github/workflows/pr-body.yml
+++ b/.github/workflows/pr-body.yml
@@ -2,17 +2,23 @@ name: Check PR body for changelog convention

 on:
  merge_group:
-  pull_request:
+  pull_request_target:
    types: [opened, synchronize, reopened, edited, labeled, converted_to_draft, ready_for_review]

+permissions:
+  pull-requests: read
+
 jobs:
  check-pr-body:
    runs-on: ubuntu-latest
    steps:
      - name: Check PR body
-        if: github.event_name == 'pull_request'
+        if: github.event_name == 'pull_request_target'
        uses: actions/github-script@v8
        with:
+          # Safety note: this uses pull_request_target, so the workflow has elevated privileges.
+          # The PR title and body are only used in regex tests (read-only string matching),
+          # never interpolated into shell commands, eval'd, or written to GITHUB_ENV/GITHUB_OUTPUT.
          script: |
            const { title, body, labels, draft } = context.payload.pull_request;
            if (!draft && /^(feat|fix):/.test(title) && !labels.some(label => label.name == "changelog-no")) {
--- a/.github/workflows/restart-on-label.yml
+++ b/.github/workflows/restart-on-label.yml
@@ -7,7 +7,7 @@ on:
 jobs:
  restart-on-label:
    runs-on: ubuntu-latest
-    if: contains(github.event.label.name, 'merge-ci') || contains(github.event.label.name, 'release-ci')
+    if: contains(github.event.label.name, 'merge-ci') || contains(github.event.label.name, 'release-ci') || contains(github.event.label.name, 'lake-ci')
    steps:
    - run: |
        # Finding latest CI workflow run on current pull request
--- a/.gitignore
+++ b/.gitignore
@@ -1,7 +1,6 @@
 *~
 \#*
 .#*
-*.lock
 .lake
 lake-manifest.json
 /build
@@ -21,6 +20,9 @@ settings.json
 !.claude/settings.json
 .gdb_history
 .vscode/*
+!.vscode/settings.json
+!.vscode/tasks.json
+!.vscode/extensions.json
 script/__pycache__
 *.produced.out
 CMakeSettings.json
--- a/.vscode/extensions.json
+++ b/.vscode/extensions.json
@@ -0,0 +1,5 @@
+{
+	"recommendations": [
+		"leanprover.lean4"
+	]
+}
--- a/.vscode/settings.json
+++ b/.vscode/settings.json
@@ -0,0 +1,12 @@
+{
+	"files.insertFinalNewline": true,
+	"files.trimTrailingWhitespace": true,
+	// These require the CMake Tools extension (ms-vscode.cmake-tools).
+	"cmake.buildDirectory": "${workspaceFolder}/build/release",
+	"cmake.generator": "Unix Makefiles",
+	"[lean4]": {
+		"editor.rulers": [
+			100
+		]
+	}
+}
--- a/.vscode/tasks.json
+++ b/.vscode/tasks.json
@@ -0,0 +1,34 @@
+{
+	"version": "2.0.0",
+	"tasks": [
+		{
+			"label": "build",
+			"type": "shell",
+			"command": "make -C build/release -j$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4)",
+			"problemMatcher": [],
+			"group": {
+				"kind": "build",
+				"isDefault": true
+			}
+		},
+		{
+			"label": "build-old",
+			"type": "shell",
+			"command": "make -C build/release -j$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4) LAKE_EXTRA_ARGS=--old",
+			"problemMatcher": [],
+			"group": {
+				"kind": "build"
+			}
+		},
+		{
+			"label": "test",
+			"type": "shell",
+			"command": "NPROC=$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4); CTEST_OUTPUT_ON_FAILURE=1 make -C build/release test -j$NPROC ARGS=\"-j$NPROC\"",
+			"problemMatcher": [],
+			"group": {
+				"kind": "test",
+				"isDefault": true
+			}
+		}
+	]
+}
--- a/CMakeLists.txt
+++ b/CMakeLists.txt
@@ -41,7 +41,7 @@ if(NOT (DEFINED STAGE0_CMAKE_EXECUTABLE_SUFFIX))
  set(STAGE0_CMAKE_EXECUTABLE_SUFFIX "${CMAKE_EXECUTABLE_SUFFIX}")
 endif()

-# Don't do anything with cadical on wasm
+# Don't do anything with cadical/leantar on wasm
 if(NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
  find_program(CADICAL cadical)
  if(NOT CADICAL)
@@ -77,7 +77,45 @@ if(NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
    set(CADICAL ${CMAKE_BINARY_DIR}/cadical/cadical${CMAKE_EXECUTABLE_SUFFIX})
    list(APPEND EXTRA_DEPENDS cadical)
  endif()
-  list(APPEND CL_ARGS -DCADICAL=${CADICAL})
+  find_program(LEANTAR leantar)
+  if(NOT LEANTAR)
+    set(LEANTAR_VERSION v0.1.19)
+    if(CMAKE_SYSTEM_NAME MATCHES "Windows")
+      set(LEANTAR_ARCHIVE_SUFFIX .zip)
+      set(LEANTAR_TARGET x86_64-pc-windows-msvc)
+    else()
+      set(LEANTAR_ARCHIVE_SUFFIX .tar.gz)
+      if(CMAKE_SYSTEM_PROCESSOR MATCHES "arm64")
+        set(LEANTAR_TARGET_ARCH aarch64)
+      else()
+        set(LEANTAR_TARGET_ARCH x86_64)
+      endif()
+      if(CMAKE_SYSTEM_NAME MATCHES "Darwin")
+        set(LEANTAR_TARGET_OS apple-darwin)
+      else()
+        set(LEANTAR_TARGET_OS unknown-linux-musl)
+      endif()
+      set(LEANTAR_TARGET ${LEANTAR_TARGET_ARCH}-${LEANTAR_TARGET_OS})
+    endif()
+    set(
+      LEANTAR
+      ${CMAKE_BINARY_DIR}/leantar/leantar-${LEANTAR_VERSION}-${LEANTAR_TARGET}/leantar${CMAKE_EXECUTABLE_SUFFIX}
+    )
+    if(NOT EXISTS "${LEANTAR}")
+      file(
+        DOWNLOAD
+          https://github.com/digama0/leangz/releases/download/${LEANTAR_VERSION}/leantar-${LEANTAR_VERSION}-${LEANTAR_TARGET}${LEANTAR_ARCHIVE_SUFFIX}
+        ${CMAKE_BINARY_DIR}/leantar${LEANTAR_ARCHIVE_SUFFIX}
+      )
+      file(
+        ARCHIVE_EXTRACT
+        INPUT ${CMAKE_BINARY_DIR}/leantar${LEANTAR_ARCHIVE_SUFFIX}
+        DESTINATION ${CMAKE_BINARY_DIR}/leantar
+      )
+    endif()
+  endif()
+  list(APPEND STAGE0_ARGS -DLEANTAR=${LEANTAR})
+  list(APPEND CL_ARGS -DCADICAL=${CADICAL} -DLEANTAR=${LEANTAR})
 endif()

 if(USE_MIMALLOC)
--- a/CMakePresets.json
+++ b/CMakePresets.json
@@ -41,7 +41,7 @@
        "SMALL_ALLOCATOR": "OFF",
        "USE_MIMALLOC": "OFF",
        "BSYMBOLIC": "OFF",
-        "LEAN_TEST_VARS": "MAIN_STACK_SIZE=16000 LSAN_OPTIONS=max_leaks=10"
+        "LEAN_TEST_VARS": "MAIN_STACK_SIZE=16000 TEST_STACK_SIZE=16000 LSAN_OPTIONS=max_leaks=10"
      },
      "generator": "Unix Makefiles",
      "binaryDir": "${sourceDir}/build/sanitize"
--- a/CONTRIBUTING.md
+++ b/CONTRIBUTING.md
@@ -7,7 +7,7 @@ Helpful links
 -------

 * [Development Setup](./doc/dev/index.md)
-* [Testing](./doc/dev/testing.md)
+* [Testing](./tests/README.md)
 * [Commit convention](./doc/dev/commit_convention.md)

 Before You Submit a Pull Request (PR):
--- a/206
+++ b/206
@@ -1370,4 +1370,208 @@ FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
-SOFTWARE.
+SOFTWARE.
+==============================================================================
+leantar is by Mario Carneiro and distributed under the Apache 2.0 License:
+==============================================================================
+                                 Apache License
+                           Version 2.0, January 2004
+                        http://www.apache.org/licenses/
+
+   TERMS AND CONDITIONS FOR USE, REPRODUCTION, AND DISTRIBUTION
+
+   1. Definitions.
+
+      "License" shall mean the terms and conditions for use, reproduction,
+      and distribution as defined by Sections 1 through 9 of this document.
+
+      "Licensor" shall mean the copyright owner or entity authorized by
+      the copyright owner that is granting the License.
+
+      "Legal Entity" shall mean the union of the acting entity and all
+      other entities that control, are controlled by, or are under common
+      control with that entity. For the purposes of this definition,
+      "control" means (i) the power, direct or indirect, to cause the
+      direction or management of such entity, whether by contract or
+      otherwise, or (ii) ownership of fifty percent (50%) or more of the
+      outstanding shares, or (iii) beneficial ownership of such entity.
+
+      "You" (or "Your") shall mean an individual or Legal Entity
+      exercising permissions granted by this License.
+
+      "Source" form shall mean the preferred form for making modifications,
+      including but not limited to software source code, documentation
+      source, and configuration files.
+
+      "Object" form shall mean any form resulting from mechanical
+      transformation or translation of a Source form, including but
+      not limited to compiled object code, generated documentation,
+      and conversions to other media types.
+
+      "Work" shall mean the work of authorship, whether in Source or
+      Object form, made available under the License, as indicated by a
+      copyright notice that is included in or attached to the work
+      (an example is provided in the Appendix below).
+
+      "Derivative Works" shall mean any work, whether in Source or Object
+      form, that is based on (or derived from) the Work and for which the
+      editorial revisions, annotations, elaborations, or other modifications
+      represent, as a whole, an original work of authorship. For the purposes
+      of this License, Derivative Works shall not include works that remain
+      separable from, or merely link (or bind by name) to the interfaces of,
+      the Work and Derivative Works thereof.
+
+      "Contribution" shall mean any work of authorship, including
+      the original version of the Work and any modifications or additions
+      to that Work or Derivative Works thereof, that is intentionally
+      submitted to Licensor for inclusion in the Work by the copyright owner
+      or by an individual or Legal Entity authorized to submit on behalf of
+      the copyright owner. For the purposes of this definition, "submitted"
+      means any form of electronic, verbal, or written communication sent
+      to the Licensor or its representatives, including but not limited to
+      communication on electronic mailing lists, source code control systems,
+      and issue tracking systems that are managed by, or on behalf of, the
+      Licensor for the purpose of discussing and improving the Work, but
+      excluding communication that is conspicuously marked or otherwise
+      designated in writing by the copyright owner as "Not a Contribution."
+
+      "Contributor" shall mean Licensor and any individual or Legal Entity
+      on behalf of whom a Contribution has been received by Licensor and
+      subsequently incorporated within the Work.
+
+   2. Grant of Copyright License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      copyright license to reproduce, prepare Derivative Works of,
+      publicly display, publicly perform, sublicense, and distribute the
+      Work and such Derivative Works in Source or Object form.
+
+   3. Grant of Patent License. Subject to the terms and conditions of
+      this License, each Contributor hereby grants to You a perpetual,
+      worldwide, non-exclusive, no-charge, royalty-free, irrevocable
+      (except as stated in this section) patent license to make, have made,
+      use, offer to sell, sell, import, and otherwise transfer the Work,
+      where such license applies only to those patent claims licensable
+      by such Contributor that are necessarily infringed by their
+      Contribution(s) alone or by combination of their Contribution(s)
+      with the Work to which such Contribution(s) was submitted. If You
+      institute patent litigation against any entity (including a
+      cross-claim or counterclaim in a lawsuit) alleging that the Work
+      or a Contribution incorporated within the Work constitutes direct
+      or contributory patent infringement, then any patent licenses
+      granted to You under this License for that Work shall terminate
+      as of the date such litigation is filed.
+
+   4. Redistribution. You may reproduce and distribute copies of the
+      Work or Derivative Works thereof in any medium, with or without
+      modifications, and in Source or Object form, provided that You
+      meet the following conditions:
+
+      (a) You must give any other recipients of the Work or
+          Derivative Works a copy of this License; and
+
+      (b) You must cause any modified files to carry prominent notices
+          stating that You changed the files; and
+
+      (c) You must retain, in the Source form of any Derivative Works
+          that You distribute, all copyright, patent, trademark, and
+          attribution notices from the Source form of the Work,
+          excluding those notices that do not pertain to any part of
+          the Derivative Works; and
+
+      (d) If the Work includes a "NOTICE" text file as part of its
+          distribution, then any Derivative Works that You distribute must
+          include a readable copy of the attribution notices contained
+          within such NOTICE file, excluding those notices that do not
+          pertain to any part of the Derivative Works, in at least one
+          of the following places: within a NOTICE text file distributed
+          as part of the Derivative Works; within the Source form or
+          documentation, if provided along with the Derivative Works; or,
+          within a display generated by the Derivative Works, if and
+          wherever such third-party notices normally appear. The contents
+          of the NOTICE file are for informational purposes only and
+          do not modify the License. You may add Your own attribution
+          notices within Derivative Works that You distribute, alongside
+          or as an addendum to the NOTICE text from the Work, provided
+          that such additional attribution notices cannot be construed
+          as modifying the License.
+
+      You may add Your own copyright statement to Your modifications and
+      may provide additional or different license terms and conditions
+      for use, reproduction, or distribution of Your modifications, or
+      for any such Derivative Works as a whole, provided Your use,
+      reproduction, and distribution of the Work otherwise complies with
+      the conditions stated in this License.
+
+   5. Submission of Contributions. Unless You explicitly state otherwise,
+      any Contribution intentionally submitted for inclusion in the Work
+      by You to the Licensor shall be under the terms and conditions of
+      this License, without any additional terms or conditions.
+      Notwithstanding the above, nothing herein shall supersede or modify
+      the terms of any separate license agreement you may have executed
+      with Licensor regarding such Contributions.
+
+   6. Trademarks. This License does not grant permission to use the trade
+      names, trademarks, service marks, or product names of the Licensor,
+      except as required for reasonable and customary use in describing the
+      origin of the Work and reproducing the content of the NOTICE file.
+
+   7. Disclaimer of Warranty. Unless required by applicable law or
+      agreed to in writing, Licensor provides the Work (and each
+      Contributor provides its Contributions) on an "AS IS" BASIS,
+      WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or
+      implied, including, without limitation, any warranties or conditions
+      of TITLE, NON-INFRINGEMENT, MERCHANTABILITY, or FITNESS FOR A
+      PARTICULAR PURPOSE. You are solely responsible for determining the
+      appropriateness of using or redistributing the Work and assume any
+      risks associated with Your exercise of permissions under this License.
+
+   8. Limitation of Liability. In no event and under no legal theory,
+      whether in tort (including negligence), contract, or otherwise,
+      unless required by applicable law (such as deliberate and grossly
+      negligent acts) or agreed to in writing, shall any Contributor be
+      liable to You for damages, including any direct, indirect, special,
+      incidental, or consequential damages of any character arising as a
+      result of this License or out of the use or inability to use the
+      Work (including but not limited to damages for loss of goodwill,
+      work stoppage, computer failure or malfunction, or any and all
+      other commercial damages or losses), even if such Contributor
+      has been advised of the possibility of such damages.
+
+   9. Accepting Warranty or Additional Liability. While redistributing
+      the Work or Derivative Works thereof, You may choose to offer,
+      and charge a fee for, acceptance of support, warranty, indemnity,
+      or other liability obligations and/or rights consistent with this
+      License. However, in accepting such obligations, You may act only
+      on Your own behalf and on Your sole responsibility, not on behalf
+      of any other Contributor, and only if You agree to indemnify,
+      defend, and hold each Contributor harmless for any liability
+      incurred by, or claims asserted against, such Contributor by reason
+      of your accepting any such warranty or additional liability.
+
+   END OF TERMS AND CONDITIONS
+
+   APPENDIX: How to apply the Apache License to your work.
+
+      To apply the Apache License to your work, attach the following
+      boilerplate notice, with the fields enclosed by brackets "[]"
+      replaced with your own identifying information. (Don't include
+      the brackets!)  The text should be enclosed in the appropriate
+      comment syntax for the file format. We also recommend that a
+      file or class name and description of purpose be included on the
+      same "printed page" as the copyright notice for easier
+      identification within third-party archives.
+
+   Copyright [yyyy] [name of copyright owner]
+
+   Licensed under the Apache License, Version 2.0 (the "License");
+   you may not use this file except in compliance with the License.
+   You may obtain a copy of the License at
+
+       http://www.apache.org/licenses/LICENSE-2.0
+
+   Unless required by applicable law or agreed to in writing, software
+   distributed under the License is distributed on an "AS IS" BASIS,
+   WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+   See the License for the specific language governing permissions and
+   limitations under the License.
--- a/doc/.gitignore
+++ b/doc/.gitignore
@@ -1 +0,0 @@
-out
--- a/doc/dev/index.md
+++ b/doc/dev/index.md
@@ -1,7 +1,9 @@
 # Development Workflow

 If you want to make changes to Lean itself, start by [building Lean](../make/index.md) from a clean checkout to make sure that everything is set up correctly.
-After that, read on below to find out how to set up your editor for changing the Lean source code, followed by further sections of the development manual where applicable such as on the [test suite](testing.md) and [commit convention](commit_convention.md).
+After that, read on below to find out how to set up your editor for changing the Lean source code,
+followed by further sections of the development manual where applicable
+such as on the [test suite](../../tests/README.md) and [commit convention](commit_convention.md).

 If you are planning to make any changes that may affect the compilation of Lean itself, e.g. changes to the parser, elaborator, or compiler, you should first read about the [bootstrapping pipeline](bootstrap.md).
 You should not edit the `stage0` directory except using the commands described in that section when necessary.
@@ -61,10 +63,10 @@ you can then put `my_name/lean4:my-tag` in your `lean-toolchain` file in a proje

 ### VS Code

-There is a `lean.code-workspace` file that correctly sets up VS Code with workspace roots for the stage0/stage1 setup described above as well as with other settings.
-You should always load it when working on Lean, such as by invoking
+There is a `.vscode/` directory that correctly sets up VS Code with settings, tasks, and recommended extensions.
+Simply open the repository folder in VS Code, such as by invoking
 ```
-code lean.code-workspace
+code .
 ```
 on the command line.

--- a/doc/dev/testing.md
+++ b/doc/dev/testing.md
@@ -1,142 +0,0 @@
-# Test Suite
-
-**Warning:** This document is partially outdated.
-It describes the old test suite, which is currently in the process of being replaced.
-The new test suite's documentation can be found at [`tests/README.md`](../../tests/README.md).
-
-After [building Lean](../make/index.md) you can run all the tests using
-```
-cd build/release
-make test ARGS=-j4
-```
-Change the 4 to the maximum number of parallel tests you want to
-allow. The best choice is the number of CPU cores on your machine as
-the tests are mostly CPU bound.  You can find the number of processors
-on linux using `nproc` and on Windows it is the `NUMBER_OF_PROCESSORS`
-environment variable.
-
-You can run tests after [building a specific stage](bootstrap.md) by
-adding the `-C stageN` argument. The default when run as above is stage 1.  The
-Lean tests will automatically use that stage's corresponding Lean
-executables
-
-Running `make test` will not pick up new test files; run
-```bash
-cmake build/release/stage1
-```
-to update the list of tests.
-
-You can also use `ctest` directly if you are in the right folder.  So
-to run stage1 tests with a 300 second timeout run this:
-
-```bash
-cd build/release/stage1
-ctest -j 4 --output-on-failure --timeout 300
-```
-Useful `ctest` flags are `-R <name of test>` to run a single test, and
-`--rerun-failed` to run all tests that failed during the last run.
-You can also pass `ctest` flags via `make test ARGS="--rerun-failed"`.
-
-To get verbose output from ctest pass the `--verbose` command line
-option. Test output is normally suppressed and only summary
-information is displayed. This option will show all test output.
-
-## Test Suite Organization
-
-All these tests are included by [src/shell/CMakeLists.txt](https://github.com/leanprover/lean4/blob/master/src/shell/CMakeLists.txt):
-
- [`tests/lean`](https://github.com/leanprover/lean4/tree/master/tests/lean/): contains tests that come equipped with a
-  .lean.expected.out file. The driver script [`test_single.sh`](https://github.com/leanprover/lean4/tree/master/tests/lean/test_single.sh) runs
-  each test and checks the actual output (*.produced.out) with the
-  checked in expected output.
-
- [`tests/lean/run`](https://github.com/leanprover/lean4/tree/master/tests/lean/run/): contains tests that are run through the lean
-  command line one file at a time. These tests only look for error
-  codes and do not check the expected output even though output is
-  produced, it is ignored.
-
-  **Note:** Tests in this directory run with `-Dlinter.all=false` to reduce noise.
-  If your test needs to verify linter behavior (e.g., deprecation warnings),
-  explicitly enable the relevant linter with `set_option linter.<name> true`.
-
- [`tests/lean/interactive`](https://github.com/leanprover/lean4/tree/master/tests/lean/interactive/): are designed to test server requests at a
-  given position in the input file. Each .lean file contains comments
-  that indicate how to simulate a client request at that position.
-  using a `--^` point to the line position. Example:
-    ```lean,ignore
-    open Foo in
-    theorem tst2 (h : a ≤ b) : a + 2 ≤ b + 2 :=
-    Bla.
-      --^ completion
-    ```
-    In this example, the test driver [`test_single.sh`](https://github.com/leanprover/lean4/tree/master/tests/lean/interactive/test_single.sh) will simulate an
-    auto-completion request at `Bla.`. The expected output is stored in
-    a .lean.expected.out in the json format that is part of the
-    [Language Server
-    Protocol](https://microsoft.github.io/language-server-protocol/).
-
-    This can also be used to test the following additional requests:
-    ```
-    --^ textDocument/hover
-    --^ textDocument/typeDefinition
-    --^ textDocument/definition
-    --^ $/lean/plainGoal
-    --^ $/lean/plainTermGoal
-    --^ insert: ...
-    --^ collectDiagnostics
-    ```
-
- [`tests/lean/server`](https://github.com/leanprover/lean4/tree/master/tests/lean/server/): Tests more of the Lean `--server` protocol.
-  There are just a few of them, and it uses .log files containing
-  JSON.
-
- [`tests/compiler`](https://github.com/leanprover/lean4/tree/master/tests/compiler/): contains tests that will run the Lean compiler and
-  build an executable that is executed and the output is compared to
-  the .lean.expected.out file. This test also contains a subfolder
-  [`foreign`](https://github.com/leanprover/lean4/tree/master/tests/compiler/foreign/) which shows how to extend Lean using C++.
-
- [`tests/lean/trust0`](https://github.com/leanprover/lean4/tree/master/tests/lean/trust0): tests that run Lean in a mode that Lean doesn't
-  even trust the .olean files (i.e., trust 0).
-
- [`tests/bench`](https://github.com/leanprover/lean4/tree/master/tests/bench/): contains performance tests.
-
- [`tests/plugin`](https://github.com/leanprover/lean4/tree/master/tests/plugin/): tests that compiled Lean code can be loaded into
-  `lean` via the `--plugin` command line option.
-
-## Writing Good Tests
-
-Every test file should contain:
-* an initial `/-! -/` module docstring summarizing the test's purpose
-* a module docstring for each test section that describes what is tested
-  and, if not 100% clear, why that is the desirable behavior
-
-At the time of writing, most tests do not follow these new guidelines yet.
-For an example of a conforming test, see [`tests/lean/1971.lean`](https://github.com/leanprover/lean4/tree/master/tests/lean/1971.lean).
-
-## Fixing Tests
-
-When the Lean source code or the standard library are modified, some of the
-tests break because the produced output is slightly different, and we have
-to reflect the changes in the `.lean.expected.out` files.
-We should not blindly copy the new produced output since we may accidentally
-miss a bug introduced by recent changes.
-The test suite contains commands that allow us to see what changed in a convenient way.
-First, we must install [meld](http://meldmerge.org/). On Ubuntu, we can do it by simply executing
-
-```
-sudo apt-get install meld
-```
-
-Now, suppose `bad_class.lean` test is broken. We can see the problem by going to [`tests/lean`](https://github.com/leanprover/lean4/tree/master/tests/lean) directory and
-executing
-
-```
-./test_single.sh -i bad_class.lean
-```
-
-When the `-i` option is provided, `meld` is automatically invoked
-whenever there is discrepancy between the produced and expected
-outputs. `meld` can also be used to repair the problems.
-
-In Emacs, we can also execute `M-x lean4-diff-test-file` to check/diff the file of the current buffer.
-To mass-copy all `.produced.out` files to the respective `.expected.out` file, use `tests/lean/copy-produced`.
--- a/doc/examples/.gitignore
+++ b/doc/examples/.gitignore
@@ -0,0 +1,2 @@
+*.out.produced
+*.exit.produced
--- a/doc/examples/bintree.lean.out.expected
+++ b/doc/examples/bintree.lean.out.expected
@@ -0,0 +1,2 @@
+Tree.node (Tree.node (Tree.leaf) 1 "one" (Tree.leaf)) 2 "two" (Tree.node (Tree.leaf) 3 "three" (Tree.leaf))
+[(1, "one"), (2, "two"), (3, "three")]
--- a/doc/examples/compiler/run_test.sh
+++ b/doc/examples/compiler/run_test.sh
@@ -0,0 +1,4 @@
+leanmake --always-make bin
+
+capture ./build/bin/test hello world
+check_out_contains "[hello, world]"
--- a/doc/examples/compiler/test.lean.out.expected
+++ b/doc/examples/compiler/test.lean.out.expected
@@ -0,0 +1 @@
+[hello, world]
--- a/doc/examples/interp.lean.out.expected
+++ b/doc/examples/interp.lean.out.expected
@@ -0,0 +1,4 @@
+30
+interp.lean:146:4: warning: declaration uses `sorry`
+interp.lean:146:0: warning: declaration uses `sorry`
+3628800
--- a/doc/examples/palindromes.lean.out.expected
+++ b/doc/examples/palindromes.lean.out.expected
@@ -0,0 +1,2 @@
+true
+false
--- a/doc/examples/phoas.lean.out.expected
+++ b/doc/examples/phoas.lean.out.expected
@@ -0,0 +1,2 @@
+"(((fun x_1 => (fun x_2 => (x_1 + x_2))) 1) 2)"
+"((((fun x_1 => (fun x_2 => (x_1 + x_2))) 1) 2) + 5)"
--- a/doc/examples/run_test.sh
+++ b/doc/examples/run_test.sh
@@ -0,0 +1,4 @@
+capture_only "$1" \
+  lean -Dlinter.all=false "$1"
+check_out_file
+check_exit_is_success
--- a/doc/examples/test_single.sh
+++ b/doc/examples/test_single.sh
@@ -1,4 +0,0 @@
-#!/usr/bin/env bash
-source ../../tests/common.sh
-
-exec_check_raw lean -Dlinter.all=false "$f"
--- a/flake.nix
+++ b/flake.nix
@@ -67,5 +67,5 @@
        oldGlibc = devShellWithDist pkgsDist-old;
        oldGlibcAArch = devShellWithDist pkgsDist-old-aarch;
      };
-    }) ["x86_64-linux" "aarch64-linux"]);
+    }) ["x86_64-linux" "aarch64-linux" "aarch64-darwin"]);
 }
--- a/lean.code-workspace
+++ b/lean.code-workspace
@@ -1,60 +0,0 @@
-{
-	"folders": [
-		{
-			"path": "."
-		}
-	],
-	"settings": {
-		"files.insertFinalNewline": true,
-		"files.trimTrailingWhitespace": true,
-		"cmake.buildDirectory": "${workspaceFolder}/build/release",
-		"cmake.generator": "Unix Makefiles",
-		"[markdown]": {
-			"rewrap.wrappingColumn": 70
-		},
-		"[lean4]": {
-			"editor.rulers": [
-				100
-			]
-		}
-	},
-	"tasks": {
-		"version": "2.0.0",
-		"tasks": [
-			{
-				"label": "build",
-				"type": "shell",
-				"command": "make -C build/release -j$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4)",
-				"problemMatcher": [],
-				"group": {
-					"kind": "build",
-					"isDefault": true
-				}
-			},
-			{
-				"label": "build-old",
-				"type": "shell",
-				"command": "make -C build/release -j$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4) LAKE_EXTRA_ARGS=--old",
-				"problemMatcher": [],
-				"group": {
-					"kind": "build"
-				}
-			},
-			{
-				"label": "test",
-				"type": "shell",
-				"command": "NPROC=$(nproc 2>/dev/null || sysctl -n hw.logicalcpu 2>/dev/null || echo 4); CTEST_OUTPUT_ON_FAILURE=1 make -C build/release test -j$NPROC ARGS=\"-j$NPROC\"",
-				"problemMatcher": [],
-				"group": {
-					"kind": "test",
-					"isDefault": true
-				}
-			}
-		]
-	},
-	"extensions": {
-		"recommendations": [
-			"leanprover.lean4"
-		]
-	}
-}
--- a/script/gen_constants_cpp.py
+++ b/script/gen_constants_cpp.py
@@ -1,4 +1,4 @@
-#!/usr/bin/env python
+#!/usr/bin/env python3
 # -*- coding: utf-8 -*-
 #
 # Copyright (c) 2015 Microsoft Corporation. All rights reserved.
--- a/script/gen_tokens_cpp.py
+++ b/script/gen_tokens_cpp.py
@@ -1,4 +1,4 @@
-#!/usr/bin/env python
+#!/usr/bin/env python3
 # -*- coding: utf-8 -*-
 #
 # Copyright (c) 2015 Microsoft Corporation. All rights reserved.
--- a/script/lean_profile.sh
+++ b/script/lean_profile.sh
@@ -1,4 +1,4 @@
-#!/bin/bash
+#!/usr/bin/env bash
 # Profile a Lean binary with demangled names.
 #
 # Usage:
--- a/script/profiler/lean_demangle.py
+++ b/script/profiler/lean_demangle.py
@@ -1,9 +1,11 @@
 #!/usr/bin/env python3
 """
-Lean name demangler.
+Lean name demangler — thin wrapper around the Lean CLI tool.

-Demangles C symbol names produced by the Lean 4 compiler back into
-readable Lean hierarchical names.
+Spawns ``lean --run lean_demangle_cli.lean`` as a persistent subprocess
+and communicates via stdin/stdout pipes. This ensures a single source
+of truth for demangling logic (the Lean implementation in
+``Lean.Compiler.NameDemangling``).

 Usage as a filter (like c++filt):
    echo "l_Lean_Meta_Sym_main" | python lean_demangle.py
@@ -13,767 +15,68 @@ Usage as a module:
    print(demangle_lean_name("l_Lean_Meta_Sym_main"))
 """

+import atexit
+import os
+import subprocess
 import sys

-
-# ---------------------------------------------------------------------------
-# String.mangle / unmangle
-# ---------------------------------------------------------------------------
-
-def _is_ascii_alnum(ch):
-    """Check if ch is an ASCII letter or digit (matching Lean's isAlpha/isDigit)."""
-    return ('a' <= ch <= 'z') or ('A' <= ch <= 'Z') or ('0' <= ch <= '9')
-
-
-def mangle_string(s):
-    """Port of Lean's String.mangle: escape a single string for C identifiers."""
-    result = []
-    for ch in s:
-        if _is_ascii_alnum(ch):
-            result.append(ch)
-        elif ch == '_':
-            result.append('__')
-        else:
-            code = ord(ch)
-            if code < 0x100:
-                result.append('_x' + format(code, '02x'))
-            elif code < 0x10000:
-                result.append('_u' + format(code, '04x'))
-            else:
-                result.append('_U' + format(code, '08x'))
-    return ''.join(result)
-
-
-def _parse_hex(s, pos, n):
-    """Parse n lowercase hex digits at pos. Returns (new_pos, value) or None."""
-    if pos + n > len(s):
-        return None
-    val = 0
-    for i in range(n):
-        c = s[pos + i]
-        if '0' <= c <= '9':
-            val = (val << 4) | (ord(c) - ord('0'))
-        elif 'a' <= c <= 'f':
-            val = (val << 4) | (ord(c) - ord('a') + 10)
-        else:
-            return None
-    return (pos + n, val)
-
-
-# ---------------------------------------------------------------------------
-# Name mangling (for round-trip verification)
-# ---------------------------------------------------------------------------
-
-def _check_disambiguation(m):
-    """Port of Lean's checkDisambiguation: does mangled string m need a '00' prefix?"""
-    pos = 0
-    while pos < len(m):
-        ch = m[pos]
-        if ch == '_':
-            pos += 1
-            continue
-        if ch == 'x':
-            return _parse_hex(m, pos + 1, 2) is not None
-        if ch == 'u':
-            return _parse_hex(m, pos + 1, 4) is not None
-        if ch == 'U':
-            return _parse_hex(m, pos + 1, 8) is not None
-        if '0' <= ch <= '9':
-            return True
-        return False
-    # all underscores or empty
-    return True
-
-
-def _need_disambiguation(prev_component, mangled_next):
-    """Port of Lean's needDisambiguation."""
-    # Check if previous component (as a string) ends with '_'
-    prev_ends_underscore = (isinstance(prev_component, str) and
-                            len(prev_component) > 0 and
-                            prev_component[-1] == '_')
-    return prev_ends_underscore or _check_disambiguation(mangled_next)
-
-
-def mangle_name(components, prefix="l_"):
-    """
-    Mangle a list of name components (str or int) into a C symbol.
-    Port of Lean's Name.mangle.
-    """
-    if not components:
-        return prefix
-
-    parts = []
-    prev = None
-    for i, comp in enumerate(components):
-        if isinstance(comp, int):
-            if i == 0:
-                parts.append(str(comp) + '_')
-            else:
-                parts.append('_' + str(comp) + '_')
-        else:
-            m = mangle_string(comp)
-            if i == 0:
-                if _check_disambiguation(m):
-                    parts.append('00' + m)
-                else:
-                    parts.append(m)
-            else:
-                if _need_disambiguation(prev, m):
-                    parts.append('_00' + m)
-                else:
-                    parts.append('_' + m)
-        prev = comp
-
-    return prefix + ''.join(parts)
-
-
-# ---------------------------------------------------------------------------
-# Name demangling
-# ---------------------------------------------------------------------------
-
-def demangle_body(s):
-    """
-    Demangle a string produced by Name.mangleAux (without prefix).
-    Returns a list of components (str or int).
-
-    This is a faithful port of Lean's Name.demangleAux from NameMangling.lean.
-    """
-    components = []
-    length = len(s)
-
-    def emit(comp):
-        components.append(comp)
-
-    def decode_num(pos, n):
-        """Parse remaining digits, emit numeric component, continue."""
-        while pos < length:
-            ch = s[pos]
-            if '0' <= ch <= '9':
-                n = n * 10 + (ord(ch) - ord('0'))
-                pos += 1
-            else:
-                # Expect '_' (trailing underscore of numeric encoding)
-                pos += 1  # skip '_'
-                emit(n)
-                if pos >= length:
-                    return pos
-                # Skip separator '_' and go to name_start
-                pos += 1
-                return name_start(pos)
-        # End of string
-        emit(n)
-        return pos
-
-    def name_start(pos):
-        """Start parsing a new name component."""
-        if pos >= length:
-            return pos
-        ch = s[pos]
-        pos += 1
-        if '0' <= ch <= '9':
-            # Check for '00' disambiguation
-            if ch == '0' and pos < length and s[pos] == '0':
-                pos += 1
-                return demangle_main(pos, "", 0)
-            else:
-                return decode_num(pos, ord(ch) - ord('0'))
-        elif ch == '_':
-            return demangle_main(pos, "", 1)
-        else:
-            return demangle_main(pos, ch, 0)
-
-    def demangle_main(pos, acc, ucount):
-        """Main demangling loop."""
-        while pos < length:
-            ch = s[pos]
-            pos += 1
-
-            if ch == '_':
-                ucount += 1
-                continue
-
-            if ucount % 2 == 0:
-                # Even underscores: literal underscores in component name
-                acc += '_' * (ucount // 2) + ch
-                ucount = 0
-                continue
-
-            # Odd ucount: separator or escape
-            if '0' <= ch <= '9':
-                # End current str component, start number
-                emit(acc + '_' * (ucount // 2))
-                if ch == '0' and pos < length and s[pos] == '0':
-                    pos += 1
-                    return demangle_main(pos, "", 0)
-                else:
-                    return decode_num(pos, ord(ch) - ord('0'))
-
-            # Try hex escapes
-            if ch == 'x':
-                result = _parse_hex(s, pos, 2)
-                if result is not None:
-                    new_pos, val = result
-                    acc += '_' * (ucount // 2) + chr(val)
-                    pos = new_pos
-                    ucount = 0
-                    continue
-
-            if ch == 'u':
-                result = _parse_hex(s, pos, 4)
-                if result is not None:
-                    new_pos, val = result
-                    acc += '_' * (ucount // 2) + chr(val)
-                    pos = new_pos
-                    ucount = 0
-                    continue
-
-            if ch == 'U':
-                result = _parse_hex(s, pos, 8)
-                if result is not None:
-                    new_pos, val = result
-                    acc += '_' * (ucount // 2) + chr(val)
-                    pos = new_pos
-                    ucount = 0
-                    continue
-
-            # Name separator
-            emit(acc)
-            acc = '_' * (ucount // 2) + ch
-            ucount = 0
-
-        # End of string
-        acc += '_' * (ucount // 2)
-        if acc:
-            emit(acc)
-        return pos
-
-    name_start(0)
-    return components
-
-
-# ---------------------------------------------------------------------------
-# Prefix handling for lp_ (package prefix)
-# ---------------------------------------------------------------------------
-
-def _is_valid_string_mangle(s):
-    """Check if s is a valid output of String.mangle (no trailing bare _)."""
-    pos = 0
-    length = len(s)
-    while pos < length:
-        ch = s[pos]
-        if _is_ascii_alnum(ch):
-            pos += 1
-        elif ch == '_':
-            if pos + 1 >= length:
-                return False  # trailing bare _
-            nch = s[pos + 1]
-            if nch == '_':
-                pos += 2
-            elif nch == 'x' and _parse_hex(s, pos + 2, 2) is not None:
-                pos = _parse_hex(s, pos + 2, 2)[0]
-            elif nch == 'u' and _parse_hex(s, pos + 2, 4) is not None:
-                pos = _parse_hex(s, pos + 2, 4)[0]
-            elif nch == 'U' and _parse_hex(s, pos + 2, 8) is not None:
-                pos = _parse_hex(s, pos + 2, 8)[0]
-            else:
-                return False
-        else:
-            return False
-    return True
-
-
-def _skip_string_mangle(s, pos):
-    """
-    Skip past a String.mangle output in s starting at pos.
-    Returns the position after the mangled string (where we expect the separator '_').
-    This is a greedy scan.
-    """
-    length = len(s)
-    while pos < length:
-        ch = s[pos]
-        if _is_ascii_alnum(ch):
-            pos += 1
-        elif ch == '_':
-            if pos + 1 < length:
-                nch = s[pos + 1]
-                if nch == '_':
-                    pos += 2
-                elif nch == 'x' and _parse_hex(s, pos + 2, 2) is not None:
-                    pos = _parse_hex(s, pos + 2, 2)[0]
-                elif nch == 'u' and _parse_hex(s, pos + 2, 4) is not None:
-                    pos = _parse_hex(s, pos + 2, 4)[0]
-                elif nch == 'U' and _parse_hex(s, pos + 2, 8) is not None:
-                    pos = _parse_hex(s, pos + 2, 8)[0]
-                else:
-                    return pos  # bare '_': separator
-            else:
-                return pos
-        else:
-            return pos
-    return pos
-
-
-def _find_lp_body(s):
-    """
-    Given s = everything after 'lp_' in a symbol, find where the declaration
-    body (Name.mangleAux output) starts.
-    Returns the start index of the body within s, or None.
-
-    Strategy: try all candidate split points where the package part is a valid
-    String.mangle output and the body round-trips. Prefer the longest valid
-    package name (most specific match).
-    """
-    length = len(s)
-
-    # Collect candidate split positions: every '_' that could be the separator
-    candidates = []
-    pos = 0
-    while pos < length:
-        if s[pos] == '_':
-            candidates.append(pos)
-        pos += 1
-
-    # Try each candidate; collect all valid splits
-    valid_splits = []
-    for split_pos in candidates:
-        pkg_part = s[:split_pos]
-        if not pkg_part:
-            continue
-        if not _is_valid_string_mangle(pkg_part):
-            continue
-        body = s[split_pos + 1:]
-        if not body:
-            continue
-        components = demangle_body(body)
-        if not components:
-            continue
-        remangled = mangle_name(components, prefix="")
-        if remangled == body:
-            first = components[0]
-            # Score: prefer first component starting with uppercase
-            has_upper = isinstance(first, str) and first and first[0].isupper()
-            valid_splits.append((split_pos, has_upper))
-
-    if valid_splits:
-        # Among splits where first decl component starts uppercase, pick longest pkg.
-        # Otherwise pick shortest pkg.
-        upper_splits = [s for s in valid_splits if s[1]]
-        if upper_splits:
-            best = max(upper_splits, key=lambda x: x[0])
-        else:
-            best = min(valid_splits, key=lambda x: x[0])
-        return best[0] + 1
-
-    # Fallback: greedy String.mangle scan
-    greedy_pos = _skip_string_mangle(s, 0)
-    if greedy_pos < length and s[greedy_pos] == '_':
-        return greedy_pos + 1
-
-    return None
-
-
-# ---------------------------------------------------------------------------
-# Format name components for display
-# ---------------------------------------------------------------------------
-
-def format_name(components):
-    """Format a list of name components as a dot-separated string."""
-    return '.'.join(str(c) for c in components)
-
-
-# ---------------------------------------------------------------------------
-# Human-friendly postprocessing
-# ---------------------------------------------------------------------------
-
-# Compiler-generated suffix components — exact match
-_SUFFIX_FLAGS_EXACT = {
-    '_redArg':  'arity\u2193',
-    '_boxed':   'boxed',
-    '_impl':    'impl',
-}
-
-# Compiler-generated suffix prefixes — match with optional _N index
-# e.g., _lam, _lam_0, _lam_3, _lambda_0, _closed_2
-_SUFFIX_FLAGS_PREFIX = {
-    '_lam':     '\u03bb',
-    '_lambda':  '\u03bb',
-    '_elam':    '\u03bb',
-    '_jp':      'jp',
-    '_closed':  'closed',
-}
-
-
-def _match_suffix(component):
-    """
-    Check if a string component is a compiler-generated suffix.
-    Returns the flag label or None.
-
-    Handles both exact matches (_redArg, _boxed) and indexed suffixes
-    (_lam_0, _lambda_2, _closed_0) produced by appendIndexAfter.
-    """
-    if not isinstance(component, str):
-        return None
-    if component in _SUFFIX_FLAGS_EXACT:
-        return _SUFFIX_FLAGS_EXACT[component]
-    if component in _SUFFIX_FLAGS_PREFIX:
-        return _SUFFIX_FLAGS_PREFIX[component]
-    # Check for indexed suffix: prefix + _N
-    for prefix, label in _SUFFIX_FLAGS_PREFIX.items():
-        if component.startswith(prefix + '_'):
-            rest = component[len(prefix) + 1:]
-            if rest.isdigit():
-                return label
-    return None
-
-
-def _strip_private(components):
-    """Strip _private.Module.0. prefix. Returns (stripped_parts, is_private)."""
-    if (len(components) >= 3 and isinstance(components[0], str) and
-            components[0] == '_private'):
-        for i in range(1, len(components)):
-            if components[i] == 0:
-                if i + 1 < len(components):
-                    return components[i + 1:], True
-                break
-    return components, False
-
-
-def _strip_spec_suffixes(components):
-    """Strip trailing spec_N components (from appendIndexAfter)."""
-    parts = list(components)
-    while parts and isinstance(parts[-1], str) and parts[-1].startswith('spec_'):
-        rest = parts[-1][5:]
-        if rest.isdigit():
-            parts.pop()
-        else:
-            break
-    return parts
-
-
-def _is_spec_index(component):
-    """Check if a component is a spec_N index (from appendIndexAfter)."""
-    return (isinstance(component, str) and
-            component.startswith('spec_') and component[5:].isdigit())
-
-
-def _parse_spec_entries(rest):
-    """Parse _at_..._spec pairs into separate spec context entries.
-
-    Given components starting from the first _at_, returns:
-    - entries: list of component lists, one per _at_..._spec block
-    - remaining: components after the last _spec N (trailing suffixes)
-    """
-    entries = []
-    current_ctx = None
-    remaining = []
-    skip_next = False
-
-    for p in rest:
-        if skip_next:
-            skip_next = False
-            continue
-        if isinstance(p, str) and p == '_at_':
-            if current_ctx is not None:
-                entries.append(current_ctx)
-            current_ctx = []
-            continue
-        if isinstance(p, str) and p == '_spec':
-            if current_ctx is not None:
-                entries.append(current_ctx)
-                current_ctx = None
-            skip_next = True
-            continue
-        if isinstance(p, str) and p.startswith('_spec'):
-            if current_ctx is not None:
-                entries.append(current_ctx)
-                current_ctx = None
-            continue
-        if current_ctx is not None:
-            current_ctx.append(p)
-        else:
-            remaining.append(p)
-
-    if current_ctx is not None:
-        entries.append(current_ctx)
-
-    return entries, remaining
-
-
-def _process_spec_context(components):
-    """Process a spec context into a clean name and its flags.
-
-    Returns (name_parts, flags) where name_parts are the cleaned components
-    and flags is a deduplicated list of flag labels from compiler suffixes.
-    """
-    parts = list(components)
-    parts, _ = _strip_private(parts)
-
-    name_parts = []
-    ctx_flags = []
-    seen = set()
-
-    for p in parts:
-        flag = _match_suffix(p)
-        if flag is not None:
-            if flag not in seen:
-                ctx_flags.append(flag)
-                seen.add(flag)
-        elif _is_spec_index(p):
-            pass
-        else:
-            name_parts.append(p)
-
-    return name_parts, ctx_flags
-
-
-def postprocess_name(components):
-    """
-    Transform raw demangled components into a human-friendly display string.
-
-    Applies:
-    - Private name cleanup: _private.Module.0.Name.foo -> Name.foo [private]
-    - Hygienic name cleanup: strips _@.module._hygCtx._hyg.N
-    - Suffix folding: _redArg, _boxed, _lam_0, etc. -> [flags]
-    - Specialization: f._at_.g._spec.N -> f spec at g
-      Shown after base [flags], with context flags: spec at g[ctx_flags]
-    """
-    if not components:
-        return ""
-
-    parts = list(components)
-    flags = []
-    spec_entries = []
-
-    # --- Strip _private prefix ---
-    parts, is_private = _strip_private(parts)
-
-    # --- Strip hygienic suffixes: everything from _@ onward ---
-    at_idx = None
-    for i, p in enumerate(parts):
-        if isinstance(p, str) and p.startswith('_@'):
-            at_idx = i
-            break
-    if at_idx is not None:
-        parts = parts[:at_idx]
-
-    # --- Handle specialization: _at_ ... _spec N ---
-    at_positions = [i for i, p in enumerate(parts)
-                    if isinstance(p, str) and p == '_at_']
-    if at_positions:
-        first_at = at_positions[0]
-        base = parts[:first_at]
-        rest = parts[first_at:]
-
-        entries, remaining = _parse_spec_entries(rest)
-        for ctx_components in entries:
-            ctx_name, ctx_flags = _process_spec_context(ctx_components)
-            if ctx_name or ctx_flags:
-                spec_entries.append((ctx_name, ctx_flags))
-
-        parts = base + remaining
-
-    # --- Collect suffix flags from the end ---
-    while parts:
-        last = parts[-1]
-        flag = _match_suffix(last)
-        if flag is not None:
-            flags.append(flag)
-            parts.pop()
-        elif isinstance(last, int) and len(parts) >= 2:
-            prev_flag = _match_suffix(parts[-2])
-            if prev_flag is not None:
-                flags.append(prev_flag)
-                parts.pop()  # remove the number
-                parts.pop()  # remove the suffix
-            else:
-                break
-        else:
-            break
-
-    if is_private:
-        flags.append('private')
-
-    # --- Format result ---
-    name = '.'.join(str(c) for c in parts) if parts else '?'
-    result = name
-    if flags:
-        flag_str = ', '.join(flags)
-        result += f' [{flag_str}]'
-
-    for ctx_name, ctx_flags in spec_entries:
-        ctx_str = '.'.join(str(c) for c in ctx_name) if ctx_name else '?'
-        if ctx_flags:
-            ctx_flag_str = ', '.join(ctx_flags)
-            result += f' spec at {ctx_str}[{ctx_flag_str}]'
-        else:
-            result += f' spec at {ctx_str}'
-
-    return result
-
-
-# ---------------------------------------------------------------------------
-# Main demangling entry point
-# ---------------------------------------------------------------------------
-
-def demangle_lean_name_raw(mangled):
-    """
-    Demangle a Lean C symbol, preserving all internal name components.
-
-    Returns the exact demangled name with all compiler-generated suffixes
-    intact. Use demangle_lean_name() for human-friendly output.
-    """
-    try:
-        return _demangle_lean_name_inner(mangled, human_friendly=False)
-    except Exception:
-        return mangled
+_process = None
+_script_dir = os.path.dirname(os.path.abspath(__file__))
+_cli_script = os.path.join(_script_dir, "lean_demangle_cli.lean")
+
+
+def _get_process():
+    """Get or create the persistent Lean demangler subprocess."""
+    global _process
+    if _process is not None and _process.poll() is None:
+        return _process
+
+    lean = os.environ.get("LEAN", "lean")
+    _process = subprocess.Popen(
+        [lean, "--run", _cli_script],
+        stdin=subprocess.PIPE,
+        stdout=subprocess.PIPE,
+        stderr=subprocess.DEVNULL,
+        text=True,
+        bufsize=1,  # line buffered
+    )
+    atexit.register(_cleanup)
+    return _process
+
+
+def _cleanup():
+    global _process
+    if _process is not None:
+        try:
+            _process.stdin.close()
+            _process.wait(timeout=5)
+        except Exception:
+            _process.kill()
+        _process = None


 def demangle_lean_name(mangled):
    """
    Demangle a C symbol name produced by the Lean 4 compiler.

-    Returns a human-friendly demangled name with compiler suffixes folded
-    into readable flags. Use demangle_lean_name_raw() to preserve all
-    internal components.
+    Returns a human-friendly demangled name, or the original string if it
+    is not a Lean symbol or the demangler subprocess cannot be used.
    """
    try:
-        return _demangle_lean_name_inner(mangled, human_friendly=True)
+        proc = _get_process()
+        proc.stdin.write(mangled + "\n")
+        proc.stdin.flush()
+        result = proc.stdout.readline().rstrip("\n")
+        return result if result else mangled
    except Exception:
        return mangled


-def _demangle_lean_name_inner(mangled, human_friendly=True):
-    """Inner demangle that may raise on malformed input."""
-
-    if mangled == "_lean_main":
-        return "[lean] main"
-
-    # Handle lean_ runtime functions
-    if human_friendly and mangled.startswith("lean_apply_"):
-        rest = mangled[11:]
-        if rest.isdigit():
-            return f"<apply/{rest}>"
-
-    # Strip .cold.N suffix (LLVM linker cold function clones)
-    cold_suffix = ""
-    core = mangled
-    dot_pos = core.find('.cold.')
-    if dot_pos >= 0:
-        cold_suffix = " " + core[dot_pos:]
-        core = core[:dot_pos]
-    elif core.endswith('.cold'):
-        cold_suffix = " .cold"
-        core = core[:-5]
-
-    result = _demangle_core(core, human_friendly)
-    if result is None:
-        return mangled
-    return result + cold_suffix
-
-
-def _demangle_core(mangled, human_friendly=True):
-    """Demangle a symbol without .cold suffix. Returns None if not a Lean name."""
-
-    fmt = postprocess_name if human_friendly else format_name
-
-    # _init_ prefix
-    if mangled.startswith("_init_"):
-        rest = mangled[6:]
-        body, pkg_display = _strip_lean_prefix(rest)
-        if body is None:
-            return None
-        components = demangle_body(body)
-        if not components:
-            return None
-        name = fmt(components)
-        if pkg_display:
-            return f"[init] {name} ({pkg_display})"
-        return f"[init] {name}"
-
-    # initialize_ prefix (module init functions)
-    if mangled.startswith("initialize_"):
-        rest = mangled[11:]
-        # With package: initialize_lp_{pkg}_{body} or initialize_l_{body}
-        body, pkg_display = _strip_lean_prefix(rest)
-        if body is not None:
-            components = demangle_body(body)
-            if components:
-                name = fmt(components)
-                if pkg_display:
-                    return f"[module_init] {name} ({pkg_display})"
-                return f"[module_init] {name}"
-        # Without package: initialize_{Name.mangleAux(moduleName)}
-        if rest:
-            components = demangle_body(rest)
-            if components:
-                return f"[module_init] {fmt(components)}"
-        return None
-
-    # l_ or lp_ prefix
-    body, pkg_display = _strip_lean_prefix(mangled)
-    if body is None:
-        return None
-    components = demangle_body(body)
-    if not components:
-        return None
-    name = fmt(components)
-    if pkg_display:
-        return f"{name} ({pkg_display})"
-    return name
-
-
-def _strip_lean_prefix(s):
-    """
-    Strip the l_ or lp_ prefix from a mangled symbol.
-    Returns (body, pkg_display) where body is the Name.mangleAux output
-    and pkg_display is None or a string describing the package.
-    Returns (None, None) if the string doesn't have a recognized prefix.
-    """
-    if s.startswith("l_"):
-        return (s[2:], None)
-
-    if s.startswith("lp_"):
-        after_lp = s[3:]
-        body_start = _find_lp_body(after_lp)
-        if body_start is not None:
-            pkg_mangled = after_lp[:body_start - 1]
-            # Unmangle the package name
-            pkg_components = demangle_body(pkg_mangled)
-            if pkg_components and len(pkg_components) == 1 and isinstance(pkg_components[0], str):
-                pkg_display = pkg_components[0]
-            else:
-                pkg_display = pkg_mangled
-            return (after_lp[body_start:], pkg_display)
-        # Fallback: treat everything after lp_ as body
-        return (after_lp, "?")
-
-    return (None, None)
-
-
-# ---------------------------------------------------------------------------
-# CLI
-# ---------------------------------------------------------------------------
-
 def main():
-    """Filter stdin or arguments, demangling Lean names."""
-    import argparse
-    parser = argparse.ArgumentParser(
-        description="Demangle Lean 4 C symbol names (like c++filt for Lean)")
-    parser.add_argument('names', nargs='*',
-                        help='Names to demangle (reads stdin if none given)')
-    parser.add_argument('--raw', action='store_true',
-                        help='Output exact demangled names without postprocessing')
-    args = parser.parse_args()
-
-    demangle = demangle_lean_name_raw if args.raw else demangle_lean_name
-
-    if args.names:
-        for name in args.names:
-            print(demangle(name))
-    else:
-        for line in sys.stdin:
-            print(demangle(line.rstrip('\n')))
+    """Filter stdin, demangling Lean names."""
+    for line in sys.stdin:
+        print(demangle_lean_name(line.rstrip("\n")))


-if __name__ == '__main__':
+if __name__ == "__main__":
    main()
--- a/script/profiler/lean_demangle_cli.lean
+++ b/script/profiler/lean_demangle_cli.lean
@@ -0,0 +1,32 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Kim Morrison
+-/
+module
+
+import Lean.Compiler.NameDemangling
+
+/-!
+Lean name demangler CLI tool. Reads mangled symbol names from stdin (one per
+line) and writes demangled names to stdout. Non-Lean symbols pass through
+unchanged. Like `c++filt` but for Lean names.
+
+Usage:
+    echo "l_Lean_Meta_foo" | lean --run lean_demangle_cli.lean
+    cat symbols.txt | lean --run lean_demangle_cli.lean
+-/
+
+open Lean.Name.Demangle
+
+def main : IO Unit := do
+  let stdin ← IO.getStdin
+  let stdout ← IO.getStdout
+  repeat do
+    let line ← stdin.getLine
+    if line.isEmpty then break
+    let sym := line.trimRight
+    match demangleSymbol sym with
+    | some s => stdout.putStrLn s
+    | none => stdout.putStrLn sym
+    stdout.flush
--- a/script/profiler/test_demangle.py
+++ b/script/profiler/test_demangle.py
@@ -1,670 +0,0 @@
-#!/usr/bin/env python3
-"""Tests for the Lean name demangler."""
-
-import unittest
-import json
-import gzip
-import tempfile
-import os
-
-from lean_demangle import (
-    mangle_string, mangle_name, demangle_body, format_name,
-    demangle_lean_name, demangle_lean_name_raw, postprocess_name,
-    _parse_hex, _check_disambiguation,
-)
-
-
-class TestStringMangle(unittest.TestCase):
-    """Test String.mangle (character-level escaping)."""
-
-    def test_alphanumeric(self):
-        self.assertEqual(mangle_string("hello"), "hello")
-        self.assertEqual(mangle_string("abc123"), "abc123")
-
-    def test_underscore(self):
-        self.assertEqual(mangle_string("a_b"), "a__b")
-        self.assertEqual(mangle_string("_"), "__")
-        self.assertEqual(mangle_string("__"), "____")
-
-    def test_special_chars(self):
-        self.assertEqual(mangle_string("."), "_x2e")
-        self.assertEqual(mangle_string("a.b"), "a_x2eb")
-
-    def test_unicode(self):
-        self.assertEqual(mangle_string("\u03bb"), "_u03bb")
-        self.assertEqual(mangle_string("\U0001d55c"), "_U0001d55c")
-
-    def test_empty(self):
-        self.assertEqual(mangle_string(""), "")
-
-
-class TestNameMangle(unittest.TestCase):
-    """Test Name.mangle (hierarchical name mangling)."""
-
-    def test_simple(self):
-        self.assertEqual(mangle_name(["Lean", "Meta", "Sym", "main"]),
-                         "l_Lean_Meta_Sym_main")
-
-    def test_single_component(self):
-        self.assertEqual(mangle_name(["main"]), "l_main")
-
-    def test_numeric_component(self):
-        self.assertEqual(
-            mangle_name(["_private", "Lean", "Meta", "Basic", 0,
-                         "Lean", "Meta", "withMVarContextImp"]),
-            "l___private_Lean_Meta_Basic_0__Lean_Meta_withMVarContextImp")
-
-    def test_component_with_underscore(self):
-        self.assertEqual(mangle_name(["a_b"]), "l_a__b")
-        self.assertEqual(mangle_name(["a_b", "c"]), "l_a__b_c")
-
-    def test_disambiguation_digit_start(self):
-        self.assertEqual(mangle_name(["0foo"]), "l_000foo")
-
-    def test_disambiguation_escape_start(self):
-        self.assertEqual(mangle_name(["a", "x27"]), "l_a_00x27")
-
-    def test_numeric_root(self):
-        self.assertEqual(mangle_name([42]), "l_42_")
-        self.assertEqual(mangle_name([42, "foo"]), "l_42__foo")
-
-    def test_component_ending_with_underscore(self):
-        self.assertEqual(mangle_name(["a_", "b"]), "l_a___00b")
-
-    def test_custom_prefix(self):
-        self.assertEqual(mangle_name(["foo"], prefix="lp_pkg_"),
-                         "lp_pkg_foo")
-
-
-class TestDemangleBody(unittest.TestCase):
-    """Test demangle_body (the core Name.demangleAux algorithm)."""
-
-    def test_simple(self):
-        self.assertEqual(demangle_body("Lean_Meta_Sym_main"),
-                         ["Lean", "Meta", "Sym", "main"])
-
-    def test_single(self):
-        self.assertEqual(demangle_body("main"), ["main"])
-
-    def test_empty(self):
-        self.assertEqual(demangle_body(""), [])
-
-    def test_underscore_in_component(self):
-        self.assertEqual(demangle_body("a__b"), ["a_b"])
-        self.assertEqual(demangle_body("a__b_c"), ["a_b", "c"])
-
-    def test_numeric_component(self):
-        self.assertEqual(demangle_body("foo_42__bar"), ["foo", 42, "bar"])
-
-    def test_numeric_root(self):
-        self.assertEqual(demangle_body("42_"), [42])
-
-    def test_numeric_at_end(self):
-        self.assertEqual(demangle_body("foo_42_"), ["foo", 42])
-
-    def test_disambiguation_00(self):
-        self.assertEqual(demangle_body("a_00x27"), ["a", "x27"])
-
-    def test_disambiguation_00_at_root(self):
-        self.assertEqual(demangle_body("000foo"), ["0foo"])
-
-    def test_hex_escape_x(self):
-        self.assertEqual(demangle_body("a_x2eb"), ["a.b"])
-
-    def test_hex_escape_u(self):
-        self.assertEqual(demangle_body("_u03bb"), ["\u03bb"])
-
-    def test_hex_escape_U(self):
-        self.assertEqual(demangle_body("_U0001d55c"), ["\U0001d55c"])
-
-    def test_private_name(self):
-        body = "__private_Lean_Meta_Basic_0__Lean_Meta_withMVarContextImp"
-        self.assertEqual(demangle_body(body),
-                         ["_private", "Lean", "Meta", "Basic", 0,
-                          "Lean", "Meta", "withMVarContextImp"])
-
-    def test_boxed_suffix(self):
-        body = "foo___boxed"
-        self.assertEqual(demangle_body(body), ["foo", "_boxed"])
-
-    def test_redArg_suffix(self):
-        body = "foo_bar___redArg"
-        self.assertEqual(demangle_body(body), ["foo", "bar", "_redArg"])
-
-    def test_component_ending_underscore_disambiguation(self):
-        self.assertEqual(demangle_body("a___00b"), ["a_", "b"])
-
-
-class TestRoundTrip(unittest.TestCase):
-    """Test that mangle(demangle(x)) == x for various names."""
-
-    def _check_roundtrip(self, components):
-        mangled = mangle_name(components, prefix="")
-        demangled = demangle_body(mangled)
-        self.assertEqual(demangled, components,
-                         f"Round-trip failed: {components} -> '{mangled}' -> {demangled}")
-        mangled_with_prefix = mangle_name(components, prefix="l_")
-        self.assertTrue(mangled_with_prefix.startswith("l_"))
-        body = mangled_with_prefix[2:]
-        demangled2 = demangle_body(body)
-        self.assertEqual(demangled2, components)
-
-    def test_simple_names(self):
-        self._check_roundtrip(["Lean", "Meta", "main"])
-        self._check_roundtrip(["a"])
-        self._check_roundtrip(["Foo", "Bar", "baz"])
-
-    def test_numeric(self):
-        self._check_roundtrip(["foo", 0, "bar"])
-        self._check_roundtrip([42])
-        self._check_roundtrip(["a", 1, "b", 2, "c"])
-
-    def test_underscores(self):
-        self._check_roundtrip(["_private"])
-        self._check_roundtrip(["a_b", "c_d"])
-        self._check_roundtrip(["_at_", "_spec"])
-
-    def test_private_name(self):
-        self._check_roundtrip(["_private", "Lean", "Meta", "Basic", 0,
-                                "Lean", "Meta", "withMVarContextImp"])
-
-    def test_boxed(self):
-        self._check_roundtrip(["Lean", "Meta", "foo", "_boxed"])
-
-    def test_redArg(self):
-        self._check_roundtrip(["Lean", "Meta", "foo", "_redArg"])
-
-    def test_specialization(self):
-        self._check_roundtrip(["List", "map", "_at_", "Foo", "bar", "_spec", 3])
-
-    def test_lambda(self):
-        self._check_roundtrip(["Foo", "bar", "_lambda", 0])
-        self._check_roundtrip(["Foo", "bar", "_lambda", 2])
-
-    def test_closed(self):
-        self._check_roundtrip(["myConst", "_closed", 0])
-
-    def test_special_chars(self):
-        self._check_roundtrip(["a.b"])
-        self._check_roundtrip(["\u03bb"])
-        self._check_roundtrip(["a", "b\u2192c"])
-
-    def test_disambiguation_cases(self):
-        self._check_roundtrip(["a", "x27"])
-        self._check_roundtrip(["0foo"])
-        self._check_roundtrip(["a_", "b"])
-
-    def test_complex_real_names(self):
-        """Names modeled after real Lean compiler output."""
-        self._check_roundtrip(
-            ["Lean", "MVarId", "withContext", "_at_",
-             "_private", "Lean", "Meta", "Sym", 0,
-             "Lean", "Meta", "Sym", "BackwardRule", "apply",
-             "_spec", 2, "_redArg", "_lambda", 0, "_boxed"])
-
-
-class TestDemangleRaw(unittest.TestCase):
-    """Test demangle_lean_name_raw (exact demangling, no postprocessing)."""
-
-    def test_l_prefix(self):
-        self.assertEqual(
-            demangle_lean_name_raw("l_Lean_Meta_Sym_main"),
-            "Lean.Meta.Sym.main")
-
-    def test_l_prefix_private(self):
-        result = demangle_lean_name_raw(
-            "l___private_Lean_Meta_Basic_0__Lean_Meta_withMVarContextImp")
-        self.assertEqual(result,
-                         "_private.Lean.Meta.Basic.0.Lean.Meta.withMVarContextImp")
-
-    def test_l_prefix_boxed(self):
-        result = demangle_lean_name_raw("l_foo___boxed")
-        self.assertEqual(result, "foo._boxed")
-
-    def test_l_prefix_redArg(self):
-        result = demangle_lean_name_raw(
-            "l___private_Lean_Meta_Basic_0__Lean_Meta_withMVarContextImp___redArg")
-        self.assertEqual(
-            result,
-            "_private.Lean.Meta.Basic.0.Lean.Meta.withMVarContextImp._redArg")
-
-    def test_lean_main(self):
-        self.assertEqual(demangle_lean_name_raw("_lean_main"), "[lean] main")
-
-    def test_non_lean_names(self):
-        self.assertEqual(demangle_lean_name_raw("printf"), "printf")
-        self.assertEqual(demangle_lean_name_raw("malloc"), "malloc")
-        self.assertEqual(demangle_lean_name_raw("lean_apply_5"), "lean_apply_5")
-        self.assertEqual(demangle_lean_name_raw(""), "")
-
-    def test_init_prefix(self):
-        result = demangle_lean_name_raw("_init_l_Lean_Meta_foo")
-        self.assertEqual(result, "[init] Lean.Meta.foo")
-
-    def test_lp_prefix_simple(self):
-        mangled = mangle_name(["Lean", "Meta", "foo"], prefix="lp_std_")
-        self.assertEqual(mangled, "lp_std_Lean_Meta_foo")
-        result = demangle_lean_name_raw(mangled)
-        self.assertEqual(result, "Lean.Meta.foo (std)")
-
-    def test_lp_prefix_underscore_pkg(self):
-        pkg_mangled = mangle_string("my_pkg")
-        self.assertEqual(pkg_mangled, "my__pkg")
-        mangled = mangle_name(["Lean", "Meta", "foo"],
-                              prefix=f"lp_{pkg_mangled}_")
-        self.assertEqual(mangled, "lp_my__pkg_Lean_Meta_foo")
-        result = demangle_lean_name_raw(mangled)
-        self.assertEqual(result, "Lean.Meta.foo (my_pkg)")
-
-    def test_lp_prefix_private_decl(self):
-        mangled = mangle_name(
-            ["_private", "X", 0, "Y", "foo"], prefix="lp_pkg_")
-        self.assertEqual(mangled, "lp_pkg___private_X_0__Y_foo")
-        result = demangle_lean_name_raw(mangled)
-        self.assertEqual(result, "_private.X.0.Y.foo (pkg)")
-
-    def test_complex_specialization(self):
-        components = [
-            "Lean", "MVarId", "withContext", "_at_",
-            "_private", "Lean", "Meta", "Sym", 0,
-            "Lean", "Meta", "Sym", "BackwardRule", "apply",
-            "_spec", 2, "_redArg", "_lambda", 0, "_boxed"
-        ]
-        mangled = mangle_name(components)
-        result = demangle_lean_name_raw(mangled)
-        expected = format_name(components)
-        self.assertEqual(result, expected)
-
-    def test_cold_suffix(self):
-        result = demangle_lean_name_raw("l_Lean_Meta_foo___redArg.cold.1")
-        self.assertEqual(result, "Lean.Meta.foo._redArg .cold.1")
-
-    def test_cold_suffix_plain(self):
-        result = demangle_lean_name_raw("l_Lean_Meta_foo.cold")
-        self.assertEqual(result, "Lean.Meta.foo .cold")
-
-    def test_initialize_no_pkg(self):
-        result = demangle_lean_name_raw("initialize_Init_Control_Basic")
-        self.assertEqual(result, "[module_init] Init.Control.Basic")
-
-    def test_initialize_with_l_prefix(self):
-        result = demangle_lean_name_raw("initialize_l_Lean_Meta_foo")
-        self.assertEqual(result, "[module_init] Lean.Meta.foo")
-
-    def test_never_crashes(self):
-        """Demangling should never raise, just return the original."""
-        weird_inputs = [
-            "", "l_", "lp_", "lp_x", "_init_", "initialize_",
-            "l_____", "lp____", "l_00", "l_0",
-            "some random string", "l_ space",
-        ]
-        for inp in weird_inputs:
-            result = demangle_lean_name_raw(inp)
-            self.assertIsInstance(result, str)
-
-
-class TestPostprocess(unittest.TestCase):
-    """Test postprocess_name (human-friendly suffix folding, etc.)."""
-
-    def test_no_change(self):
-        self.assertEqual(postprocess_name(["Lean", "Meta", "main"]),
-                         "Lean.Meta.main")
-
-    def test_boxed(self):
-        self.assertEqual(postprocess_name(["foo", "_boxed"]),
-                         "foo [boxed]")
-
-    def test_redArg(self):
-        self.assertEqual(postprocess_name(["foo", "bar", "_redArg"]),
-                         "foo.bar [arity\u2193]")
-
-    def test_lambda_separate(self):
-        # _lam as separate component + numeric index
-        self.assertEqual(postprocess_name(["foo", "_lam", 0]),
-                         "foo [\u03bb]")
-
-    def test_lambda_indexed(self):
-        # _lam_0 as single string (appendIndexAfter)
-        self.assertEqual(postprocess_name(["foo", "_lam_0"]),
-                         "foo [\u03bb]")
-        self.assertEqual(postprocess_name(["foo", "_lambda_2"]),
-                         "foo [\u03bb]")
-
-    def test_lambda_boxed(self):
-        # _lam_0 followed by _boxed
-        self.assertEqual(
-            postprocess_name(["Lean", "Meta", "Simp", "simpLambda",
-                              "_lam_0", "_boxed"]),
-            "Lean.Meta.Simp.simpLambda [boxed, \u03bb]")
-
-    def test_closed(self):
-        self.assertEqual(postprocess_name(["myConst", "_closed", 3]),
-                         "myConst [closed]")
-
-    def test_closed_indexed(self):
-        self.assertEqual(postprocess_name(["myConst", "_closed_0"]),
-                         "myConst [closed]")
-
-    def test_multiple_suffixes(self):
-        self.assertEqual(postprocess_name(["foo", "_redArg", "_boxed"]),
-                         "foo [boxed, arity\u2193]")
-
-    def test_redArg_lam(self):
-        # _redArg followed by _lam_0 (issue #4)
-        self.assertEqual(
-            postprocess_name(["Lean", "profileitIOUnsafe",
-                              "_redArg", "_lam_0"]),
-            "Lean.profileitIOUnsafe [\u03bb, arity\u2193]")
-
-    def test_private_name(self):
-        self.assertEqual(
-            postprocess_name(["_private", "Lean", "Meta", "Basic", 0,
-                              "Lean", "Meta", "withMVarContextImp"]),
-            "Lean.Meta.withMVarContextImp [private]")
-
-    def test_private_with_suffix(self):
-        self.assertEqual(
-            postprocess_name(["_private", "Lean", "Meta", "Basic", 0,
-                              "Lean", "Meta", "foo", "_redArg"]),
-            "Lean.Meta.foo [arity\u2193, private]")
-
-    def test_hygienic_strip(self):
-        self.assertEqual(
-            postprocess_name(["Lean", "Meta", "foo", "_@", "Lean", "Meta",
-                              "_hyg", 42]),
-            "Lean.Meta.foo")
-
-    def test_specialization(self):
-        self.assertEqual(
-            postprocess_name(["List", "map", "_at_", "Foo", "bar",
-                              "_spec", 3]),
-            "List.map spec at Foo.bar")
-
-    def test_specialization_with_suffix(self):
-        # Base suffix _boxed appears in [flags] before spec at
-        self.assertEqual(
-            postprocess_name(["Lean", "MVarId", "withContext", "_at_",
-                              "Foo", "bar", "_spec", 2, "_boxed"]),
-            "Lean.MVarId.withContext [boxed] spec at Foo.bar")
-
-    def test_spec_context_with_flags(self):
-        # Compiler suffixes in spec context become context flags
-        self.assertEqual(
-            postprocess_name(["Lean", "Meta", "foo", "_at_",
-                              "Lean", "Meta", "bar", "_elam_1", "_redArg",
-                              "_spec", 2]),
-            "Lean.Meta.foo spec at Lean.Meta.bar[\u03bb, arity\u2193]")
-
-    def test_spec_context_flags_dedup(self):
-        # Duplicate flag labels are deduplicated
-        self.assertEqual(
-            postprocess_name(["f", "_at_",
-                              "g", "_lam_0", "_elam_1", "_redArg",
-                              "_spec", 1]),
-            "f spec at g[\u03bb, arity\u2193]")
-
-    def test_multiple_at(self):
-        # Multiple _at_ entries become separate spec at clauses
-        self.assertEqual(
-            postprocess_name(["f", "_at_", "g", "_spec", 1,
-                              "_at_", "h", "_spec", 2]),
-            "f spec at g spec at h")
-
-    def test_multiple_at_with_flags(self):
-        # Multiple spec at with flags on base and contexts
-        self.assertEqual(
-            postprocess_name(["f", "_at_", "g", "_redArg", "_spec", 1,
-                              "_at_", "h", "_lam_0", "_spec", 2,
-                              "_boxed"]),
-            "f [boxed] spec at g[arity\u2193] spec at h[\u03bb]")
-
-    def test_base_flags_before_spec(self):
-        # Base trailing suffixes appear in [flags] before spec at
-        self.assertEqual(
-            postprocess_name(["f", "_at_", "g", "_spec", 1, "_lam_0"]),
-            "f [\u03bb] spec at g")
-
-    def test_spec_context_strip_spec_suffixes(self):
-        # spec_0 in context should be stripped
-        self.assertEqual(
-            postprocess_name(["Lean", "Meta", "transformWithCache", "visit",
-                              "_at_",
-                              "_private", "Lean", "Meta", "Transform", 0,
-                              "Lean", "Meta", "transform",
-                              "Lean", "Meta", "Sym", "unfoldReducible",
-                              "spec_0", "spec_0",
-                              "_spec", 1]),
-            "Lean.Meta.transformWithCache.visit "
-            "spec at Lean.Meta.transform.Lean.Meta.Sym.unfoldReducible")
-
-    def test_spec_context_strip_private(self):
-        # _private in spec context should be stripped
-        self.assertEqual(
-            postprocess_name(["Array", "mapMUnsafe", "map", "_at_",
-                              "_private", "Lean", "Meta", "Transform", 0,
-                              "Lean", "Meta", "transformWithCache", "visit",
-                              "_spec", 1]),
-            "Array.mapMUnsafe.map "
-            "spec at Lean.Meta.transformWithCache.visit")
-
-    def test_empty(self):
-        self.assertEqual(postprocess_name([]), "")
-
-
-class TestDemangleHumanFriendly(unittest.TestCase):
-    """Test demangle_lean_name (human-friendly output)."""
-
-    def test_simple(self):
-        self.assertEqual(demangle_lean_name("l_Lean_Meta_main"),
-                         "Lean.Meta.main")
-
-    def test_boxed(self):
-        self.assertEqual(demangle_lean_name("l_foo___boxed"),
-                         "foo [boxed]")
-
-    def test_redArg(self):
-        self.assertEqual(demangle_lean_name("l_foo___redArg"),
-                         "foo [arity\u2193]")
-
-    def test_private(self):
-        self.assertEqual(
-            demangle_lean_name(
-                "l___private_Lean_Meta_Basic_0__Lean_Meta_foo"),
-            "Lean.Meta.foo [private]")
-
-    def test_private_with_redArg(self):
-        self.assertEqual(
-            demangle_lean_name(
-                "l___private_Lean_Meta_Basic_0__Lean_Meta_foo___redArg"),
-            "Lean.Meta.foo [arity\u2193, private]")
-
-    def test_cold_with_suffix(self):
-        self.assertEqual(
-            demangle_lean_name("l_Lean_Meta_foo___redArg.cold.1"),
-            "Lean.Meta.foo [arity\u2193] .cold.1")
-
-    def test_lean_apply(self):
-        self.assertEqual(demangle_lean_name("lean_apply_5"), "<apply/5>")
-        self.assertEqual(demangle_lean_name("lean_apply_12"), "<apply/12>")
-
-    def test_lean_apply_raw_unchanged(self):
-        self.assertEqual(demangle_lean_name_raw("lean_apply_5"),
-                         "lean_apply_5")
-
-    def test_init_private(self):
-        self.assertEqual(
-            demangle_lean_name(
-                "_init_l___private_X_0__Y_foo"),
-            "[init] Y.foo [private]")
-
-    def test_complex_specialization(self):
-        components = [
-            "Lean", "MVarId", "withContext", "_at_",
-            "_private", "Lean", "Meta", "Sym", 0,
-            "Lean", "Meta", "Sym", "BackwardRule", "apply",
-            "_spec", 2, "_redArg", "_lambda", 0, "_boxed"
-        ]
-        mangled = mangle_name(components)
-        result = demangle_lean_name(mangled)
-        # Base: Lean.MVarId.withContext with trailing _redArg, _lambda 0, _boxed
-        # Spec context: Lean.Meta.Sym.BackwardRule.apply (private stripped)
-        self.assertEqual(
-            result,
-            "Lean.MVarId.withContext [boxed, \u03bb, arity\u2193] "
-            "spec at Lean.Meta.Sym.BackwardRule.apply")
-
-    def test_non_lean_unchanged(self):
-        self.assertEqual(demangle_lean_name("printf"), "printf")
-        self.assertEqual(demangle_lean_name("malloc"), "malloc")
-        self.assertEqual(demangle_lean_name(""), "")
-
-
-class TestDemangleProfile(unittest.TestCase):
-    """Test the profile rewriter."""
-
-    def _make_profile_shared(self, strings):
-        """Create a profile with shared.stringArray (newer format)."""
-        return {
-            "meta": {"version": 28},
-            "libs": [],
-            "shared": {
-                "stringArray": list(strings),
-            },
-            "threads": [{
-                "name": "main",
-                "pid": "1",
-                "tid": 1,
-                "funcTable": {
-                    "name": list(range(len(strings))),
-                    "isJS": [False] * len(strings),
-                    "relevantForJS": [False] * len(strings),
-                    "resource": [-1] * len(strings),
-                    "fileName": [None] * len(strings),
-                    "lineNumber": [None] * len(strings),
-                    "columnNumber": [None] * len(strings),
-                    "length": len(strings),
-                },
-                "frameTable": {"length": 0},
-                "stackTable": {"length": 0},
-                "samples": {"length": 0},
-                "markers": {"length": 0},
-                "resourceTable": {"length": 0},
-                "nativeSymbols": {"length": 0},
-            }],
-            "pages": [],
-            "counters": [],
-        }
-
-    def _make_profile_per_thread(self, strings):
-        """Create a profile with per-thread stringArray (samply format)."""
-        return {
-            "meta": {"version": 28},
-            "libs": [],
-            "threads": [{
-                "name": "main",
-                "pid": "1",
-                "tid": 1,
-                "stringArray": list(strings),
-                "funcTable": {
-                    "name": list(range(len(strings))),
-                    "isJS": [False] * len(strings),
-                    "relevantForJS": [False] * len(strings),
-                    "resource": [-1] * len(strings),
-                    "fileName": [None] * len(strings),
-                    "lineNumber": [None] * len(strings),
-                    "columnNumber": [None] * len(strings),
-                    "length": len(strings),
-                },
-                "frameTable": {"length": 0},
-                "stackTable": {"length": 0},
-                "samples": {"length": 0},
-                "markers": {"length": 0},
-                "resourceTable": {"length": 0},
-                "nativeSymbols": {"length": 0},
-            }],
-            "pages": [],
-            "counters": [],
-        }
-
-    def test_profile_rewrite_shared(self):
-        from lean_demangle_profile import rewrite_profile
-        strings = [
-            "l_Lean_Meta_Sym_main",
-            "printf",
-            "lean_apply_5",
-            "l___private_Lean_Meta_Basic_0__Lean_Meta_foo",
-        ]
-        profile = self._make_profile_shared(strings)
-        rewrite_profile(profile)
-        sa = profile["shared"]["stringArray"]
-        self.assertEqual(sa[0], "Lean.Meta.Sym.main")
-        self.assertEqual(sa[1], "printf")
-        self.assertEqual(sa[2], "<apply/5>")
-        self.assertEqual(sa[3], "Lean.Meta.foo [private]")
-
-    def test_profile_rewrite_per_thread(self):
-        from lean_demangle_profile import rewrite_profile
-        strings = [
-            "l_Lean_Meta_Sym_main",
-            "printf",
-            "lean_apply_5",
-            "l___private_Lean_Meta_Basic_0__Lean_Meta_foo",
-        ]
-        profile = self._make_profile_per_thread(strings)
-        count = rewrite_profile(profile)
-        sa = profile["threads"][0]["stringArray"]
-        self.assertEqual(sa[0], "Lean.Meta.Sym.main")
-        self.assertEqual(sa[1], "printf")
-        self.assertEqual(sa[2], "<apply/5>")
-        self.assertEqual(sa[3], "Lean.Meta.foo [private]")
-        self.assertEqual(count, 3)
-
-    def test_profile_json_roundtrip(self):
-        from lean_demangle_profile import process_profile_file
-        strings = ["l_Lean_Meta_main", "malloc"]
-        profile = self._make_profile_shared(strings)
-
-        with tempfile.NamedTemporaryFile(mode='w', suffix='.json',
-                                         delete=False) as f:
-            json.dump(profile, f)
-            inpath = f.name
-
-        outpath = inpath.replace('.json', '-demangled.json')
-        try:
-            process_profile_file(inpath, outpath)
-            with open(outpath) as f:
-                result = json.load(f)
-            self.assertEqual(result["shared"]["stringArray"][0],
-                             "Lean.Meta.main")
-            self.assertEqual(result["shared"]["stringArray"][1], "malloc")
-        finally:
-            os.unlink(inpath)
-            if os.path.exists(outpath):
-                os.unlink(outpath)
-
-    def test_profile_gzip_roundtrip(self):
-        from lean_demangle_profile import process_profile_file
-        strings = ["l_Lean_Meta_main", "malloc"]
-        profile = self._make_profile_shared(strings)
-
-        with tempfile.NamedTemporaryFile(suffix='.json.gz',
-                                         delete=False) as f:
-            with gzip.open(f, 'wt') as gz:
-                json.dump(profile, gz)
-            inpath = f.name
-
-        outpath = inpath.replace('.json.gz', '-demangled.json.gz')
-        try:
-            process_profile_file(inpath, outpath)
-            with gzip.open(outpath, 'rt') as f:
-                result = json.load(f)
-            self.assertEqual(result["shared"]["stringArray"][0],
-                             "Lean.Meta.main")
-        finally:
-            os.unlink(inpath)
-            if os.path.exists(outpath):
-                os.unlink(outpath)
-
-
-if __name__ == '__main__':
-    unittest.main()
--- a/script/release_checklist.py
+++ b/script/release_checklist.py
@@ -11,7 +11,7 @@ IMPORTANT: Keep this documentation up-to-date when modifying the script's behavi
 What this script does:
 1. Validates preliminary Lean4 release infrastructure:
   - Checks that the release branch (releases/vX.Y.0) exists
-   - Verifies CMake version settings are correct
+   - Verifies CMake version settings are correct (both src/ and stage0/)
   - Confirms the release tag exists
   - Validates the release page exists on GitHub (created automatically by CI after tag push)
   - Checks the release notes page on lean-lang.org (updated while bumping the `reference-manual` repository)
@@ -236,7 +236,7 @@ def parse_version(version_str):
 def is_version_gte(version1, version2):
    """Check if version1 >= version2, including proper handling of release candidates."""
    # Check if version1 is a nightly toolchain
-    if version1.startswith("leanprover/lean4:nightly-"):
+    if version1.startswith("leanprover/lean4:nightly-") or version1.startswith("leanprover/lean4-nightly:"):
        return False
    return parse_version(version1) >= parse_version(version2)

@@ -326,6 +326,42 @@ def check_cmake_version(repo_url, branch, version_major, version_minor, github_t
    print(f"  ✅ CMake version settings are correct in {cmake_file_path}")
    return True

+def check_stage0_version(repo_url, branch, version_major, version_minor, github_token):
+    """Verify that stage0/src/CMakeLists.txt has the same version as src/CMakeLists.txt.
+
+    The stage0 pre-built binaries stamp .olean headers with their baked-in version.
+    If stage0 has a different version (e.g. from a 'begin development cycle' bump),
+    the release tarball will contain .olean files with the wrong version.
+    """
+    stage0_cmake = "stage0/src/CMakeLists.txt"
+    content = get_branch_content(repo_url, branch, stage0_cmake, github_token)
+    if content is None:
+        print(f"  ❌ Could not retrieve {stage0_cmake} from {branch}")
+        return False
+
+    errors = []
+    for line in content.splitlines():
+        stripped = line.strip()
+        if stripped.startswith("set(LEAN_VERSION_MAJOR "):
+            actual = stripped.split()[1].rstrip(")")
+            if actual != str(version_major):
+                errors.append(f"LEAN_VERSION_MAJOR: expected {version_major}, found {actual}")
+        elif stripped.startswith("set(LEAN_VERSION_MINOR "):
+            actual = stripped.split()[1].rstrip(")")
+            if actual != str(version_minor):
+                errors.append(f"LEAN_VERSION_MINOR: expected {version_minor}, found {actual}")
+
+    if errors:
+        print(f"  ❌ stage0 version mismatch in {stage0_cmake}:")
+        for error in errors:
+            print(f"     {error}")
+        print("     The stage0 compiler stamps .olean headers with its baked-in version.")
+        print("     Run `make update-stage0` to rebuild stage0 with the correct version.")
+        return False
+
+    print(f"  ✅ stage0 version matches in {stage0_cmake}")
+    return True
+
 def extract_org_repo_from_url(repo_url):
    """Extract the 'org/repo' part from a GitHub URL."""
    if repo_url.startswith("https://github.com/"):
@@ -441,7 +477,10 @@ def get_pr_ci_status(repo_url, pr_number, github_token):
    conclusions = [run['conclusion'] for run in check_runs if run.get('status') == 'completed']
    in_progress = [run for run in check_runs if run.get('status') in ['queued', 'in_progress']]

+    failed = sum(1 for c in conclusions if c in ['failure', 'timed_out', 'action_required'])
    if in_progress:
+        if failed > 0:
+            return "failure", f"{failed} check(s) failing, {len(in_progress)} still in progress"
        return "pending", f"{len(in_progress)} check(s) in progress"

    if not conclusions:
@@ -450,7 +489,6 @@ def get_pr_ci_status(repo_url, pr_number, github_token):
    if all(c == 'success' for c in conclusions):
        return "success", f"All {len(conclusions)} checks passed"

-    failed = sum(1 for c in conclusions if c in ['failure', 'timed_out', 'action_required'])
    if failed > 0:
        return "failure", f"{failed} check(s) failed"

@@ -680,6 +718,9 @@ def main():
        # Check CMake version settings
        if not check_cmake_version(lean_repo_url, branch_name, version_major, version_minor, github_token):
            lean4_success = False
+        # Check that stage0 version matches (stage0 stamps .olean headers with its version)
+        if not check_stage0_version(lean_repo_url, branch_name, version_major, version_minor, github_token):
+            lean4_success = False

    # Check for tag and release page
    if not tag_exists(lean_repo_url, toolchain, github_token):
@@ -965,14 +1006,15 @@ def main():
        # Find the actual minor version in CMakeLists.txt
        for line in cmake_lines:
            if line.strip().startswith("set(LEAN_VERSION_MINOR "):
-                actual_minor = int(line.split()[-1].rstrip(")"))
+                m = re.search(r'set\(LEAN_VERSION_MINOR\s+(\d+)', line)
+                actual_minor = int(m.group(1)) if m else 0
                version_minor_correct = actual_minor >= next_minor
                break
        else:
            version_minor_correct = False
            
        is_release_correct = any(
-            l.strip().startswith("set(LEAN_VERSION_IS_RELEASE 0)") 
+            re.match(r'set\(LEAN_VERSION_IS_RELEASE\s+0[\s)]', l.strip())
            for l in cmake_lines
        )
        
--- a/script/release_steps.py
+++ b/script/release_steps.py
@@ -479,6 +479,25 @@ def execute_release_steps(repo, version, config):
        print(blue("Updating lakefile.toml..."))
        run_command(f'perl -pi -e \'s/"v4\\.[0-9]+(\\.[0-9]+)?(-rc[0-9]+)?"/"' + version + '"/g\' lakefile.*', cwd=repo_path)
        run_command("lake update", cwd=repo_path, stream_output=True)
+    elif repo_name == "verso":
+        # verso has nested Lake projects in test-projects/ that each have their own
+        # lake-manifest.json with a subverso pin. After updating the root manifest via
+        # `lake update`, sync the de-modulized subverso rev into all sub-manifests.
+        # The sub-projects use an old toolchain (v4.21.0) that doesn't support module/prelude
+        # syntax, so they need the de-modulized version (tagged no-modules/<root-rev>).
+        # The "SubVerso version consistency" CI check accepts either the root or de-modulized rev.
+        run_command("lake update", cwd=repo_path, stream_output=True)
+        print(blue("Syncing de-modulized subverso rev to test-project sub-manifests..."))
+        sync_script = (
+            'ROOT_REV=$(jq -r \'.packages[] | select(.name == "subverso") | .rev\' lake-manifest.json); '
+            'SUBVERSO_URL=$(jq -r \'.packages[] | select(.name == "subverso") | .url\' lake-manifest.json); '
+            'DEMOD_REV=$(git ls-remote "$SUBVERSO_URL" "refs/tags/no-modules/$ROOT_REV" | awk \'{print $1}\'); '
+            'find test-projects -name lake-manifest.json -print0 | while IFS= read -r -d \'\' f; do '
+            'jq --arg rev "$DEMOD_REV" \'.packages |= map(if .name == "subverso" then .rev = $rev else . end)\' "$f" > "$f.tmp" && mv "$f.tmp" "$f"; '
+            'done'
+        )
+        run_command(sync_script, cwd=repo_path)
+        print(green("Synced de-modulized subverso rev to all test-project sub-manifests"))
    elif dependencies:
        run_command(f'perl -pi -e \'s/"v4\\.[0-9]+(\\.[0-9]+)?(-rc[0-9]+)?"/"' + version + '"/g\' lakefile.*', cwd=repo_path)
        run_command("lake update", cwd=repo_path, stream_output=True)
--- a/src/CMakeLists.txt
+++ b/src/CMakeLists.txt
@@ -7,11 +7,17 @@ if(NOT DEFINED STAGE)
 endif()
 include(ExternalProject)
 project(LEAN CXX C)
-set(LEAN_VERSION_MAJOR 4)
-set(LEAN_VERSION_MINOR 30)
-set(LEAN_VERSION_PATCH 0)
-set(LEAN_VERSION_IS_RELEASE 0) # This number is 1 in the release revision, and 0 otherwise.
+set(LEAN_VERSION_MAJOR 4 CACHE STRING "")
+set(LEAN_VERSION_MINOR 30 CACHE STRING "")
+set(LEAN_VERSION_PATCH 0 CACHE STRING "")
+set(LEAN_VERSION_IS_RELEASE 0 CACHE STRING "") # This number is 1 in the release revision, and 0 otherwise.
 set(LEAN_SPECIAL_VERSION_DESC "" CACHE STRING "Additional version description like 'nightly-2018-03-11'")
+# project(LEAN) above implicitly creates empty LEAN_VERSION_{MAJOR,MINOR,PATCH}
+# normal variables (CMake sets <PROJECT>_VERSION_* for the project name). These
+# shadow the cache values. Remove them so ${VAR} falls through to the cache.
+unset(LEAN_VERSION_MAJOR)
+unset(LEAN_VERSION_MINOR)
+unset(LEAN_VERSION_PATCH)
 set(LEAN_VERSION_STRING "${LEAN_VERSION_MAJOR}.${LEAN_VERSION_MINOR}.${LEAN_VERSION_PATCH}")
 if(LEAN_SPECIAL_VERSION_DESC)
  string(APPEND LEAN_VERSION_STRING "-${LEAN_SPECIAL_VERSION_DESC}")
@@ -81,6 +87,8 @@ option(USE_GITHASH "GIT_HASH" ON)
 option(INSTALL_LICENSE "INSTALL_LICENSE" ON)
 # When ON we install a copy of cadical
 option(INSTALL_CADICAL "Install a copy of cadical" ON)
+# When ON we install a copy of leantar
+option(INSTALL_LEANTAR "Install a copy of leantar" ON)

 # FLAGS for disabling optimizations and debugging
 option(FREE_VAR_RANGE_OPT "FREE_VAR_RANGE_OPT" ON)
@@ -110,6 +118,9 @@ option(USE_LAKE_CACHE "Use the Lake artifact cache for stage 1 builds (requires
 set(LEAN_EXTRA_MAKE_OPTS "" CACHE STRING "extra options to lean --make")
 set(LEANC_CC ${CMAKE_C_COMPILER} CACHE STRING "C compiler to use in `leanc`")

+# Temporary, core-only flags. Must be synced with stdlib_flags.h.
+string(APPEND LEAN_EXTRA_MAKE_OPTS " -Dbackward.do.legacy=false")
+
 if(LAZY_RC MATCHES "ON")
  set(LEAN_LAZY_RC "#define LEAN_LAZY_RC")
 endif()
@@ -751,6 +762,14 @@ if(STAGE GREATER 0 AND CADICAL AND INSTALL_CADICAL)
  add_dependencies(leancpp copy-cadical)
 endif()

+if(LEANTAR AND INSTALL_LEANTAR)
+  add_custom_target(
+    copy-leantar
+    COMMAND cmake -E copy_if_different "${LEANTAR}" "${CMAKE_BINARY_DIR}/bin/leantar${CMAKE_EXECUTABLE_SUFFIX}"
+  )
+  add_dependencies(leancpp copy-leantar)
+endif()
+
 # MSYS2 bash usually handles Windows paths relatively well, but not when putting them in the PATH
 string(REGEX REPLACE "^([a-zA-Z]):" "/\\1" LEAN_BIN "${CMAKE_BINARY_DIR}/bin")

@@ -778,7 +797,7 @@ if(LLVM AND STAGE GREATER 0)
  set(EXTRA_LEANMAKE_OPTS "LLVM=1")
 endif()

-set(STDLIBS Init Std Lean Leanc)
+set(STDLIBS Init Std Lean Leanc LeanIR)
 if(NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
  list(APPEND STDLIBS Lake LeanChecker)
 endif()
@@ -885,9 +904,16 @@ if(PREV_STAGE)
  add_custom_target(update-stage0-commit COMMAND git commit -m "chore: update stage0" DEPENDS update-stage0)
 endif()

+if(NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
+  add_custom_target(leanir ALL
+    DEPENDS leanshared
+    COMMAND $(MAKE) -f ${CMAKE_BINARY_DIR}/stdlib.make leanir
+    VERBATIM)
+endif()
+
 # use Bash version for building, use Lean version in bin/ for tests & distribution
 configure_file("${LEAN_SOURCE_DIR}/bin/leanc.in" "${CMAKE_BINARY_DIR}/leanc.sh" @ONLY)
-if(STAGE GREATER 0 AND EXISTS "${LEAN_SOURCE_DIR}/Leanc.lean" AND NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
+if(STAGE GREATER 0 AND NOT CMAKE_SYSTEM_NAME MATCHES "Emscripten")
  configure_file("${LEAN_SOURCE_DIR}/Leanc.lean" "${CMAKE_BINARY_DIR}/leanc/Leanc.lean" @ONLY)
  add_custom_target(
    leanc
@@ -907,6 +933,10 @@ if(STAGE GREATER 0 AND CADICAL AND INSTALL_CADICAL)
  install(PROGRAMS "${CADICAL}" DESTINATION bin)
 endif()

+if(LEANTAR AND INSTALL_LEANTAR)
+  install(PROGRAMS "${LEANTAR}" DESTINATION bin)
+endif()
+
 add_custom_target(
  clean-stdlib
  COMMAND rm -rf "${CMAKE_BINARY_DIR}/lib" || true
@@ -922,6 +952,7 @@ install(
  PATTERN "*.hash" EXCLUDE
  PATTERN "*.trace" EXCLUDE
  PATTERN "*.rsp" EXCLUDE
+  PATTERN "*.filelist" EXCLUDE
 )

 # symlink source into expected installation location for go-to-definition, if file system allows it
--- a/src/Init.lean
+++ b/src/Init.lean
@@ -30,6 +30,7 @@ public import Init.Hints
 public import Init.Conv
 public import Init.Guard
 public import Init.Simproc
+public import Init.CbvSimproc
 public import Init.SizeOfLemmas
 public import Init.BinderPredicates
 public import Init.Ext
--- a/src/Init/CbvSimproc.lean
+++ b/src/Init/CbvSimproc.lean
@@ -0,0 +1,71 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Wojciech Różowski
+-/
+module
+
+prelude
+public meta import Init.Data.ToString.Name  -- shake: keep (transitive public meta dep, fix)
+public import Init.Tactics
+import Init.Meta.Defs
+
+public section
+
+namespace Lean.Parser
+
+syntax cbvSimprocEval := "cbv_eval"
+
+/--
+A user-defined simplification procedure used by the `cbv` tactic.
+The body must have type `Lean.Meta.Sym.Simp.Simproc` (`Expr → SimpM Result`).
+Procedures are indexed by a discrimination tree pattern and fire at one of three phases:
+`↓` (pre), `cbv_eval` (eval), or `↑` (post, default).
+-/
+syntax (docComment)? attrKind "cbv_simproc " (Tactic.simpPre <|> Tactic.simpPost <|> cbvSimprocEval)? ident " (" term ")" " := " term : command
+
+/--
+A `cbv_simproc` declaration without automatically adding it to the cbv simproc set.
+To activate, use `attribute [cbv_simproc]`.
+-/
+syntax (docComment)? "cbv_simproc_decl " ident " (" term ")" " := " term : command
+
+syntax (docComment)? attrKind "builtin_cbv_simproc " (Tactic.simpPre <|> Tactic.simpPost <|> cbvSimprocEval)? ident " (" term ")" " := " term : command
+
+syntax (docComment)? "builtin_cbv_simproc_decl " ident " (" term ")" " := " term : command
+
+syntax (name := cbvSimprocPattern) "cbv_simproc_pattern% " term " => " ident : command
+
+syntax (name := cbvSimprocPatternBuiltin) "builtin_cbv_simproc_pattern% " term " => " ident : command
+
+namespace Attr
+
+syntax (name := cbvSimprocAttr) "cbv_simproc" (Tactic.simpPre <|> Tactic.simpPost <|> cbvSimprocEval)? : attr
+
+syntax (name := cbvSimprocBuiltinAttr) "builtin_cbv_simproc" (Tactic.simpPre <|> Tactic.simpPost <|> cbvSimprocEval)? : attr
+
+end Attr
+
+macro_rules
+  | `($[$doc?:docComment]? cbv_simproc_decl $n:ident ($pattern:term) := $body) => do
+    let simprocType := `Lean.Meta.Sym.Simp.Simproc
+    `($[$doc?:docComment]? meta def $n:ident : $(mkIdent simprocType) := $body
+      cbv_simproc_pattern% $pattern => $n)
+
+macro_rules
+  | `($[$doc?:docComment]? builtin_cbv_simproc_decl $n:ident ($pattern:term) := $body) => do
+    let simprocType := `Lean.Meta.Sym.Simp.Simproc
+    `($[$doc?:docComment]? def $n:ident : $(mkIdent simprocType) := $body
+      builtin_cbv_simproc_pattern% $pattern => $n)
+
+macro_rules
+  | `($[$doc?:docComment]? $kind:attrKind cbv_simproc $[$phase?]? $n:ident ($pattern:term) := $body) => do
+    `($[$doc?:docComment]? cbv_simproc_decl $n ($pattern) := $body
+      attribute [$kind cbv_simproc $[$phase?]?] $n)
+
+macro_rules
+  | `($[$doc?:docComment]? $kind:attrKind builtin_cbv_simproc $[$phase?]? $n:ident ($pattern:term) := $body) => do
+    `($[$doc?:docComment]? builtin_cbv_simproc_decl $n ($pattern) := $body
+      attribute [$kind builtin_cbv_simproc $[$phase?]?] $n)
+
+end Lean.Parser
--- a/src/Init/Classical.lean
+++ b/src/Init/Classical.lean
@@ -69,9 +69,11 @@ theorem em (p : Prop) : p ∨ ¬p :=
 theorem exists_true_of_nonempty {α : Sort u} : Nonempty α → ∃ _ : α, True
  | ⟨x⟩ => ⟨x, trivial⟩

+@[implicit_reducible]
 noncomputable def inhabited_of_nonempty {α : Sort u} (h : Nonempty α) : Inhabited α :=
  ⟨choice h⟩

+@[implicit_reducible]
 noncomputable def inhabited_of_exists {α : Sort u} {p : α → Prop} (h : ∃ x, p x) : Inhabited α :=
  inhabited_of_nonempty (Exists.elim h (fun w _ => ⟨w⟩))

@@ -81,6 +83,7 @@ noncomputable scoped instance (priority := low) propDecidable (a : Prop) : Decid
    | Or.inl h => ⟨isTrue h⟩
    | Or.inr h => ⟨isFalse h⟩

+@[implicit_reducible]
 noncomputable def decidableInhabited (a : Prop) : Inhabited (Decidable a) where
  default := inferInstance

--- a/src/Init/Control.lean
+++ b/src/Init/Control.lean
@@ -18,3 +18,4 @@ public import Init.Control.StateCps
 public import Init.Control.ExceptCps
 public import Init.Control.MonadAttach
 public import Init.Control.EState
+public import Init.Control.Do
--- a/src/Init/Control/Id.lean
+++ b/src/Init/Control/Id.lean
@@ -49,6 +49,7 @@ instance : Monad Id where
 /--
 The identity monad has a `bind` operator.
 -/
+@[implicit_reducible]
 def hasBind : Bind Id :=
  inferInstance

@@ -58,7 +59,7 @@ Runs a computation in the identity monad.
 This function is the identity function. Because its parameter has type `Id α`, it causes
 `do`-notation in its arguments to use the `Monad Id` instance.
 -/
-@[always_inline, inline, expose]
+@[always_inline, inline, expose, implicit_reducible]
 protected def run (x : Id α) : α := x

 instance [OfNat α n] : OfNat (Id α) n :=
--- a/src/Init/Control/Lawful/Basic.lean
+++ b/src/Init/Control/Lawful/Basic.lean
@@ -254,8 +254,8 @@ instance : LawfulMonad Id := by
@[simp, grind =] theorem run_bind (x : Id α) (f : α → Id β) : (x >>= f).run = (f x.run).run := rfl
@[simp, grind =] theorem run_pure (a : α) : (pure a : Id α).run = a := rfl
@[simp, grind =] theorem pure_run (a : Id α) : pure a.run = a := rfl
-@[simp] theorem run_seqRight (x y : Id α) : (x *> y).run = y.run := rfl
-@[simp] theorem run_seqLeft (x y : Id α) : (x <* y).run = x.run := rfl
+@[simp] theorem run_seqRight (x : Id α) (y : Id β) : (x *> y).run = y.run := rfl
+@[simp] theorem run_seqLeft (x : Id α) (y : Id β) : (x <* y).run = x.run := rfl
@[simp] theorem run_seq (f : Id (α → β)) (x : Id α) : (f <*> x).run = f.run x.run := rfl

 end Id
--- a/src/Init/Control/Lawful/MonadAttach/Instances.lean
+++ b/src/Init/Control/Lawful/MonadAttach/Instances.lean
@@ -72,11 +72,11 @@ public instance [Monad m] [LawfulMonad m] [MonadAttach m] [LawfulMonadAttach m]

 public instance [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfulMonadAttach m] :
    WeaklyLawfulMonadAttach (StateRefT' ω σ m) :=
-  inferInstanceAs (WeaklyLawfulMonadAttach (ReaderT _ _))
+  inferInstanceAs (WeaklyLawfulMonadAttach (ReaderT (ST.Ref ω σ) m))

 public instance [Monad m] [MonadAttach m] [LawfulMonad m] [LawfulMonadAttach m] :
    LawfulMonadAttach (StateRefT' ω σ m) :=
-  inferInstanceAs (LawfulMonadAttach (ReaderT _ _))
+  inferInstanceAs (LawfulMonadAttach (ReaderT (ST.Ref ω σ) m))

 section

--- a/src/Init/Control/Lawful/MonadLift/Instances.lean
+++ b/src/Init/Control/Lawful/MonadLift/Instances.lean
@@ -103,11 +103,11 @@ namespace StateRefT'
 instance {ω σ : Type} {m : Type → Type} [Monad m] : LawfulMonadLift m (StateRefT' ω σ m) where
  monadLift_pure _ := by
    simp only [MonadLift.monadLift, pure]
-    unfold StateRefT'.lift ReaderT.pure
+    unfold StateRefT'.lift instMonad._aux_5 ReaderT.pure
    simp only
  monadLift_bind _ _ := by
    simp only [MonadLift.monadLift, bind]
-    unfold StateRefT'.lift ReaderT.bind
+    unfold StateRefT'.lift instMonad._aux_13 ReaderT.bind
    simp only

 end StateRefT'
--- a/src/Init/Conv.lean
+++ b/src/Init/Conv.lean
@@ -60,9 +60,6 @@ with functions defined via well-founded recursion or partial fixpoints.
 The proofs produced by `cbv` only use the three standard axioms.
 In particular, they do not require trust in the correctness of the code
 generator.
-
-This tactic is experimental and its behavior is likely to change in upcoming
-releases of Lean.
 -/
 syntax (name := cbv) "cbv" : conv

@@ -280,7 +277,7 @@ resulting in `t'`, which becomes the new target subgoal. -/
 syntax (name := convConvSeq) "conv" " => " convSeq : conv

 /-- `· conv` focuses on the main conv goal and tries to solve it using `s`. -/
-macro dot:patternIgnore("· " <|> ". ") s:convSeq : conv => `(conv| {%$dot ($s) })
+macro dot:unicode("· ", ". ") s:convSeq : conv => `(conv| {%$dot ($s) })


 /-- `fail_if_success t` fails if the tactic `t` succeeds. -/
--- a/src/Init/Core.lean
+++ b/src/Init/Core.lean
@@ -172,6 +172,8 @@ instance thunkCoe : CoeTail α (Thunk α) where
  -- Since coercions are expanded eagerly, `a` is evaluated lazily.
  coe a := ⟨fun _ => a⟩

+instance [Inhabited α] : Inhabited (Thunk α) := ⟨.pure default⟩
+
 /-- A variation on `Eq.ndrec` with the equality argument first. -/
 abbrev Eq.ndrecOn.{u1, u2} {α : Sort u2} {a : α} {motive : α → Sort u1} {b : α} (h : a = b) (m : motive a) : motive b :=
  Eq.ndrec m h
--- a/src/Init/Data/Array.lean
+++ b/src/Init/Data/Array.lean
@@ -34,3 +34,4 @@ public import Init.Data.Array.MinMax
 public import Init.Data.Array.Nat
 public import Init.Data.Array.Int
 public import Init.Data.Array.Count
+public import Init.Data.Array.Sort
--- a/src/Init/Data/Array/Attach.lean
+++ b/src/Init/Data/Array/Attach.lean
@@ -98,7 +98,7 @@ well-founded recursion mechanism to prove that the function terminates.

@[simp] theorem pmap_push {P : α → Prop} (f : ∀ a, P a → β) (a : α) (xs : Array α) (h : ∀ b ∈ xs.push a, P b) :
    pmap f (xs.push a) h =
-      (pmap f xs (fun a m => by simp at h; exact h a (.inl m))).push (f a (h a (by simp))) := by
+      (pmap f xs (fun a m => by simp [forall_or_eq_imp] at h; exact h.1 _ m)).push (f a (h a (by simp))) := by
  simp [pmap]

@[simp] theorem attach_empty : (#[] : Array α).attach = #[] := rfl
@@ -153,7 +153,7 @@ theorem attachWith_congr {xs ys : Array α} (w : xs = ys) {P : α → Prop} {H :

@[simp] theorem attachWith_push {a : α} {xs : Array α} {P : α → Prop} {H : ∀ x ∈ xs.push a, P x} :
    (xs.push a).attachWith P H =
-      (xs.attachWith P (fun x h => by simp at H; exact H x (.inl h))).push ⟨a, H a (by simp)⟩ := by
+      (xs.attachWith P (fun x h => by simp [forall_or_eq_imp] at H; exact H.1 _ h)).push ⟨a, H a (by simp)⟩ := by
  cases xs
  simp

--- a/src/Init/Data/Array/Basic.lean
+++ b/src/Init/Data/Array/Basic.lean
@@ -148,6 +148,9 @@ end List

 namespace Array

+@[simp, grind =] theorem getElem!_toList [Inhabited α] {xs : Array α} {i : Nat} : xs.toList[i]! = xs[i]! := by
+  rw [List.getElem!_toArray]
+
 theorem size_eq_length_toList {xs : Array α} : xs.size = xs.toList.length := rfl

 /-! ### Externs -/
@@ -283,7 +286,7 @@ Examples:
 * `#[1, 2].isEmpty = false`
 * `#[()].isEmpty = false`
 -/
-@[expose]
+@[expose, inline]
 def isEmpty (xs : Array α) : Bool :=
  xs.size = 0

@@ -377,6 +380,7 @@ Returns the last element of an array, or panics if the array is empty.
 Safer alternatives include `Array.back`, which requires a proof the array is non-empty, and
 `Array.back?`, which returns an `Option`.
 -/
+@[inline]
 def back! [Inhabited α] (xs : Array α) : α :=
  xs[xs.size - 1]!

@@ -386,6 +390,7 @@ Returns the last element of an array, given a proof that the array is not empty.
 See `Array.back!` for the version that panics if the array is empty, or `Array.back?` for the
 version that returns an option.
 -/
+@[inline]
 def back (xs : Array α) (h : 0 < xs.size := by get_elem_tactic) : α :=
  xs[xs.size - 1]'(Nat.sub_one_lt_of_lt h)

@@ -395,6 +400,7 @@ Returns the last element of an array, or `none` if the array is empty.
 See `Array.back!` for the version that panics if the array is empty, or `Array.back` for the version
 that requires a proof the array is non-empty.
 -/
+@[inline]
 def back? (xs : Array α) : Option α :=
  xs[xs.size - 1]?

@@ -553,9 +559,9 @@ def modifyOp (xs : Array α) (idx : Nat) (f : α → α) : Array α :=
  xs.modify idx f

 /--
-  We claim this unsafe implementation is correct because an array cannot have more than `usizeSz` elements in our runtime.
+  We claim this unsafe implementation is correct because an array cannot have more than `USize.size` elements in our runtime.

-  This kind of low level trick can be removed with a little bit of compiler support. For example, if the compiler simplifies `as.size < usizeSz` to true. -/
+  This kind of low level trick can be removed with a little bit of compiler support. For example, if the compiler simplifies `as.size < USize.size` to true. -/
@[inline] unsafe def forIn'Unsafe {α : Type u} {β : Type v} {m : Type v → Type w} [Monad m] (as : Array α) (b : β) (f : (a : α) → a ∈ as → β → m (ForInStep β)) : m β :=
  let sz := as.usize
  let rec @[specialize] loop (i : USize) (b : β) : m β := do
@@ -2145,7 +2151,4 @@ protected def repr {α : Type u} [Repr α] (xs : Array α) : Std.Format :=
 instance {α : Type u} [Repr α] : Repr (Array α) where
  reprPrec xs _ := Array.repr xs

-instance [ToString α] : ToString (Array α) where
-  toString xs := String.Internal.append "#" (toString xs.toList)
-
 end Array
--- a/src/Init/Data/Array/Find.lean
+++ b/src/Init/Data/Array/Find.lean
@@ -622,12 +622,12 @@ theorem findIdx?_eq_some_le_of_findIdx?_eq_some {xs : Array α} {p q : α → Bo
 /-! ### findFinIdx? -/

@[grind =]
-theorem findFinIdx?_empty {p : α → Bool} : findFinIdx? p #[] = none := by simp; rfl
+theorem findFinIdx?_empty {p : α → Bool} : findFinIdx? p #[] = none := by simp

@[grind =]
 theorem findFinIdx?_singleton {a : α} {p : α → Bool} :
    #[a].findFinIdx? p = if p a then some ⟨0, by simp⟩ else none := by
-  simp; rfl
+  simp

 -- We can't mark this as a `@[congr]` lemma since the head of the RHS is not `findFinIdx?`.
 theorem findFinIdx?_congr {p : α → Bool} {xs ys : Array α} (w : xs = ys) :
@@ -801,7 +801,7 @@ theorem idxOf?_eq_map_finIdxOf?_val [BEq α] {xs : Array α} {a : α} :
    xs.idxOf? a = (xs.finIdxOf? a).map (·.val) := by
  simp [idxOf?, finIdxOf?]

-@[grind =] theorem finIdxOf?_empty [BEq α] : (#[] : Array α).finIdxOf? a = none := by simp; rfl
+@[grind =] theorem finIdxOf?_empty [BEq α] : (#[] : Array α).finIdxOf? a = none := by simp

@[simp, grind =] theorem finIdxOf?_eq_none_iff [BEq α] [LawfulBEq α] {xs : Array α} {a : α} :
    xs.finIdxOf? a = none ↔ a ∉ xs := by
--- a/src/Init/Data/Array/Lex/Lemmas.lean
+++ b/src/Init/Data/Array/Lex/Lemmas.lean
@@ -78,7 +78,7 @@ private theorem cons_lex_cons [BEq α] {lt : α → α → Bool} {a b : α} {xs
  simp only [lex, size_append, List.size_toArray, List.length_cons, List.length_nil, Nat.zero_add,
    Nat.add_min_add_left, Nat.add_lt_add_iff_left, Std.Rco.forIn'_eq_forIn'_toList]
  rw [cons_lex_cons.forIn'_congr_aux (Nat.toList_rco_eq_cons (by omega)) rfl (fun _ _ _ => rfl)]
-  simp only [bind_pure_comp, map_pure, Nat.toList_rco_succ_succ, Nat.add_comm 1]
+  simp only [Nat.toList_rco_succ_succ, Nat.add_comm 1]
  cases h : lt a b
  · cases h' : a == b <;> simp [bne, *]
  · simp [*]
--- a/src/Init/Data/Array/Sort.lean
+++ b/src/Init/Data/Array/Sort.lean
@@ -0,0 +1,10 @@
+/-
+Copyright (c) 2026 Lean FRO. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Array.Sort.Basic
+public import Init.Data.Array.Sort.Lemmas
--- a/src/Init/Data/Array/Sort/Basic.lean
+++ b/src/Init/Data/Array/Sort/Basic.lean
@@ -0,0 +1,55 @@
+/-
+Copyright (c) 2026 Lean FRO. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Array.Subarray.Split
+public import Init.Data.Slice.Array
+import Init.Omega
+
+public section
+
+private def Array.MergeSort.Internal.merge (xs ys : Array α) (le : α → α → Bool := by exact (· ≤ ·)) :
+    Array α :=
+  if hxs : 0 < xs.size then
+    if hys : 0 < ys.size then
+      go xs[*...*] ys[*...*] (by simp only [Array.size_mkSlice_rii]; omega) (by simp only [Array.size_mkSlice_rii]; omega) (Array.emptyWithCapacity (xs.size + ys.size))
+    else
+      xs
+  else
+    ys
+where
+  go (xs ys : Subarray α) (hxs : 0 < xs.size) (hys : 0 < ys.size) (acc : Array α) : Array α :=
+    let x := xs[0]
+    let y := ys[0]
+    if le x y then
+      if hi : 1 < xs.size then
+        go (xs.drop 1) ys (by simp only [Subarray.size_drop]; omega) hys (acc.push x)
+      else
+        ys.foldl (init := acc.push x) (fun acc y => acc.push y)
+    else
+      if hj : 1 < ys.size then
+        go xs (ys.drop 1) hxs (by simp only [Subarray.size_drop]; omega) (acc.push y)
+      else
+        xs.foldl (init := acc.push y) (fun acc x => acc.push x)
+  termination_by xs.size + ys.size
+
+def Subarray.mergeSort (xs : Subarray α) (le : α → α → Bool := by exact (· ≤ ·)) : Array α :=
+    if h : 1 < xs.size then
+      let splitIdx := (xs.size + 1) / 2 -- We follow the same splitting convention as `List.mergeSort`
+      let left := xs[*...splitIdx]
+      let right := xs[splitIdx...*]
+      Array.MergeSort.Internal.merge (mergeSort left le) (mergeSort right le) le
+    else
+      xs.toArray
+termination_by xs.size
+decreasing_by
+  · simp only [Subarray.size_mkSlice_rio]; omega
+  · simp only [Subarray.size_mkSlice_rci]; omega
+
+@[inline]
+def Array.mergeSort (xs : Array α) (le : α → α → Bool := by exact (· ≤ ·)) : Array α :=
+    xs[*...*].mergeSort le
--- a/src/Init/Data/Array/Sort/Lemmas.lean
+++ b/src/Init/Data/Array/Sort/Lemmas.lean
@@ -0,0 +1,241 @@
+/-
+Copyright (c) 2026 Lean FRO. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Array.Sort.Basic
+public import Init.Data.List.Sort.Basic
+public import Init.Data.Array.Perm
+import all Init.Data.Array.Sort.Basic
+import all Init.Data.List.Sort.Basic
+import Init.Data.List.Sort.Lemmas
+import Init.Data.Slice.Array.Lemmas
+import Init.Data.Slice.List.Lemmas
+import Init.Data.Array.Bootstrap
+import Init.Data.Array.Lemmas
+import Init.Data.Array.MapIdx
+import Init.ByCases
+
+public section
+
+private theorem Array.MergeSort.merge.go_eq_listMerge {xs ys : Subarray α} {hxs hys le acc} :
+    (Array.MergeSort.Internal.merge.go le xs ys hxs hys acc).toList = acc.toList ++ List.merge xs.toList ys.toList le := by
+  fun_induction Array.MergeSort.Internal.merge.go le xs ys hxs hys acc
+  · rename_i xs ys _ _ _ _ _ _ _ _
+    rw [List.merge.eq_def]
+    split
+    · have : xs.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · have : ys.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · rename_i x' xs' y' ys' _ _
+      simp +zetaDelta only at *
+      have h₁ : x' = xs[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      have h₂ : y' = ys[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      cases h₁
+      cases h₂
+      simp [Subarray.toList_drop, *]
+  · rename_i xs ys _ _ _ _ _ _ _
+    rw [List.merge.eq_def]
+    split
+    · have : xs.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · have : ys.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · rename_i x' xs' y' ys' _ _
+      simp +zetaDelta only at *
+      have h₁ : x' = xs[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      have h₂ : y' = ys[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      cases h₁
+      cases h₂
+      simp [*]
+      have : xs.size = xs'.length + 1 := by simp [← Subarray.length_toList, *]
+      have : xs' = [] := List.eq_nil_of_length_eq_zero (by omega)
+      simp only [this]
+      rw [← Subarray.foldl_toList]
+      simp [*]
+  · rename_i xs ys _ _ _ _ _ _ _ _
+    rw [List.merge.eq_def]
+    split
+    · have : xs.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · have : ys.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · rename_i x' xs' y' ys' _ _
+      simp +zetaDelta only at *
+      have h₁ : x' = xs[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      have h₂ : y' = ys[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      cases h₁
+      cases h₂
+      simp [Subarray.toList_drop, *]
+  · rename_i xs ys _ _ _ _ _ _ _
+    rw [List.merge.eq_def]
+    split
+    · have : xs.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · have : ys.size = 0 := by simp [← Subarray.length_toList, *]
+      omega
+    · rename_i x' xs' y' ys' _ _
+      simp +zetaDelta only at *
+      have h₁ : x' = xs[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      have h₂ : y' = ys[0] := by simp [Subarray.getElem_eq_getElem_toList, *]
+      cases h₁
+      cases h₂
+      simp [*]
+      have : ys.size = ys'.length + 1 := by simp [← Subarray.length_toList, *]
+      have : ys' = [] := List.eq_nil_of_length_eq_zero (by omega)
+      simp [this]
+      rw [← Subarray.foldl_toList]
+      simp [*]
+
+private theorem Array.MergeSort.merge_eq_listMerge {xs ys : Array α} {le} :
+    (Array.MergeSort.Internal.merge xs ys le).toList = List.merge xs.toList ys.toList le := by
+  rw [Array.MergeSort.Internal.merge]
+  split <;> rename_i heq₁
+  · split <;> rename_i heq₂
+    · simp [Array.MergeSort.merge.go_eq_listMerge]
+    · have : ys.toList = [] := by simp_all
+      simp [this]
+  · have : xs.toList = [] := by simp_all
+    simp [this]
+
+private theorem List.mergeSort_eq_merge_mkSlice {xs : List α} :
+    xs.mergeSort le =
+      if 1 < xs.length then
+        merge (xs[*...((xs.length + 1) / 2)].toList.mergeSort le) (xs[((xs.length + 1) / 2)...*].toList.mergeSort le) le
+      else
+        xs := by
+  fun_cases xs.mergeSort le
+  · simp
+  · simp
+  · rename_i x y ys lr hl hr
+    simp [lr]
+
+theorem Subarray.toList_mergeSort {xs : Subarray α} {le : α → α → Bool} :
+    (xs.mergeSort le).toList = xs.toList.mergeSort le := by
+  fun_induction xs.mergeSort le
+  · rw [List.mergeSort_eq_merge_mkSlice]
+    simp +zetaDelta [Array.MergeSort.merge_eq_listMerge, *]
+  · simp [List.mergeSort_eq_merge_mkSlice, *]
+
+@[simp, grind =]
+theorem Subarray.mergeSort_eq_mergeSort_toArray {xs : Subarray α} {le : α → α → Bool} :
+    xs.mergeSort le = xs.toArray.mergeSort le := by
+  simp [← Array.toList_inj, toList_mergeSort, Array.mergeSort]
+
+theorem Subarray.mergeSort_toArray {xs : Subarray α} {le : α → α → Bool} :
+    xs.toArray.mergeSort le = xs.mergeSort le := by
+  simp
+
+theorem Array.toList_mergeSort {xs : Array α} {le : α → α → Bool} :
+    (xs.mergeSort le).toList = xs.toList.mergeSort le := by
+  rw [Array.mergeSort, Subarray.toList_mergeSort, Array.toList_mkSlice_rii]
+
+@[cbv_eval]
+theorem Array.mergeSort_eq_toArray_mergeSort_toList {xs : Array α} {le : α → α → Bool} :
+    xs.mergeSort le = (xs.toList.mergeSort le).toArray := by
+  simp [← toList_mergeSort]
+
+/-!
+# Basic properties of `Array.mergeSort`.
+
+* `pairwise_mergeSort`: `mergeSort` produces a sorted array.
+* `mergeSort_perm`: `mergeSort` is a permutation of the input array.
+* `mergeSort_of_pairwise`: `mergeSort` does not change a sorted array.
+* `sublist_mergeSort`: if `c` is a sorted sublist of `xs.toList`, then `c` is still a sublist of `(mergeSort xs le).toList`.
+-/
+
+namespace Array
+
+-- Enable this instance locally so we can write `Pairwise le` instead of `Pairwise (le · ·)` everywhere.
+attribute [local instance] boolRelToRel
+
+@[simp] theorem mergeSort_empty : (#[] : Array α).mergeSort r = #[] := by
+  simp [mergeSort_eq_toArray_mergeSort_toList]
+
+@[simp] theorem mergeSort_singleton {a : α} : #[a].mergeSort r = #[a] := by
+  simp [mergeSort_eq_toArray_mergeSort_toList]
+
+theorem mergeSort_perm {xs : Array α} {le} : (xs.mergeSort le).Perm xs := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList, Array.perm_iff_toList_perm] using List.mergeSort_perm _ _
+
+@[simp] theorem size_mergeSort {xs : Array α} : (mergeSort xs le).size = xs.size := by
+  simp [mergeSort_eq_toArray_mergeSort_toList]
+
+@[simp] theorem mem_mergeSort {a : α} {xs : Array α} : a ∈ mergeSort xs le ↔ a ∈ xs := by
+  simp [mergeSort_eq_toArray_mergeSort_toList]
+
+/--
+The result of `Array.mergeSort` is sorted,
+as long as the comparison function is transitive (`le a b → le b c → le a c`)
+and total in the sense that `le a b || le b a`.
+
+The comparison function need not be antisymmetric, i.e. `le a b` and `le b a` may both hold even when `a ≠ b`.
+-/
+theorem pairwise_mergeSort
+    (trans : ∀ (a b c : α), le a b → le b c → le a c)
+    (total : ∀ (a b : α), le a b || le b a)
+    {xs : Array α} :
+    (mergeSort xs le).toList.Pairwise (le · ·) := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList] using List.pairwise_mergeSort trans total _
+
+/--
+If the input array is already sorted, then `mergeSort` does not change the array.
+-/
+theorem mergeSort_of_pairwise {le : α → α → Bool} {xs : Array α} (_ : xs.toList.Pairwise (le · ·)) :
+    mergeSort xs le = xs := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList, List.toArray_eq_iff] using List.mergeSort_of_pairwise ‹_›
+
+/--
+This merge sort algorithm is stable,
+in the sense that breaking ties in the ordering function using the position in the array
+has no effect on the output.
+
+That is, elements which are equal with respect to the ordering function will remain
+in the same order in the output array as they were in the input array.
+
+See also:
+* `sublist_mergeSort`: if `c <+ xs.toList` and `c.Pairwise le`, then `c <+ (mergeSort xs le).toList`.
+* `pair_sublist_mergeSort`: if `[a, b] <+ xs.toList` and `le a b`, then `[a, b] <+ (mergeSort xs le).toList`.
+-/
+theorem mergeSort_zipIdx {xs : Array α} :
+    (mergeSort (xs.zipIdx.map fun (a, i) => (a, i)) (List.zipIdxLE le)).map (·.1) = mergeSort xs le := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList, Array.toList_zipIdx] using List.mergeSort_zipIdx
+
+/--
+Another statement of stability of merge sort.
+If `ys` is a sorted sublist of `xs.toList`,
+then `ys` is still a sublist of `(mergeSort xs le).toList`.
+-/
+theorem sublist_mergeSort {le : α → α → Bool}
+    (trans : ∀ (a b c : α), le a b → le b c → le a c)
+    (total : ∀ (a b : α), le a b || le b a)
+    {ys : List α} (_ : ys.Pairwise (le · ·)) (_ : List.Sublist ys xs.toList) :
+    List.Sublist ys (mergeSort xs le).toList := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList, Array.toList_zipIdx] using
+    List.sublist_mergeSort trans total ‹_› ‹_›
+
+/--
+Another statement of stability of merge sort.
+If a pair `[a, b]` is a sublist of `xs.toList` and `le a b`,
+then `[a, b]` is still a sublist of `(mergeSort xs le).toList`.
+-/
+theorem pair_sublist_mergeSort
+    (trans : ∀ (a b c : α), le a b → le b c → le a c)
+    (total : ∀ (a b : α), le a b || le b a)
+    (hab : le a b) (h : List.Sublist [a, b] xs.toList) :
+    List.Sublist [a, b] (mergeSort xs le).toList := by
+  simpa [mergeSort_eq_toArray_mergeSort_toList, Array.toList_zipIdx] using
+    List.pair_sublist_mergeSort trans total ‹_› ‹_›
+
+theorem map_mergeSort {r : α → α → Bool} {s : β → β → Bool} {f : α → β}
+    {xs : Array α} (hxs : ∀ a ∈ xs, ∀ b ∈ xs, r a b = s (f a) (f b)) :
+    (xs.mergeSort r).map f = (xs.map f).mergeSort s := by
+  simp only [mergeSort_eq_toArray_mergeSort_toList, List.map_toArray, toList_map, mk.injEq]
+  apply List.map_mergeSort
+  simpa
+
+end Array
--- a/src/Init/Data/BEq.lean
+++ b/src/Init/Data/BEq.lean
@@ -36,6 +36,8 @@ theorem BEq.symm [BEq α] [Std.Symm (α := α) (· == ·)] {a b : α} : a == b
 theorem BEq.comm [BEq α] [PartialEquivBEq α] {a b : α} : (a == b) = (b == a) :=
  Bool.eq_iff_iff.2 ⟨BEq.symm, BEq.symm⟩

+theorem bne_eq [BEq α] {a b : α} : (a != b) = !(a == b) := rfl
+
 theorem bne_comm [BEq α] [PartialEquivBEq α] {a b : α} : (a != b) = (b != a) := by
  rw [bne, BEq.comm, bne]

@@ -64,3 +66,8 @@ theorem BEq.neq_of_beq_of_neq [BEq α] [PartialEquivBEq α] {a b c : α} :
 instance (priority := low) [BEq α] [LawfulBEq α] : EquivBEq α where
  symm h := beq_iff_eq.2 <| Eq.symm <| beq_iff_eq.1 h
  trans hab hbc := beq_iff_eq.2 <| (beq_iff_eq.1 hab).trans <| beq_iff_eq.1 hbc
+
+theorem equivBEq_of_iff_apply_eq [BEq α] (f : α → β) (hf : ∀ a b, a == b ↔ f a = f b) : EquivBEq α where
+  rfl := by simp [hf]
+  symm := by simp [hf, eq_comm]
+  trans hab hbc := (hf _ _).2 (Eq.trans ((hf _ _).1 hab) ((hf _ _).1 hbc))
--- a/src/Init/Data/BitVec/Bitblast.lean
+++ b/src/Init/Data/BitVec/Bitblast.lean
@@ -2393,4 +2393,412 @@ theorem fastUmulOverflow (x y : BitVec w) :
        simp [← Nat.pow_add, show w + 1 - (k - 1) + k = w + 1 + 1 by omega] at this
        omega

+/-! ### Population Count -/
+
+/-- Extract the `idx`-th bit from `x` and extend it to have length `len`. -/
+def extractAndExtendBit (idx len : Nat) (x : BitVec w) : BitVec len :=
+  BitVec.zeroExtend len (BitVec.extractLsb' idx 1 x)
+
+
+/-- Recursively extract one bit at a time and extend it to width `len`. -/
+def extractAndExtendAux (k len : Nat) (x : BitVec w) (acc : BitVec (k * len)) (hle : k ≤ w) :
+    BitVec (w * len) :=
+  match hwi : w - k with
+  | 0 => acc.cast (by simp [show w = k by omega])
+  | n' + 1 =>
+    let acc' := extractAndExtendBit k len x ++ acc
+    extractAndExtendAux (k + 1) len x (acc'.cast (by simp [Nat.add_mul]; omega)) (by omega)
+termination_by w - k
+
+/-- We instantiate `extractAndExtendAux` with an empty accumulator, extending
+  each bit in `x` to have width `len` and returning a `BitVec (w * len)`. -/
+def extractAndExtend (len : Nat) (x : BitVec w) : BitVec (w * len) :=
+  extractAndExtendAux 0 len x ((0#0).cast (by simp)) (by omega)
+
+/--
+  Construct a layer of the parallel-prefix-sum tree by summing two-by-two all the
+  `w`-long words in `oldLayer`, returning a bitvector containing `(len + 1) / 2`
+  flattened `w`-long words, each resulting from an addition.
+-/
+def cpopLayer (oldLayer : BitVec (len * w)) (newLayer : BitVec (iterNum * w))
+    (hold : 2 * (iterNum - 1) < len) : BitVec (((len + 1)/2) * w) :=
+  if hlen : len - (iterNum * 2) = 0 then
+    have : ((len + 1)/2) = iterNum := by omega
+    newLayer.cast (by simp [this])
+  else
+    let op1 := oldLayer.extractLsb' ((2 * iterNum) * w) w
+    let op2 := oldLayer.extractLsb' ((2 * iterNum + 1) * w) w
+    let newLayer' := (op1 + op2) ++ newLayer
+    have hcast : w + iterNum * w = (iterNum + 1) * w := by simp [Nat.add_mul]; omega
+    cpopLayer oldLayer (newLayer'.cast hcast) (by omega)
+termination_by len - (iterNum * 2)
+
+/--
+  Given a `BitVec (len * w)` of `len` flattened `w`-long words,
+  construct a binary tree that sums two-by-two the `w`-long words in the previous layer,
+  ultimately returning a single `w`-long word corresponding to the whole addition.
+-/
+def cpopTree (l : BitVec (len * w)) : BitVec w :=
+  if h : len = 0 then 0#w
+  else if h : len = 1 then
+    l.cast (by simp [h])
+  else
+    cpopTree (cpopLayer l 0#(0 * w) (by omega))
+termination_by len
+
+/--
+  Given a bitvector `x : BitVec w`, construct a parallel prefix sum circuit
+  that adds the bits of `x`, each zero-extended to a `w`-long word.
+-/
+def cpopRec (x : BitVec w) : BitVec w :=
+  if hw : 1 < w then
+    let extendedBits := x.extractAndExtend w
+    (cpopTree extendedBits).cast (by simp)
+  else if hw' : 0 < w then
+    x
+  else
+    0#w
+
+/-- Recursive addition of the elements in a flattened bitvec, starting from the `rem`-th element. -/
+private def addRecAux (x : BitVec (l * w)) (rem : Nat) (acc : BitVec w) : BitVec w :=
+  match rem with
+  | 0 => acc
+  | n + 1 => x.addRecAux n (acc + x.extractLsb' (n * w) w)
+
+/-- Recursive addition of the elements in a flattened bitvec. -/
+private def addRec (x : BitVec (l * w)) : BitVec w := addRecAux x l 0#w
+
+theorem getLsbD_extractAndExtendBit {x : BitVec w} :
+    (extractAndExtendBit k len x).getLsbD i =
+    (decide (i = 0) && decide (0 < len) && x.getLsbD k) := by
+  simp only [extractAndExtendBit, truncate_eq_setWidth, getLsbD_setWidth, getLsbD_extractLsb',
+    Nat.lt_one_iff]
+  by_cases hi : i = 0
+  <;> simp [hi]
+
+@[simp]
+private theorem extractAndExtendAux_zero {k len : Nat} {x : BitVec w}
+    {acc : BitVec (k * len)} (heq : w = k) :
+    extractAndExtendAux k len x acc (by omega) = acc.cast (by simp [heq]) := by
+  unfold extractAndExtendAux
+  split
+  · simp
+  · omega
+
+private theorem extractLsb'_extractAndExtendAux {k len : Nat} {x : BitVec w}
+    (acc : BitVec (k * len)) (hle : k ≤ w) :
+    (∀ i (_ : i < k), acc.extractLsb' (i * len) len = (x.extractLsb' i 1).setWidth len) →
+    (extractAndExtendAux k len x acc (by omega)).extractLsb' (i * len) len =
+    (x.extractLsb' i 1).setWidth len := by
+  intros hacc
+  induction hwi : w - k generalizing acc k
+  · case zero =>
+    rw [extractAndExtendAux_zero (by omega)]
+    by_cases hj : i < k
+    · apply hacc
+      exact hj
+    · ext l hl
+      have := mul_le_mul_right (n := k) (m := i) len (by omega)
+      simp [← getLsbD_eq_getElem, getLsbD_extractLsb', hl, getLsbD_setWidth,
+        show w ≤ i + l by omega, getLsbD_of_ge acc (i * len + l) (by omega)]
+  · case succ n' ihn' =>
+    rw [extractAndExtendAux]
+    split
+    · omega
+    · apply ihn'
+      · intros i hi
+        have hcast : len + k * len = (k + 1) * len := by
+            simp [Nat.mul_comm, Nat.mul_add, Nat.add_comm]
+
+        by_cases hi' : i < k
+        · have heq : extractLsb' (i * len) len (BitVec.cast hcast (extractAndExtendBit k len x ++ acc))  =
+              extractLsb' (i * len) len ((extractAndExtendBit k len x ++ acc)) := by
+            ext; simp
+          rw [heq, extractLsb'_append_of_lt hi']
+          apply hacc
+          exact hi'
+        · have heq : extractLsb' (i * len) len (BitVec.cast hcast (extractAndExtendBit k len x ++ acc))   =
+              extractLsb' (i * len) len ((extractAndExtendBit k len x ++ acc)) := by
+            ext; simp
+          rw [heq, extractLsb'_append_of_eq (by omega)]
+          simp [show i = k by omega, extractAndExtendBit]
+      · omega
+
+theorem extractLsb'_cpopLayer {w iterNum i oldLen : Nat} {oldLayer : BitVec (oldLen * w)}
+    {newLayer : BitVec (iterNum * w)} (hold : 2 * (iterNum - 1) < oldLen) :
+    (∀ i (_hi: i < iterNum),
+      newLayer.extractLsb' (i * w) w =
+      oldLayer.extractLsb' ((2 * i) * w) w + (oldLayer.extractLsb' ((2 * i + 1) * w) w)) →
+    extractLsb' (i * w) w (oldLayer.cpopLayer newLayer hold) =
+      extractLsb' (2 * i * w) w oldLayer + extractLsb' ((2 * i + 1) * w) w oldLayer := by
+  intro proof_addition
+  rw [cpopLayer]
+  split
+  · by_cases hi : i < iterNum
+    · simp only [extractLsb'_cast]
+      apply proof_addition
+      exact hi
+    · ext j hj
+      have : iterNum * w ≤ i * w := by refine mul_le_mul_right w (by omega)
+      have : oldLen * w ≤ (2 * i) * w := by refine mul_le_mul_right w (by omega)
+      have : oldLen * w ≤ (2 * i + 1) * w := by refine mul_le_mul_right w (by omega)
+      have hz : extractLsb' (2 * i * w) w oldLayer = 0#w := by
+        ext j hj
+        simp [show oldLen * w ≤ 2 * i * w + j by omega]
+      have hz' : extractLsb' ((2 * i + 1) * w) w oldLayer = 0#w := by
+        ext j hj
+        simp [show oldLen * w ≤ (2 * i + 1) * w + j by omega]
+      simp [show iterNum * w ≤ i * w + j by omega, hz, hz']
+  · generalize hop1 : oldLayer.extractLsb' ((2 * iterNum) * w) w = op1
+    generalize hop2 : oldLayer.extractLsb' ((2 * iterNum + 1) * w) w = op2
+    have hcast : w + iterNum * w = (iterNum + 1) * w := by simp [Nat.add_mul]; omega
+    apply extractLsb'_cpopLayer
+    intros i hi
+    by_cases hlt : i < iterNum
+    · rw [extractLsb'_cast, extractLsb'_append_eq_of_add_le]
+      · apply proof_addition
+        exact hlt
+      · rw [show i * w + w = i * w + 1 * w by omega, ← Nat.add_mul]
+        exact mul_le_mul_right w hlt
+    · rw [extractLsb'_cast, show i = iterNum by omega, extractLsb'_append_eq_left, hop1, hop2]
+termination_by oldLen - 2 * (iterNum + 1 - 1)
+
+theorem getLsbD_cpopLayer {w iterNum: Nat} {oldLayer : BitVec (oldLen * w)}
+    {newLayer : BitVec (iterNum * w)} (hold : 2 * (iterNum - 1) < oldLen) :
+    (∀ i (_hi: i < iterNum),
+          newLayer.extractLsb' (i * w) w =
+          oldLayer.extractLsb' ((2 * i) * w) w + (oldLayer.extractLsb' ((2 * i + 1) * w) w)) →
+    (oldLayer.cpopLayer newLayer hold).getLsbD k =
+      (extractLsb' (2 * ((k - k % w) / w) * w) w oldLayer +
+        extractLsb' ((2 * ((k - k % w) / w) + 1) * w) w oldLayer).getLsbD (k % w) := by
+  intro proof_addition
+  by_cases hw0 : w = 0
+  · subst hw0
+    simp
+  · simp only [← extractLsb'_cpopLayer (hold := by omega) proof_addition,
+      Nat.mod_lt (x := k) (y := w) (by omega), getLsbD_eq_getElem, getElem_extractLsb']
+    congr
+    by_cases hmod : k % w = 0
+    · rw [hmod, Nat.sub_zero, Nat.add_zero, Nat.div_mul_cancel (by omega)]
+    · rw [Nat.div_mul_cancel (by exact dvd_sub_mod k), Nat.sub_add_cancel (by exact mod_le k w)]
+
+@[simp]
+private theorem addRecAux_zero {x : BitVec (l * w)} {acc : BitVec w} :
+    x.addRecAux 0 acc = acc := rfl
+
+@[simp]
+private theorem addRecAux_succ {x : BitVec (l * w)} {n : Nat} {acc : BitVec w} :
+    x.addRecAux (n + 1) acc = x.addRecAux n (acc + extractLsb' (n * w) w x) := rfl
+
+private theorem addRecAux_eq {x : BitVec (l * w)} {n : Nat} {acc : BitVec w} :
+     x.addRecAux n acc = x.addRecAux n 0#w + acc := by
+  induction n generalizing acc
+  · case zero =>
+    simp
+  · case succ n ihn =>
+    simp only [addRecAux_succ, BitVec.zero_add, ihn (acc := extractLsb' (n * w) w x),
+      BitVec.add_assoc, ihn (acc := acc + extractLsb' (n * w) w x), BitVec.add_right_inj]
+    rw [BitVec.add_comm (x := acc)]
+
+private theorem extractLsb'_addRecAux_of_le {x : BitVec (len * w)} (h : r ≤ k) :
+    (extractLsb' 0 (k * w) x).addRecAux r 0#w = x.addRecAux r 0#w := by
+  induction r generalizing x len k
+  · case zero =>
+    simp [addRecAux]
+  · case succ diff ihdiff =>
+    simp only [addRecAux_succ, BitVec.zero_add]
+    have hext : diff * w + w ≤ k * w := by
+      simp only [show diff * w + w = (diff + 1) * w by simp [Nat.add_mul]]
+      exact Nat.mul_le_mul_right w h
+    rw [extractLsb'_extractLsb'_of_le hext, addRecAux_eq (x := x),
+        addRecAux_eq (x := extractLsb' 0 (k * w) x), ihdiff (x := x) (by omega) (k := k)]
+
+private theorem extractLsb'_extractAndExtend_eq {i len : Nat} {x : BitVec w} :
+    (extractAndExtend len x).extractLsb' (i * len) len = extractAndExtendBit i len x := by
+  unfold extractAndExtend
+  by_cases hilt : i < w
+  · ext j hj
+    simp [extractLsb'_extractAndExtendAux, extractAndExtendBit]
+  · ext k hk
+    have := Nat.mul_le_mul_right (n := w) (k := len) (m := i) (by omega)
+    simp only [extractAndExtendBit, cast_ofNat, getElem_extractLsb', truncate_eq_setWidth,
+      getElem_setWidth, getLsbD_extractLsb', Nat.lt_one_iff]
+    rw [getLsbD_of_ge, getLsbD_of_ge]
+    · simp
+    · omega
+    · omega
+
+private theorem addRecAux_append_extractLsb' {x : BitVec (len * w)} (ha : 0 < len) :
+    ((x.extractLsb' ((len - 1) * w) w ++
+      x.extractLsb' 0 ((len - 1) * w)).cast (m := len * w) hcast).addRecAux len 0#w =
+    x.extractLsb' ((len - 1) * w) w +
+      (x.extractLsb' 0 ((len - 1) * w)).addRecAux (len - 1) 0#w := by
+  simp only [extractLsb'_addRecAux_of_le (k := len - 1) (r := len - 1) (by omega),
+    BitVec.append_extractLsb'_of_lt (hcast := hcast)]
+  have hsucc := addRecAux_succ (x := x) (acc := 0#w) (n := len - 1)
+  rw [BitVec.zero_add, Nat.sub_one_add_one (by omega)] at hsucc
+  rw [hsucc, addRecAux_eq, BitVec.add_comm]
+
+private theorem Nat.mul_add_le_mul_of_succ_le {a b c : Nat} (h : a + 1 ≤ c) :
+    a * b + b ≤ c * b := by
+  rw [← Nat.succ_mul]
+  exact mul_le_mul_right b h
+
+/--
+  The recursive addition of `w`-long words on two flattened bitvectors `x` and `y` (with different
+  numbers of words, `len` and `len'` respectively) returns the same value if we can prove
+  that each `w`-long word in `x` results from the addition of two `w`-long words in `y`,
+  using each `w`-long word in `y` exactly once.
+-/
+private theorem addRecAux_eq_of {x : BitVec (len * w)} {y : BitVec (len' * w)}
+    (hlen : len = (len' + 1) / 2) :
+    (∀ (i : Nat) (_h : i < (len' + 1) / 2),
+      extractLsb' (i * w) w x = extractLsb' (2 * i * w) w y + extractLsb' ((2 * i + 1) * w) w y) →
+    x.addRecAux len 0#w = y.addRecAux len' 0#w := by
+  intro hadd
+  induction len generalizing len' y
+  · case zero =>
+    simp [show len' = 0 by omega]
+  · case succ len ih =>
+    have hcast : w + (len + 1 - 1) * w = (len + 1) * w := by
+      simp [Nat.add_mul, Nat.add_comm]
+    have hcast' :  w + (len' - 1) * w = len' * w := by
+      rw [Nat.sub_mul, Nat.one_mul,
+        ← Nat.add_sub_assoc (by refine Nat.le_mul_of_pos_left w (by omega)), Nat.add_comm]
+      simp
+    rw [addRecAux_succ, ← BitVec.append_extractLsb'_of_lt (x := x) (hcast := hcast)]
+    have happ := addRecAux_append_extractLsb' (len := len + 1) (x := x) (hcast := hcast) (by omega)
+    simp only [Nat.add_one_sub_one, addRecAux_succ, BitVec.zero_add] at happ
+    simp only [Nat.add_one_sub_one, BitVec.zero_add, happ]
+    have := Nat.succ_mul (n := len' - 1) (m := w)
+    rw [succ_eq_add_one, Nat.sub_one_add_one (by omega)] at this
+    by_cases hmod : len' % 2 = 0
+    · /- `sum` results from the addition of the two last elements in `y`, `sum = op1 + op2` -/
+      have := Nat.mul_le_mul_right (n := len' - 1 - 1) (m := len' - 1) (k := w) (by omega)
+      have := Nat.succ_mul (n := len' - 1 - 1) (m := w)
+      have hcast'' :  w + (len' - 1 - 1) * w = (len' - 1) * w := by
+        rw [Nat.sub_mul, Nat.one_mul,
+          ← Nat.add_sub_assoc (k := w) (by refine Nat.le_mul_of_pos_left w (by omega))]
+        simp
+      rw [succ_eq_add_one, Nat.sub_one_add_one (by omega)] at this
+      rw [← BitVec.append_extractLsb'_of_lt (x := y) (hcast := hcast'),
+        addRecAux_append_extractLsb' (by omega),
+        ← BitVec.append_extractLsb'_of_lt (x := extractLsb' 0 ((len' - 1) * w) y) (hcast := hcast''),
+        addRecAux_append_extractLsb' (by omega),
+        extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+        extractLsb'_extractLsb'_of_le (by omega), ← BitVec.add_assoc, hadd (_h := by omega)]
+      congr 1
+      · rw [show len = (len' + 1) / 2 - 1 by omega, BitVec.add_comm]
+        congr <;> omega
+      · apply ih
+        · omega
+        · intros
+          rw [extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            hadd (_h := by omega)]
+    · /- `sum` results from the addition of the last element in `y` with `0#w` -/
+      have : len' * w ≤ (len' - 1 + 1) * w := by exact mul_le_mul_right w (by omega)
+      rw [← BitVec.append_extractLsb'_of_lt (x := y) (hcast := hcast'),
+        addRecAux_append_extractLsb' (by omega), hadd (_h := by omega),
+        show 2 * len = len' - 1 by omega]
+      congr 1
+      · rw [BitVec.add_right_eq_self]
+        ext k hk
+        simp only [getElem_extractLsb', getElem_zero]
+        apply getLsbD_of_ge y ((len' - 1 + 1) * w + k) (by omega)
+      · apply ih
+        · omega
+        · intros
+          rw [extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            extractLsb'_extractLsb'_of_le (by exact Nat.mul_add_le_mul_of_succ_le (by omega)),
+            hadd (_h := by omega)]
+
+private theorem getLsbD_extractAndExtend_of_lt {x : BitVec w} (hk : k < v) :
+    (x.extractAndExtend v).getLsbD (pos * v + k) = (extractAndExtendBit pos v x).getLsbD k := by
+  simp [← extractLsb'_extractAndExtend_eq (w := w) (len := v) (i := pos) (x := x)]
+  omega
+
+/--
+  Extracting bit `k` from `BitVec.extractAndExtend v x` is the same as extracting bit `k % v`
+  from the zero-extended bit of `x` at position `(k - k % v) / v`.
+-/
+theorem getLsbD_extractAndExtend {x : BitVec w} (hv : 0 < v) :
+    (BitVec.extractAndExtend v x).getLsbD k =
+    (BitVec.extractAndExtendBit ((k - (k % v)) / v) v x).getLsbD (k % v) := by
+  rw [← getLsbD_extractAndExtend_of_lt (by exact mod_lt k hv)]
+  congr
+  by_cases hmod : k % v = 0
+  · simp only [hmod, Nat.sub_zero, Nat.add_zero]
+    rw [Nat.div_mul_cancel (by omega)]
+  · rw [← Nat.div_eq_sub_mod_div]
+    exact Eq.symm (div_add_mod' k v)
+
+private theorem addRecAux_extractAndExtend_eq_cpopNatRec {x : BitVec w} :
+    (extractAndExtend w x).addRecAux n 0#w = x.cpopNatRec n 0 := by
+  induction n
+  · case zero =>
+    simp
+  · case succ n' ihn' =>
+    rw [cpopNatRec_succ, Nat.zero_add, natCast_eq_ofNat, addRecAux_succ, BitVec.zero_add,
+      addRecAux_eq, cpopNatRec_eq, ihn', ofNat_add, natCast_eq_ofNat, BitVec.add_right_inj,
+      extractLsb'_extractAndExtend_eq]
+    ext k hk
+    simp only [extractAndExtendBit, ← getLsbD_eq_getElem, getLsbD_ofNat, hk, decide_true,
+      Bool.true_and, truncate_eq_setWidth, getLsbD_setWidth, getLsbD_extractLsb', Nat.lt_one_iff]
+    by_cases hk0 : k = 0
+    · simp only [hk0, testBit_zero, decide_true, Nat.add_zero, Bool.true_and]
+      cases x.getLsbD n' <;> simp
+    · simp only [show ¬k = 0 by omega, decide_false, Bool.false_and]
+      symm
+      apply testBit_lt_two_pow ?_
+      have : (x.getLsbD n').toNat ≤ 1 := by
+        cases x.getLsbD n' <;> simp
+      have : 1 < 2 ^ k := by exact Nat.one_lt_two_pow hk0
+      omega
+
+private theorem addRecAux_extractAndExtend_eq_cpop {x : BitVec w} :
+    (extractAndExtend w x).addRecAux w 0#w = x.cpop := by
+  simp only [cpop]
+  apply addRecAux_extractAndExtend_eq_cpopNatRec
+
+private theorem addRecAux_cpopTree {x : BitVec (len * w)} :
+    addRecAux ((cpopTree x).cast (m := 1 * w) (by simp)) 1 0#w = addRecAux x len 0#w := by
+  unfold cpopTree
+  split
+  · case _ h =>
+    subst h
+    simp [addRecAux]
+  · case _ h =>
+    split
+    · case _ h' =>
+      simp only [addRecAux_succ, Nat.zero_mul, BitVec.zero_add, addRecAux_zero, h']
+      ext; simp
+    · rw [addRecAux_cpopTree]
+      apply BitVec.addRecAux_eq_of (x := cpopLayer x 0#(0 * w) (by omega)) (y := x)
+      · rfl
+      · intros j hj
+        simp [extractLsb'_cpopLayer]
+termination_by len
+
+private theorem addRecAux_eq_cpopTree {x : BitVec (len * w)} :
+    x.addRecAux len 0#w = (x.cpopTree).cast (by simp) := by
+  rw [← addRecAux_cpopTree, addRecAux_succ, Nat.zero_mul, BitVec.zero_add, addRecAux_zero]
+  ext k hk
+  simp [← getLsbD_eq_getElem, hk]
+
+theorem cpop_eq_cpopRec {x : BitVec w} :
+    BitVec.cpop x = BitVec.cpopRec x := by
+  unfold BitVec.cpopRec
+  split
+  · simp [← addRecAux_extractAndExtend_eq_cpop, addRecAux_eq_cpopTree (x := extractAndExtend w x)]
+  · split
+    · ext k hk
+      cases hx : x.getLsbD 0
+      <;> simp [hx, cpop, ← getLsbD_eq_getElem, show k = 0 by omega, show w = 1 by omega]
+    · have hw : w = 0 := by omega
+      subst hw
+      simp [of_length_zero]
+
 end BitVec
--- a/src/Init/Data/BitVec/Lemmas.lean
+++ b/src/Init/Data/BitVec/Lemmas.lean
@@ -2786,6 +2786,14 @@ theorem msb_append {x : BitVec w} {y : BitVec v} :
  rw [getElem_append] -- Why does this not work with `simp [getElem_append]`?
  simp

+theorem append_of_zero_width (x : BitVec w) (y : BitVec v) (h : w = 0) :
+    (x ++ y) = y.cast (by simp [h]) := by
+  ext i ih
+  subst h
+  simp [← getLsbD_eq_getElem, getLsbD_append]
+  omega
+
+set_option backward.isDefEq.respectTransparency false in
@[grind =]
 theorem toInt_append {x : BitVec n} {y : BitVec m} :
    (x ++ y).toInt = if n == 0 then y.toInt else (2 ^ m) * x.toInt + y.toNat := by
@@ -3012,6 +3020,34 @@ theorem extractLsb'_append_extractLsb'_eq_extractLsb' {x : BitVec w} (h : start
  congr 1
  omega

+theorem append_extractLsb'_of_lt {x : BitVec (x_len * w)} :
+    (x.extractLsb' ((x_len - 1) * w) w ++ x.extractLsb' 0 ((x_len - 1) * w)).cast hcast = x := by
+  ext i hi
+  simp only [getElem_cast, getElem_append, getElem_extractLsb', Nat.zero_add, dite_eq_ite]
+  rw [← getLsbD_eq_getElem, ite_eq_left_iff, Nat.not_lt]
+  intros
+  simp only [show (x_len - 1) * w + (i - (x_len - 1) * w) = i by omega]
+
+
+theorem extractLsb'_append_of_lt {x : BitVec (k * w)} {y : BitVec w} (hlt : i < k) :
+    extractLsb' (i * w) w (y ++ x) = extractLsb' (i * w) w x := by
+  ext j hj
+  simp only [← getLsbD_eq_getElem, getLsbD_extractLsb', hj, decide_true, getLsbD_append,
+    Bool.true_and, ite_eq_left_iff, Nat.not_lt]
+  intros h
+  by_cases hw0 : w = 0
+  · subst hw0
+    simp
+  · have : i * w ≤ (k - 1) * w := Nat.mul_le_mul_right w (by omega)
+    have h' : i * w + j < (k - 1 + 1) * w := by simp [Nat.add_mul]; omega
+    rw [Nat.sub_one_add_one (by omega)] at h'
+    omega
+
+theorem extractLsb'_append_of_eq {x : BitVec (k * w)} {y : BitVec w} (heq : i = k) :
+    extractLsb' (i * w) w (y ++ x) = y := by
+  ext j hj
+  simp [← getLsbD_eq_getElem, getLsbD_append, hj, heq]
+
 /-- Combine adjacent `~~~ (extractLsb _)'` operations into a single `~~~ (extractLsb _)'`. -/
 theorem not_extractLsb'_append_not_extractLsb'_eq_not_extractLsb' {x : BitVec w} (h : start₂ = start₁ + len₁) :
    (~~~ (x.extractLsb' start₂ len₂) ++ ~~~ (x.extractLsb' start₁ len₁)) =
--- a/src/Init/Data/Bool.lean
+++ b/src/Init/Data/Bool.lean
@@ -629,6 +629,7 @@ export Bool (cond_eq_if cond_eq_ite xor and or not)
 This should not be turned on globally as an instance because it degrades performance in Mathlib,
 but may be used locally.
 -/
+@[implicit_reducible]
 def boolPredToPred : Coe (α → Bool) (α  → Prop) where
  coe r := fun a => Eq (r a) true

@@ -663,3 +664,6 @@ but may be used locally.

@[simp] theorem Bool.not'_eq_not (a : Bool) : a.not' = a.not := by
  cases a <;> simp [Bool.not']
+
+theorem Bool.rec_eq {α : Sort _} (b : Bool) {x y : α} : Bool.rec y x b = if b then x else y := by
+  cases b <;> simp
--- a/src/Init/Data/ByteArray/Basic.lean
+++ b/src/Init/Data/ByteArray/Basic.lean
@@ -469,5 +469,3 @@ def prevn : Iterator → Nat → Iterator

 end Iterator
 end ByteArray
-
-instance : ToString ByteArray := ⟨fun bs => bs.toList.toString⟩
--- a/src/Init/Data/Char/Basic.lean
+++ b/src/Init/Data/Char/Basic.lean
@@ -129,6 +129,14 @@ The ASCII digits are the following: `0123456789`.
@[inline] def isDigit (c : Char) : Bool :=
  c.val ≥ '0'.val && c.val ≤ '9'.val

+/--
+Returns `true` if the character is an ASCII hexadecimal digit.
+
+The ASCII hexadecimal digits are the following: `0123456789abcdefABCDEF`.
+-/
+@[inline] def isHexDigit (c : Char) : Bool :=
+  c.isDigit || (c.val ≥ 'a'.val && c.val ≤ 'f'.val) || (c.val ≥ 'A'.val && c.val ≤ 'F'.val)
+
 /--
 Returns `true` if the character is an ASCII letter or digit.

--- a/src/Init/Data/Char/Lemmas.lean
+++ b/src/Init/Data/Char/Lemmas.lean
@@ -62,7 +62,7 @@ instance ltTrichotomous : Std.Trichotomous (· < · : Char → Char → Prop) wh
  trichotomous _ _ h₁ h₂ := Char.le_antisymm (by simpa using h₂) (by simpa using h₁)

@[deprecated ltTrichotomous (since := "2025-10-27")]
-def notLTAntisymm : Std.Antisymm (¬ · < · : Char → Char → Prop) where
+theorem notLTAntisymm : Std.Antisymm (¬ · < · : Char → Char → Prop) where
  antisymm := Char.ltTrichotomous.trichotomous

 instance ltAsymm : Std.Asymm (· < · : Char → Char → Prop) where
@@ -73,7 +73,7 @@ instance leTotal : Std.Total (· ≤ · : Char → Char → Prop) where

 -- This instance is useful while setting up instances for `String`.
@[deprecated ltAsymm (since := "2025-08-01")]
-def notLTTotal : Std.Total (¬ · < · : Char → Char → Prop) where
+theorem notLTTotal : Std.Total (¬ · < · : Char → Char → Prop) where
  total := fun x y => by simpa using Char.le_total y x

@[simp] theorem ofNat_toNat (c : Char) : Char.ofNat c.toNat = c := by
@@ -86,4 +86,20 @@ theorem toUInt8_val {c : Char} : c.val.toUInt8 = c.toUInt8 := rfl
@[simp]
 theorem toString_eq_singleton {c : Char} : c.toString = String.singleton c := rfl

+@[simp]
+theorem toNat_val {c : Char} : c.val.toNat = c.toNat := rfl
+
+theorem val_inj {c d : Char} : c.val = d.val ↔ c = d :=
+  Char.ext_iff.symm
+
+theorem toNat_inj {c d : Char} : c.toNat = d.toNat ↔ c = d := by
+  simp [← toNat_val, ← val_inj, ← UInt32.toNat_inj]
+
+theorem isDigit_iff_toNat {c : Char} : c.isDigit ↔ '0'.toNat ≤ c.toNat ∧ c.toNat ≤ '9'.toNat := by
+  simp [isDigit, UInt32.le_iff_toNat_le]
+
+@[simp]
+theorem toNat_mk {val : UInt32} {h} : (Char.mk val h).toNat = val.toNat := by
+  simp [← toNat_val]
+
 end Char
--- a/src/Init/Data/Char/Ordinal.lean
+++ b/src/Init/Data/Char/Ordinal.lean
@@ -217,7 +217,7 @@ theorem succ?_eq {c : Char} : c.succ? = (c.ordinal.addNat? 1).map Char.ofOrdinal
          Nat.reduceLeDiff, UInt32.left_eq_add]
        grind [UInt32.lt_iff_toNat_lt]
      · grind
-    · simp [coe_ordinal]
+    · simp [coe_ordinal, -toNat_val]
      grind [UInt32.lt_iff_toNat_lt]
  | case2 =>
    rw [Fin.addNat?_eq_some]
--- a/src/Init/Data/FloatArray/Basic.lean
+++ b/src/Init/Data/FloatArray/Basic.lean
@@ -9,6 +9,7 @@ prelude
 public import Init.Data.Float
 import Init.Ext
 public import Init.GetElem
+public import Init.Data.ToString.Extra

 public section
 universe u
--- a/src/Init/Data/Format/Basic.lean
+++ b/src/Init/Data/Format/Basic.lean
@@ -414,7 +414,7 @@ Renders a `Format` to a string.
 -/
 def pretty (f : Format) (width : Nat := defWidth) (indent : Nat := 0) (column := 0) : String :=
  let act : StateM State Unit := prettyM f width indent
-  State.out <| act (State.mk "" column) |>.snd
+  State.out <| act.run (State.mk "" column) |>.snd

 end Format

--- a/src/Init/Data/Int.lean
+++ b/src/Init/Data/Int.lean
@@ -18,3 +18,4 @@ public import Init.Data.Int.Pow
 public import Init.Data.Int.Cooper
 public import Init.Data.Int.Linear
 public import Init.Data.Int.OfNat
+public import Init.Data.Int.ToString
--- a/src/Init/Data/Int/Pow.lean
+++ b/src/Init/Data/Int/Pow.lean
@@ -118,16 +118,19 @@ theorem toNat_pow_of_nonneg {x : Int} (h : 0 ≤ x) (k : Nat) : (x ^ k).toNat =
  | succ k ih =>
    rw [Int.pow_succ, Int.toNat_mul (Int.pow_nonneg h) h, ih, Nat.pow_succ]

-protected theorem sq_nonnneg (m : Int) : 0 ≤ m ^ 2 := by
+protected theorem sq_nonneg (m : Int) : 0 ≤ m ^ 2 := by
  rw [Int.pow_succ, Int.pow_one]
  cases m
  · apply Int.mul_nonneg <;> simp
  · apply Int.mul_nonneg_of_nonpos_of_nonpos <;> exact negSucc_le_zero _

+@[deprecated Int.sq_nonneg (since := "2026-03-13")]
+protected theorem sq_nonnneg (m : Int) : 0 ≤ m ^ 2 := Int.sq_nonneg m
+
 protected theorem pow_nonneg_of_even {m : Int} {n : Nat} (h : n % 2 = 0) : 0 ≤ m ^ n := by
  rw [← Nat.mod_add_div n 2, h, Nat.zero_add, Int.pow_mul]
  apply Int.pow_nonneg
-  exact Int.sq_nonnneg m
+  exact Int.sq_nonneg m

 protected theorem neg_pow {m : Int} {n : Nat} : (-m)^n = (-1)^(n % 2) * m^n := by
  rw [Int.neg_eq_neg_one_mul, Int.mul_pow]
--- a/src/Init/Data/Int/Repr.lean
+++ b/src/Init/Data/Int/Repr.lean
@@ -0,0 +1,24 @@
+/-
+Copyright (c) 2016 Microsoft Corporation. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Author: Leonardo de Moura
+-/
+module
+
+prelude
+public import Init.Data.Repr
+public import Init.Data.String.Defs
+
+namespace Int
+
+/--
+Returns the decimal string representation of an integer.
+-/
+public protected def repr : Int → String
+  | ofNat m   => Nat.repr m
+  | negSucc m => "-" ++ Nat.repr (Nat.succ m)
+
+public instance : Repr Int where
+  reprPrec i prec := if i < 0 then Repr.addAppParen i.repr prec else i.repr
+
+end Int
--- a/src/Init/Data/Int/ToString.lean
+++ b/src/Init/Data/Int/ToString.lean
@@ -0,0 +1,23 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Julia Markus Himmel
+-/
+module
+
+prelude
+public import Init.Data.ToString.Extra
+import all Init.Data.Int.Repr
+import Init.Data.Int.Order
+import Init.Data.Int.LemmasAux
+
+namespace Int
+
+public theorem repr_eq_if {a : Int} :
+    a.repr = if 0 ≤ a then a.toNat.repr else "-" ++ (-a).toNat.repr := by
+  cases a <;> simp [Int.repr]
+
+@[simp]
+public theorem toString_eq_repr {a : Int} : toString a = a.repr := (rfl)
+
+end Int
--- a/src/Init/Data/Iterators/Combinators.lean
+++ b/src/Init/Data/Iterators/Combinators.lean
@@ -6,6 +6,7 @@ Authors: Paul Reichert
 module

 prelude
+public import Init.Data.Iterators.Combinators.Append
 public import Init.Data.Iterators.Combinators.Monadic
 public import Init.Data.Iterators.Combinators.FilterMap
 public import Init.Data.Iterators.Combinators.FlatMap
--- a/src/Init/Data/Iterators/Combinators/Append.lean
+++ b/src/Init/Data/Iterators/Combinators/Append.lean
@@ -0,0 +1,79 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Iterators.Combinators.Monadic.Append
+
+public section
+
+namespace Std
+open Std.Iterators Std.Iterators.Types
+
+/--
+Given two iterators `it₁` and `it₂`, `it₁.append it₂` is an iterator that first outputs all values
+of `it₁` in order and then all values of `it₂` in order.
+
+**Marble diagram:**
+
+```text
+it₁                 ---a----b---c--⊥
+it₂                                 --d--e--⊥
+it₁.append it₂      ---a----b---c-----d--e--⊥
+```
+
+**Termination properties:**
+
+* `Finite` instance: only if `it₁` and `it₂` are finite
+* `Productive` instance: only if `it₁` and `it₂` are productive
+
+Note: If `it₁` is not finite, then `it₁.append it₂` can be productive while `it₂` is not.
+The standard library does not provide a `Productive` instance for this case.
+
+**Performance:**
+
+This combinator incurs an additional O(1) cost with each output of `it₁` and `it₂`.
+-/
+@[cbv_opaque, inline, expose]
+def Iter.append {α₁ : Type w} {α₂ : Type w} {β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β]
+    (it₁ : Iter (α := α₁) β) (it₂ : Iter (α := α₂) β) :
+    Iter (α := Append α₁ α₂ Id β) β :=
+  (it₁.toIterM.append it₂.toIterM).toIter
+
+/--
+This combinator is only useful for advanced use cases.
+
+Given an iterator `it₂`, returns an iterator that behaves exactly like `it₂` but is of the same
+type as `it₁.append it₂` (after `it₁` has been exhausted).
+This is useful for constructing intermediate states of the append iterator.
+
+**Marble diagram:**
+
+```text
+it₂                                 --a--b--⊥
+Iter.Intermediate.appendSnd α₁ it₂  --a--b--⊥
+```
+
+**Termination properties:**
+
+* `Finite` instance: only if `it₂` and iterators of type `α₁` are finite
+* `Productive` instance: only if `it₂` and iterators of type `α₁` are productive
+
+Note: If iterators of type `α₁` are not finite, then `appendSnd α₁ it₂` can be productive
+while `it₂` is not. The standard library does not provide a `Productive` instance for this case.
+
+**Performance:**
+
+This combinator incurs an additional O(1) cost with each output of `it₂`.
+-/
+@[inline, expose]
+def Iter.Intermediate.appendSnd {α₂ : Type w} {β : Type w}
+    [Iterator α₂ Id β] (α₁ : Type w) (it₂ : Iter (α := α₂) β) :
+    Iter (α := Append α₁ α₂ Id β) β :=
+  (IterM.Intermediate.appendSnd α₁ it₂.toIterM).toIter
+
+end Std
--- a/src/Init/Data/Iterators/Combinators/Attach.lean
+++ b/src/Init/Data/Iterators/Combinators/Attach.lean
@@ -13,7 +13,7 @@ public section
 namespace Std
 open Std.Iterators

-@[always_inline, inline, expose, inherit_doc IterM.attachWith]
+@[cbv_opaque, always_inline, inline, expose, inherit_doc IterM.attachWith]
 def Iter.attachWith {α β : Type w}
    [Iterator α Id β]
    (it : Iter (α := α) β) (P : β → Prop) (h : ∀ out, it.IsPlausibleIndirectOutput out → P out) :
--- a/src/Init/Data/Iterators/Combinators/FilterMap.lean
+++ b/src/Init/Data/Iterators/Combinators/FilterMap.lean
@@ -282,17 +282,17 @@ def Iter.mapM {α β γ : Type w} [Iterator α Id β] {m : Type w → Type w'}
    [Monad m] [MonadAttach m] (f : β → m γ) (it : Iter (α := α) β) :=
  (letI : MonadLift Id m := ⟨pure⟩; it.toIterM.mapM f : IterM m γ)

-@[always_inline, inline, inherit_doc IterM.filterMap, expose]
+@[cbv_opaque, always_inline, inline, inherit_doc IterM.filterMap, expose]
 def Iter.filterMap {α : Type w} {β : Type w} {γ : Type w} [Iterator α Id β]
    (f : β → Option γ) (it : Iter (α := α) β) :=
  ((it.toIterM.filterMap f).toIter : Iter γ)

-@[always_inline, inline, inherit_doc IterM.filter, expose]
+@[cbv_opaque, always_inline, inline, inherit_doc IterM.filter, expose]
 def Iter.filter {α : Type w} {β : Type w} [Iterator α Id β]
    (f : β → Bool) (it : Iter (α := α) β) :=
  ((it.toIterM.filter f).toIter : Iter β)

-@[always_inline, inline, inherit_doc IterM.map, expose]
+@[cbv_opaque, always_inline, inline, inherit_doc IterM.map, expose]
 def Iter.map {α : Type w} {β : Type w} {γ : Type w} [Iterator α Id β]
    (f : β → γ) (it : Iter (α := α) β) :=
  ((it.toIterM.map f).toIter : Iter γ)
--- a/src/Init/Data/Iterators/Combinators/FlatMap.lean
+++ b/src/Init/Data/Iterators/Combinators/FlatMap.lean
@@ -44,7 +44,7 @@ public def Iter.flatMapAfter {α : Type w} {β : Type w} {α₂ : Type w}
    (f : β → Iter (α := α₂) γ) (it₁ : Iter (α := α) β) (it₂ : Option (Iter (α := α₂) γ)) :=
  ((it₁.toIterM.flatMapAfter (fun b => (f b).toIterM) (Iter.toIterM <$> it₂)).toIter : Iter γ)

-@[always_inline, expose, inherit_doc IterM.flatMap]
+@[cbv_opaque, always_inline, expose, inherit_doc IterM.flatMap]
 public def Iter.flatMap {α : Type w} {β : Type w} {α₂ : Type w}
    {γ : Type w} [Iterator α Id β] [Iterator α₂ Id γ]
    (f : β → Iter (α := α₂) γ) (it : Iter (α := α) β) :=
--- a/src/Init/Data/Iterators/Combinators/Monadic.lean
+++ b/src/Init/Data/Iterators/Combinators/Monadic.lean
@@ -6,6 +6,7 @@ Authors: Paul Reichert
 module

 prelude
+public import Init.Data.Iterators.Combinators.Monadic.Append
 public import Init.Data.Iterators.Combinators.Monadic.FilterMap
 public import Init.Data.Iterators.Combinators.Monadic.FlatMap
 public import Init.Data.Iterators.Combinators.Monadic.Take
--- a/src/Init/Data/Iterators/Combinators/Monadic/Append.lean
+++ b/src/Init/Data/Iterators/Combinators/Monadic/Append.lean
@@ -0,0 +1,261 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Iterators.Consumers.Monadic.Loop
+public import Init.Classical
+import Init.Data.Option.Lemmas
+import Init.ByCases
+import Init.Omega
+
+public section
+
+/-!
+This module provides the iterator combinator `IterM.append`.
+-/
+
+namespace Std
+
+variable {α : Type w} {m : Type w → Type w'} {β : Type w}
+
+/--
+The internal state of the `IterM.append` iterator combinator.
+-/
+inductive Iterators.Types.Append (α₁ α₂ : Type w) (m : Type w → Type w') (β : Type w) where
+  | fst : IterM (α := α₁) m β → IterM (α := α₂) m β → Append α₁ α₂ m β
+  | snd : IterM (α := α₂) m β → Append α₁ α₂ m β
+
+open Std.Iterators Std.Iterators.Types
+
+/--
+Given two iterators `it₁` and `it₂`, `it₁.append it₂` is an iterator that first outputs all values
+of `it₁` in order and then all values of `it₂` in order.
+
+**Marble diagram:**
+
+```text
+it₁                 ---a----b---c--⊥
+it₂                                 --d--e--⊥
+it₁.append it₂      ---a----b---c-----d--e--⊥
+```
+
+**Termination properties:**
+
+* `Finite` instance: only if `it₁` and `it₂` are finite
+* `Productive` instance: only if `it₁` and `it₂` are productive
+
+Note: If `it₁` is not finite, then `it₁.append it₂` can be productive while `it₂` is not.
+The standard library does not provide a `Productive` instance for this case.
+
+**Performance:**
+
+This combinator incurs an additional O(1) cost with each output of `it₁` and `it₂`.
+-/
+@[inline, expose]
+def IterM.append [Iterator α₁ m β] [Iterator α₂ m β]
+    (it₁ : IterM (α := α₁) m β) (it₂ : IterM (α := α₂) m β) :=
+  (⟨Iterators.Types.Append.fst it₁ it₂⟩ : IterM m β)
+
+/--
+This combinator is only useful for advanced use cases.
+
+Given an iterator `it₂`, `IterM.Intermediate.appendSnd α₁ it₂` returns an iterator that behaves
+exactly like `it₂` but has the same type as `it₁.append it₂` (after `it₁` has been exhausted).
+This is useful for constructing intermediate states of the append iterator.
+
+**Marble diagram:**
+
+```text
+it₂                                  --a--b--⊥
+IterM.Intermediate.appendSnd α₁ it₂  --a--b--⊥
+```
+
+**Termination properties:**
+
+* `Finite` instance: only if `it₂` and iterators of type `α₁` are finite
+* `Productive` instance: only if `it₂` and iterators of type `α₁` are productive
+
+Note: If iterators of type `α₁` are not finite, then `appendSnd α₁ it₂` can be productive
+while `it₂` is not. The standard library does not provide a `Productive` instance for this case.
+
+**Performance:**
+
+This combinator incurs an additional O(1) cost with each output of `it₂`.
+-/
+@[inline, expose]
+def IterM.Intermediate.appendSnd [Iterator α₂ m β] (α₁ : Type w) (it₂ : IterM (α := α₂) m β) :=
+  (⟨Iterators.Types.Append.snd (α₁ := α₁) it₂⟩ : IterM m β)
+
+namespace Iterators.Types
+
+inductive Append.PlausibleStep [Iterator α₁ m β] [Iterator α₂ m β] :
+    IterM (α := Append α₁ α₂ m β) m β → IterStep (IterM (α := Append α₁ α₂ m β) m β) β → Prop where
+  | fstYield {it₁ : IterM (α := α₁) m β}  {it₂ : IterM (α := α₂) m β} :
+    it₁.IsPlausibleStep (.yield it₁' out) → PlausibleStep (it₁.append it₂) (.yield (it₁'.append it₂) out)
+  | fstSkip {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    it₁.IsPlausibleStep (.skip it₁') → PlausibleStep (it₁.append it₂) (.skip (it₁'.append it₂))
+  | fstDone {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    it₁.IsPlausibleStep .done → PlausibleStep (it₁.append it₂) (.skip (IterM.Intermediate.appendSnd α₁ it₂))
+  | sndYield {it₂ : IterM (α := α₂) m β} :
+    it₂.IsPlausibleStep (.yield it₂' out) →
+      PlausibleStep (IterM.Intermediate.appendSnd α₁ it₂) (.yield (IterM.Intermediate.appendSnd α₁ it₂') out)
+  | sndSkip {it₂ : IterM (α := α₂) m β} :
+    it₂.IsPlausibleStep (.skip it₂') →
+      PlausibleStep (IterM.Intermediate.appendSnd α₁ it₂) (.skip (IterM.Intermediate.appendSnd α₁ it₂'))
+  | sndDone {it₂ : IterM (α := α₂) m β} :
+    it₂.IsPlausibleStep .done → PlausibleStep (IterM.Intermediate.appendSnd α₁ it₂) .done
+
+@[inline]
+instance Append.instIterator [Monad m] [Iterator α₁ m β] [Iterator α₂ m β] :
+    Iterator (Append α₁ α₂ m β) m β where
+  IsPlausibleStep := Append.PlausibleStep
+  step
+    | ⟨.fst it₁ it₂⟩ => do
+      match (← it₁.step).inflate with
+      | .yield it₁' out h => return .deflate <| .yield (it₁'.append it₂) out (.fstYield h)
+      | .skip it₁' h => return .deflate <| .skip (it₁'.append it₂) (.fstSkip h)
+      | .done h => return .deflate <| .skip (IterM.Intermediate.appendSnd α₁ it₂) (.fstDone h)
+    | ⟨.snd it₂⟩ => do
+      match (← it₂.step).inflate with
+      | .yield it₂' out h => return .deflate <| .yield (IterM.Intermediate.appendSnd α₁ it₂') out (.sndYield h)
+      | .skip it₂' h => return .deflate <| .skip (IterM.Intermediate.appendSnd α₁ it₂') (.sndSkip h)
+      | .done h => return .deflate <| .done (.sndDone h)
+
+instance Append.instIteratorLoop {n : Type x → Type x'} [Monad m] [Monad n]
+    [Iterator α₁ m β] [Iterator α₂ m β] :
+    IteratorLoop (Append α₁ α₂ m β) m n :=
+  .defaultImplementation
+
+section Finite
+
+variable {α₁ : Type w} {α₂ : Type w} {m : Type w → Type w'} {β : Type w}
+
+variable (α₁ α₂ m β) in
+def Append.Rel [Monad m] [Iterator α₁ m β] [Iterator α₂ m β] [Finite α₁ m] [Finite α₂ m] :
+    IterM (α := Append α₁ α₂ m β) m β → IterM (α := Append α₁ α₂ m β) m β → Prop :=
+  InvImage
+    (Prod.Lex
+      (Option.lt (InvImage IterM.TerminationMeasures.Finite.Rel IterM.finitelyManySteps))
+      (InvImage IterM.TerminationMeasures.Finite.Rel IterM.finitelyManySteps))
+    (fun it => match it.internalState with
+      | .fst it₁ it₂ => (some it₁, it₂)
+      | .snd it₂ => (none, it₂))
+
+theorem Append.rel_of_fst [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Finite α₁ m] [Finite α₂ m] {it₁ it₁' : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β}
+    (h : it₁'.finitelyManySteps.Rel it₁.finitelyManySteps) :
+    Append.Rel α₁ α₂ m β (it₁'.append it₂) (it₁.append it₂) := by
+  exact Prod.Lex.left _ _ h
+
+theorem Append.rel_fstDone [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Finite α₁ m] [Finite α₂ m] {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    Append.Rel α₁ α₂ m β (IterM.Intermediate.appendSnd α₁ it₂) (it₁.append it₂) := by
+  exact Prod.Lex.left _ _ trivial
+
+theorem Append.rel_of_snd [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Finite α₁ m] [Finite α₂ m] {it₂ it₂' : IterM (α := α₂) m β}
+    (h : it₂'.finitelyManySteps.Rel it₂.finitelyManySteps) :
+    Append.Rel α₁ α₂ m β (IterM.Intermediate.appendSnd α₁ it₂') (IterM.Intermediate.appendSnd α₁ it₂) := by
+  exact Prod.Lex.right _ h
+
+def Append.instFinitenessRelation [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Finite α₁ m] [Finite α₂ m] :
+    FinitenessRelation (Append α₁ α₂ m β) m where
+  Rel := Append.Rel α₁ α₂ m β
+  wf := by
+    apply InvImage.wf
+    refine ⟨fun (a, b) => Prod.lexAccessible (WellFounded.apply ?_ a) (WellFounded.apply ?_) b⟩
+    · exact Option.wellFounded_lt <| InvImage.wf _ WellFoundedRelation.wf
+    · exact InvImage.wf _ WellFoundedRelation.wf
+  subrelation {it it'} h := by
+    obtain ⟨step, h, h'⟩ := h
+    cases h' <;> cases h
+    case fstYield =>
+      apply Append.rel_of_fst
+      exact IterM.TerminationMeasures.Finite.rel_of_yield ‹_›
+    case fstSkip =>
+      apply Append.rel_of_fst
+      exact IterM.TerminationMeasures.Finite.rel_of_skip ‹_›
+    case fstDone =>
+      exact Append.rel_fstDone
+    case sndYield =>
+      apply Append.rel_of_snd
+      exact IterM.TerminationMeasures.Finite.rel_of_yield ‹_›
+    case sndSkip =>
+      apply Append.rel_of_snd
+      exact IterM.TerminationMeasures.Finite.rel_of_skip ‹_›
+
+@[no_expose]
+public instance Append.instFinite [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Finite α₁ m] [Finite α₂ m] : Finite (Append α₁ α₂ m β) m :=
+  .of_finitenessRelation instFinitenessRelation
+
+end Finite
+
+section Productive
+
+variable {α₁ : Type w} {α₂ : Type w} {m : Type w → Type w'} {β : Type w}
+
+variable (α₁ α₂ m β) in
+def Append.ProductiveRel [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] :
+    IterM (α := Append α₁ α₂ m β) m β → IterM (α := Append α₁ α₂ m β) m β → Prop :=
+  InvImage
+    (Prod.Lex
+      (Option.lt (InvImage IterM.TerminationMeasures.Productive.Rel IterM.finitelyManySkips))
+      (InvImage IterM.TerminationMeasures.Productive.Rel IterM.finitelyManySkips))
+    (fun it => match it.internalState with
+      | .fst it₁ it₂ => (some it₁, it₂)
+      | .snd it₂ => (none, it₂))
+
+theorem Append.productiveRel_of_fst [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] {it₁ it₁' : IterM (α := α₁) m β}
+    {it₂ : IterM (α := α₂) m β}
+    (h : it₁'.finitelyManySkips.Rel it₁.finitelyManySkips) :
+    Append.ProductiveRel α₁ α₂ m β (it₁'.append it₂) (it₁.append it₂) := by
+  exact Prod.Lex.left _ _ h
+
+theorem Append.productiveRel_fstDone [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] {it₁ : IterM (α := α₁) m β}
+    {it₂ : IterM (α := α₂) m β} :
+    Append.ProductiveRel α₁ α₂ m β (IterM.Intermediate.appendSnd α₁ it₂) (it₁.append it₂) := by
+  exact Prod.Lex.left _ _ trivial
+
+theorem Append.productiveRel_of_snd [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] {it₂ it₂' : IterM (α := α₂) m β}
+    (h : it₂'.finitelyManySkips.Rel it₂.finitelyManySkips) :
+    Append.ProductiveRel α₁ α₂ m β
+      (IterM.Intermediate.appendSnd α₁ it₂') (IterM.Intermediate.appendSnd α₁ it₂) := by
+  exact Prod.Lex.right _ h
+
+private def Append.instProductivenessRelation [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] :
+    ProductivenessRelation (Append α₁ α₂ m β) m where
+  Rel := Append.ProductiveRel α₁ α₂ m β
+  wf := by
+    apply InvImage.wf
+    refine ⟨fun (a, b) => Prod.lexAccessible (WellFounded.apply ?_ a) (WellFounded.apply ?_) b⟩
+    · exact Option.wellFounded_lt <| InvImage.wf _ WellFoundedRelation.wf
+    · exact InvImage.wf _ WellFoundedRelation.wf
+  subrelation {it it'} h := by
+    cases h
+    case fstSkip =>
+      apply Append.productiveRel_of_fst
+      exact IterM.TerminationMeasures.Productive.rel_of_skip ‹_›
+    case fstDone =>
+      exact Append.productiveRel_fstDone
+    case sndSkip =>
+      apply Append.productiveRel_of_snd
+      exact IterM.TerminationMeasures.Productive.rel_of_skip ‹_›
+
+instance Append.instProductive [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    [Productive α₁ m] [Productive α₂ m] : Productive (Append α₁ α₂ m β) m :=
+  .of_productivenessRelation instProductivenessRelation
+
+end Productive
+
+end Std.Iterators.Types
--- a/src/Init/Data/Iterators/Combinators/Monadic/FilterMap.lean
+++ b/src/Init/Data/Iterators/Combinators/Monadic/FilterMap.lean
@@ -168,6 +168,13 @@ instance Map.instIterator {α β γ : Type w} {m : Type w → Type w'} {n : Type
    Iterator (Map α m n lift f) n γ :=
  inferInstanceAs <| Iterator (FilterMap α m n lift _) n γ

+theorem Map.instIterator_eq_filterMapInstIterator {α β γ : Type w} {m : Type w → Type w'}
+    {n : Type w → Type w''} [Monad n]
+    [Iterator α m β] {lift : ⦃α : Type w⦄ → m α → n α} {f : β → PostconditionT n γ} :
+    Map.instIterator (α := α) (β := β) (γ := γ) (m := m) (n := n) (lift := lift) (f := f) =
+      FilterMap.instIterator :=
+  rfl
+
 private def FilterMap.instFinitenessRelation {α β γ : Type w} {m : Type w → Type w'}
    {n : Type w → Type w''} [Monad n] [Iterator α m β] {lift : ⦃α : Type w⦄ → m α → n α}
    {f : β → PostconditionT n (Option γ)} [Finite α m] :
--- a/src/Init/Data/Iterators/Combinators/Monadic/FlatMap.lean
+++ b/src/Init/Data/Iterators/Combinators/Monadic/FlatMap.lean
@@ -362,8 +362,7 @@ def Flatten.instProductivenessRelation [Monad m] [Iterator α m (IterM (α := α
    case innerDone =>
      apply Flatten.productiveRel_of_right₂

-@[no_expose]
-public def Flatten.instProductive [Monad m] [Iterator α m (IterM (α := α₂) m β)] [Iterator α₂ m β]
+public theorem Flatten.instProductive [Monad m] [Iterator α m (IterM (α := α₂) m β)] [Iterator α₂ m β]
    [Finite α m] [Productive α₂ m] : Productive (Flatten α α₂ β m) m :=
  .of_productivenessRelation instProductivenessRelation

--- a/src/Init/Data/Iterators/Combinators/Take.lean
+++ b/src/Init/Data/Iterators/Combinators/Take.lean
@@ -36,7 +36,7 @@ it.take 3   ---a--⊥

 This combinator incurs an additional O(1) cost with each output of `it`.
 -/
-@[always_inline, inline]
+@[cbv_opaque, always_inline, inline]
 def Iter.take {α : Type w} {β : Type w} [Iterator α Id β] (n : Nat) (it : Iter (α := α) β) :
    Iter (α := Take α Id) β :=
  it.toIterM.take n |>.toIter
--- a/src/Init/Data/Iterators/Combinators/ULift.lean
+++ b/src/Init/Data/Iterators/Combinators/ULift.lean
@@ -44,7 +44,7 @@ it.uLift n    ---.up a----.up b---.up c--.up d---⊥
 * `Finite`: only if the original iterator is finite
 * `Productive`: only if the original iterator is productive
 -/
-@[always_inline, inline, expose]
+@[cbv_opaque, always_inline, inline, expose]
 def Iter.uLift (it : Iter (α := α) β) :
    Iter (α := Types.ULiftIterator.{v} α Id Id β (fun _ => monadLift)) (ULift β) :=
  (it.toIterM.uLift Id).toIter
--- a/src/Init/Data/Iterators/Consumers/Collect.lean
+++ b/src/Init/Data/Iterators/Consumers/Collect.lean
@@ -32,7 +32,7 @@ Traverses the given iterator and stores the emitted values in an array.
 If the iterator is not finite, this function might run forever. The variant
 `it.ensureTermination.toArray` always terminates after finitely many steps.
 -/
-@[always_inline, inline]
+@[cbv_opaque, always_inline, inline]
 def Iter.toArray {α : Type w} {β : Type w}
    [Iterator α Id β] (it : Iter (α := α) β) : Array β :=
  it.toIterM.toArray.run
@@ -101,7 +101,7 @@ lists are prepend-only, `toListRev` is usually more efficient than `toList`.
 If the iterator is not finite, this function might run forever. The variant
 `it.ensureTermination.toList` always terminates after finitely many steps.
 -/
-@[always_inline, inline]
+@[cbv_opaque, always_inline, inline]
 def Iter.toList {α : Type w} {β : Type w}
    [Iterator α Id β] (it : Iter (α := α) β) : List β :=
  it.toIterM.toList.run
--- a/src/Init/Data/Iterators/Consumers/Loop.lean
+++ b/src/Init/Data/Iterators/Consumers/Loop.lean
@@ -35,7 +35,7 @@ A `ForIn'` instance for iterators. Its generic membership relation is not easy t
 so this is not marked as `instance`. This way, more convenient instances can be built on top of it
 or future library improvements will make it more comfortable.
 -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def Iter.instForIn' {α : Type w} {β : Type w} {n : Type x → Type x'} [Monad n]
    [Iterator α Id β] [IteratorLoop α Id n] :
    ForIn' n (Iter (α := α) β) β ⟨fun it out => it.IsPlausibleIndirectOutput out⟩ where
@@ -53,7 +53,7 @@ instance (α : Type w) (β : Type w) (n : Type x → Type x') [Monad n]
 /--
 An implementation of `for h : ... in ... do ...` notation for partial iterators.
 -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def Iter.Partial.instForIn' {α : Type w} {β : Type w} {n : Type x → Type x'} [Monad n]
    [Iterator α Id β] [IteratorLoop α Id n] :
    ForIn' n (Iter.Partial (α := α) β) β ⟨fun it out => it.it.IsPlausibleIndirectOutput out⟩ where
@@ -71,7 +71,7 @@ instance (α : Type w) (β : Type w) (n : Type x → Type x') [Monad n]
 A `ForIn'` instance for iterators that is guaranteed to terminate after finitely many steps.
 It is not marked as an instance because the membership predicate is difficult to work with.
 -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def Iter.Total.instForIn' {α : Type w} {β : Type w} {n : Type x → Type x'} [Monad n]
    [Iterator α Id β] [IteratorLoop α Id n] [Finite α Id] :
    ForIn' n (Iter.Total (α := α) β) β ⟨fun it out => it.it.IsPlausibleIndirectOutput out⟩ where
--- a/src/Init/Data/Iterators/Consumers/Monadic/Loop.lean
+++ b/src/Init/Data/Iterators/Consumers/Monadic/Loop.lean
@@ -159,7 +159,7 @@ This is the default implementation of the `IteratorLoop` class.
 It simply iterates through the iterator using `IterM.step`. For certain iterators, more efficient
 implementations are possible and should be used instead.
 -/
-@[always_inline, inline, expose]
+@[always_inline, inline, expose, implicit_reducible]
 def IteratorLoop.defaultImplementation {α : Type w} {m : Type w → Type w'} {n : Type x → Type x'}
    [Monad n] [Iterator α m β] :
    IteratorLoop α m n where
@@ -211,7 +211,7 @@ theorem IteratorLoop.wellFounded_of_productive {α β : Type w} {m : Type w →
 /--
 This `ForIn'`-style loop construct traverses a finite iterator using an `IteratorLoop` instance.
 -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def IteratorLoop.finiteForIn' {m : Type w → Type w'} {n : Type x → Type x'}
    {α : Type w} {β : Type w} [Iterator α m β] [IteratorLoop α m n] [Monad n]
    (lift : ∀ γ δ, (γ → n δ) → m γ → n δ) :
@@ -224,7 +224,7 @@ A `ForIn'` instance for iterators. Its generic membership relation is not easy t
 so this is not marked as `instance`. This way, more convenient instances can be built on top of it
 or future library improvements will make it more comfortable.
 -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def IterM.instForIn' {m : Type w → Type w'} {n : Type w → Type w''}
    {α : Type w} {β : Type w} [Iterator α m β] [IteratorLoop α m n] [Monad n]
    [MonadLiftT m n] :
@@ -239,7 +239,7 @@ instance IterM.instForInOfIteratorLoop {m : Type w → Type w'} {n : Type w →
  instForInOfForIn'

 /-- Internal implementation detail of the iterator library. -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def IterM.Partial.instForIn' {m : Type w → Type w'} {n : Type w → Type w''}
    {α : Type w} {β : Type w} [Iterator α m β] [IteratorLoop α m n] [MonadLiftT m n] [Monad n] :
    ForIn' n (IterM.Partial (α := α) m β) β ⟨fun it out => it.it.IsPlausibleIndirectOutput out⟩ where
@@ -247,7 +247,7 @@ def IterM.Partial.instForIn' {m : Type w → Type w'} {n : Type w → Type w''}
    haveI := @IterM.instForIn'; forIn' it.it init f

 /-- Internal implementation detail of the iterator library. -/
-@[always_inline, inline]
+@[always_inline, inline, expose, implicit_reducible]
 def IterM.Total.instForIn' {m : Type w → Type w'} {n : Type w → Type w''}
    {α : Type w} {β : Type w} [Iterator α m β] [IteratorLoop α m n] [MonadLiftT m n] [Monad n]
    [Finite α m] :
--- a/src/Init/Data/Iterators/Internal/LawfulMonadLiftFunction.lean
+++ b/src/Init/Data/Iterators/Internal/LawfulMonadLiftFunction.lean
@@ -70,7 +70,7 @@ theorem LawfulMonadLiftFunction.lift_seqRight [LawfulMonad m] [LawfulMonad n]
 abbrev idToMonad [Monad m] ⦃α : Type u⦄ (x : Id α) : m α :=
    pure x.run

-def LawfulMonadLiftFunction.idToMonad [Monad m] [LawfulMonad m] :
+theorem LawfulMonadLiftFunction.idToMonad [LawfulMonad m] :
    LawfulMonadLiftFunction (m := Id) (n := m) idToMonad where
  lift_pure := by simp [Internal.idToMonad]
  lift_bind := by simp [Internal.idToMonad]
@@ -95,7 +95,7 @@ instance [LawfulMonadLiftBindFunction (n := n) (fun _ _ f x => lift x >>= f)] [L
    simpa using LawfulMonadLiftBindFunction.liftBind_bind (n := n)
      (liftBind := fun _ _ f x => lift x >>= f) (β := β) (γ := γ) (δ := γ) pure x g

-def LawfulMonadLiftBindFunction.id [Monad m] [LawfulMonad m] :
+theorem LawfulMonadLiftBindFunction.id [LawfulMonad m] :
    LawfulMonadLiftBindFunction (m := Id) (n := m) (fun _ _ f x => f x.run) where
  liftBind_pure := by simp
  liftBind_bind := by simp
--- a/src/Init/Data/Iterators/Lemmas/Combinators.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators.lean
@@ -6,6 +6,7 @@ Authors: Paul Reichert
 module

 prelude
+public import Init.Data.Iterators.Lemmas.Combinators.Append
 public import Init.Data.Iterators.Lemmas.Combinators.Attach
 public import Init.Data.Iterators.Lemmas.Combinators.Monadic
 public import Init.Data.Iterators.Lemmas.Combinators.FilterMap
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Append.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Append.lean
@@ -0,0 +1,193 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Iterators.Combinators.Append
+public import Init.Data.Iterators.Lemmas.Combinators.Monadic.Append
+public import Init.Data.Iterators.Consumers.Collect
+public import Init.Data.Iterators.Consumers.Access
+import Init.Data.Iterators.Lemmas.Consumers.Collect
+import Init.Data.Iterators.Lemmas.Consumers.Access
+import Init.Data.Iterators.Lemmas.Basic
+import Init.Omega
+
+public section
+
+namespace Std
+open Std.Iterators Std.Iterators.Types
+
+theorem Iter.append_eq_toIter_append_toIterM {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} :
+    it₁.append it₂ = (it₁.toIterM.append it₂.toIterM).toIter :=
+  rfl
+
+theorem Iter.Intermediate.appendSnd_eq_toIter_appendSnd_toIterM {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β]
+    {it₂ : Iter (α := α₂) β} :
+    Iter.Intermediate.appendSnd α₁ it₂ = (IterM.Intermediate.appendSnd α₁ it₂.toIterM).toIter :=
+  rfl
+
+theorem Iter.step_append {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} :
+    (it₁.append it₂).step =
+      match it₁.step with
+      | .yield it₁' out h => .yield (it₁'.append it₂) out (.fstYield h)
+      | .skip it₁' h => .skip (it₁'.append it₂) (.fstSkip h)
+      | .done h => .skip (Iter.Intermediate.appendSnd α₁ it₂) (.fstDone h) := by
+  simp only [Iter.step, append_eq_toIter_append_toIterM, toIterM_toIter, IterM.step_append,
+    Id.run_bind]
+  cases it₁.toIterM.step.run.inflate using PlausibleIterStep.casesOn <;>
+    simp [Intermediate.appendSnd_eq_toIter_appendSnd_toIterM]
+
+theorem Iter.Intermediate.step_appendSnd {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β]
+    {it₂ : Iter (α := α₂) β} :
+    (Iter.Intermediate.appendSnd α₁ it₂).step =
+      match it₂.step with
+      | .yield it₂' out h => .yield (Iter.Intermediate.appendSnd α₁ it₂') out (.sndYield h)
+      | .skip it₂' h => .skip (Iter.Intermediate.appendSnd α₁ it₂') (.sndSkip h)
+      | .done h => .done (.sndDone h) := by
+  simp only [Iter.step, appendSnd, toIterM_toIter, IterM.Intermediate.step_appendSnd, Id.run_bind]
+  cases it₂.toIterM.step.run.inflate using PlausibleIterStep.casesOn <;> simp
+
+@[cbv_eval, simp]
+theorem Iter.toList_append {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Finite α₁ Id] [Finite α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} :
+    (it₁.append it₂).toList = it₁.toList ++ it₂.toList := by
+  simp [append_eq_toIter_append_toIterM, toList_eq_toList_toIterM]
+
+@[simp]
+theorem Iter.toListRev_append {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Finite α₁ Id] [Finite α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} :
+    (it₁.append it₂).toListRev = it₂.toListRev ++ it₁.toListRev := by
+  simp [append_eq_toIter_append_toIterM, toListRev_eq_toListRev_toIterM]
+
+@[cbv_eval, simp]
+theorem Iter.toArray_append {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Finite α₁ Id] [Finite α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} :
+    (it₁.append it₂).toArray = it₁.toArray ++ it₂.toArray := by
+  simp [append_eq_toIter_append_toIterM, toArray_eq_toArray_toIterM]
+
+@[simp]
+theorem Iter.atIdxSlow?_appendSnd {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Productive α₁ Id] [Productive α₂ Id]
+    {it₂ : Iter (α := α₂) β} {n : Nat} :
+    (Iter.Intermediate.appendSnd α₁ it₂).atIdxSlow? n = it₂.atIdxSlow? n := by
+  induction n, it₂ using Iter.atIdxSlow?.induct_unfolding with
+  | yield_zero it it' out h h' =>
+    simp only [atIdxSlow?_eq_match (it := Iter.Intermediate.appendSnd α₁ it),
+      Intermediate.step_appendSnd, h']
+  | yield_succ it it' out h h' n ih =>
+    simp only [atIdxSlow?_eq_match (it := Iter.Intermediate.appendSnd α₁ it),
+      Intermediate.step_appendSnd, h', ih]
+  | skip_case n it it' h h' ih =>
+    simp only [atIdxSlow?_eq_match (it := Iter.Intermediate.appendSnd α₁ it),
+      Intermediate.step_appendSnd, h', ih]
+  | done_case n it h h' =>
+    simp only [atIdxSlow?_eq_match (it := Iter.Intermediate.appendSnd α₁ it),
+      Intermediate.step_appendSnd, h']
+
+theorem Iter.atIdxSlow?_append_of_eq_some {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Productive α₁ Id] [Productive α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} {n : Nat} {b : β}
+    (h : it₁.atIdxSlow? n = some b) :
+    (it₁.append it₂).atIdxSlow? n = some b := by
+  induction n, it₁ using Iter.atIdxSlow?.induct_unfolding generalizing it₂ with
+  | yield_zero it it' out hp h' =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    cases h
+    simp [step_append, h']
+  | yield_succ it it' out hp h' n ih =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    simp [step_append, h', ih h]
+  | skip_case n it it' hp h' ih =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    simp [step_append, h', ih h]
+  | done_case n it hp h' =>
+    cases h
+
+theorem Iter.atIdxSlow?_append {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Finite α₁ Id] [Productive α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} {n : Nat} :
+    (it₁.append it₂).atIdxSlow? n =
+      if n < it₁.toList.length then it₁.atIdxSlow? n
+      else it₂.atIdxSlow? (n - it₁.toList.length) := by
+  induction n, it₁ using Iter.atIdxSlow?.induct_unfolding generalizing it₂ with
+  | yield_zero it it' out h h' =>
+    simp only [atIdxSlow?_eq_match (it := it.append it₂), step_append, h']
+    rw [toList_eq_match_step (it := it)]
+    simp [h']
+  | yield_succ it it' out h h' n ih =>
+    simp only [atIdxSlow?_eq_match (it := it.append it₂), step_append, h', ih]
+    rw [toList_eq_match_step (it := it)]
+    simp [h', Nat.succ_lt_succ_iff, Nat.succ_sub_succ]
+  | skip_case n it it' h h' ih =>
+    simp only [atIdxSlow?_eq_match (it := it.append it₂), step_append, h', ih]
+    rw [toList_eq_match_step (it := it)]
+    simp [h']
+  | done_case n it h h' =>
+    simp [atIdxSlow?_eq_match (it := it.append it₂), step_append, h',
+      atIdxSlow?_appendSnd, toList_eq_match_step]
+
+theorem Iter.atIdxSlow?_append_of_productive {α₁ α₂ β : Type w}
+    [Iterator α₁ Id β] [Iterator α₂ Id β] [Productive α₁ Id] [Productive α₂ Id]
+    {it₁ : Iter (α := α₁) β} {it₂ : Iter (α := α₂) β} {n k : Nat}
+    (hk : it₁.atIdxSlow? k = none)
+    (hmin : ∀ j, j < k → (it₁.atIdxSlow? j).isSome)
+    (hle : k ≤ n) :
+    (it₁.append it₂).atIdxSlow? n = it₂.atIdxSlow? (n - k) := by
+  induction n, it₁ using Iter.atIdxSlow?.induct_unfolding generalizing k it₂ with
+  | yield_zero it it' out hp h' =>
+    exfalso
+    have : k = 0 := by omega
+    subst this
+    rw [atIdxSlow?_eq_match (it := it)] at hk
+    simp [h'] at hk
+  | yield_succ it it' out hp h' n ih =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    simp only [step_append, h']
+    match k with
+    | 0 =>
+      rw [atIdxSlow?_eq_match (it := it)] at hk
+      simp [h'] at hk
+    | k + 1 =>
+      rw [atIdxSlow?_eq_match (it := it)] at hk
+      simp [h'] at hk
+      have hmin' : ∀ j, j < k → (it'.atIdxSlow? j).isSome := by
+        intro j hj
+        have h := hmin (j + 1) (by omega)
+        rw [atIdxSlow?_eq_match (it := it)] at h
+        simpa [h'] using h
+      rw [ih hk hmin' (by omega)]
+      congr 1
+      omega
+  | skip_case n it it' hp h' ih =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    simp only [step_append, h']
+    rw [atIdxSlow?_eq_match (it := it)] at hk; simp [h'] at hk
+    have hmin' : ∀ j, j < k → (it'.atIdxSlow? j).isSome := by
+      intro j hj
+      have h := hmin j hj
+      rw [atIdxSlow?_eq_match (it := it)] at h
+      simpa [h'] using h
+    exact ih hk hmin' hle
+  | done_case n it hp h' =>
+    rw [atIdxSlow?_eq_match (it := it.append it₂)]
+    simp only [step_append, h', atIdxSlow?_appendSnd]
+    have hk0 : k = 0 := by
+      false_or_by_contra
+      have h := hmin 0 (by omega)
+      rw [atIdxSlow?_eq_match (it := it)] at h
+      simp [h'] at h
+    simp [hk0]
+
+end Std
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Attach.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Attach.lean
@@ -34,7 +34,7 @@ theorem Iter.unattach_toList_attachWith [Iterator α Id β]
    ← Id.run_map (f := List.unattach), IterM.map_unattach_toList_attachWith,
    Iter.toList_eq_toList_toIterM]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toList_attachWith [Iterator α Id β]
    {it : Iter (α := α) β} {hP}
    [Finite α Id] :
@@ -68,7 +68,7 @@ theorem Iter.unattach_toArray_attachWith [Iterator α Id β]
    (it.attachWith P hP).toListRev.unattach = it.toListRev := by
  simp [toListRev_eq]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toArray_attachWith [Iterator α Id β]
    {it : Iter (α := α) β} {hP}
    [Finite α Id] :
--- a/src/Init/Data/Iterators/Lemmas/Combinators/FilterMap.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/FilterMap.lean
@@ -297,7 +297,7 @@ def Iter.val_step_filter {f : β → Bool} :
  · simp
  · simp

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toList_filterMap [Finite α Id]
    {f : β → Option γ} :
    (it.filterMap f).toList = it.toList.filterMap f := by
@@ -315,12 +315,12 @@ theorem Iter.toList_mapM [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawful
    (it.mapM f).toList = it.toList.mapM f := by
  simp [Iter.mapM_eq_toIter_mapM_toIterM, IterM.toList_mapM, Iter.toList_eq_toList_toIterM]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toList_map [Finite α Id] {f : β → γ} :
    (it.map f).toList = it.toList.map f := by
  simp [map_eq_toIter_map_toIterM, IterM.toList_map, Iter.toList_eq_toList_toIterM]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toList_filter [Finite α Id] {f : β → Bool} :
    (it.filter f).toList = it.toList.filter f := by
  simp [filter_eq_toIter_filter_toIterM, IterM.toList_filter, Iter.toList_eq_toList_toIterM]
@@ -369,7 +369,7 @@ theorem Iter.toListRev_filter [Finite α Id]
    (it.filter f).toListRev = it.toListRev.filter f := by
  simp [filter_eq_toIter_filter_toIterM, IterM.toListRev_filter, Iter.toListRev_eq_toListRev_toIterM]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toArray_filterMap [Finite α Id]
    {f : β → Option γ} :
    (it.filterMap f).toArray = it.toArray.filterMap f := by
@@ -387,13 +387,13 @@ theorem Iter.toArray_mapM [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfu
    (it.mapM f).toArray = it.toArray.mapM f := by
  simp [Iter.mapM_eq_toIter_mapM_toIterM, IterM.toArray_mapM, Iter.toArray_eq_toArray_toIterM]

-@[simp]
+@[cbv_eval, simp]
 theorem Iter.toArray_map [Finite α Id] {f : β → γ} :
    (it.map f).toArray = it.toArray.map f := by
  simp [map_eq_toIter_map_toIterM, IterM.toArray_map, Iter.toArray_eq_toArray_toIterM]

-@[simp]
-theorem Iter.toArray_filter[Finite α Id] {f : β → Bool} :
+@[cbv_eval, simp]
+theorem Iter.toArray_filter [Finite α Id] {f : β → Bool} :
    (it.filter f).toArray = it.toArray.filter f := by
  simp [filter_eq_toIter_filter_toIterM, IterM.toArray_filter, Iter.toArray_eq_toArray_toIterM]

@@ -435,8 +435,9 @@ theorem Iter.forIn_filterMapWithPostcondition
        match ← (f out).run with
        | some c => g c acc
        | none => return .yield acc) := by
-  simp +instances [Iter.forIn_eq_forIn_toIterM, filterMapWithPostcondition, IterM.forIn_filterMapWithPostcondition,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]; rfl
+  simp only [filterMapWithPostcondition, IterM.forIn_filterMapWithPostcondition, forIn_eq_forIn_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  rfl -- expressions are equal up to different matchers

 theorem Iter.forIn_filterMapM
    [Monad n] [LawfulMonad n] [Monad o] [LawfulMonad o]
@@ -448,8 +449,9 @@ theorem Iter.forIn_filterMapM
        match ← f out with
        | some c => g c acc
        | none => return .yield acc) := by
-  simp +instances [filterMapM, forIn_eq_forIn_toIterM, IterM.forIn_filterMapM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]; rfl
+  simp [filterMapM, forIn_eq_forIn_toIterM, IterM.forIn_filterMapM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  rfl

 theorem Iter.forIn_filterMap
    [Monad n] [LawfulMonad n] [Finite α Id]
@@ -469,8 +471,8 @@ theorem Iter.forIn_mapWithPostcondition
    {g : β₂ → γ → o (ForInStep γ)} :
    forIn (it.mapWithPostcondition f) init g =
      forIn it init (fun out acc => do g (← (f out).run) acc) := by
-  simp +instances [mapWithPostcondition, forIn_eq_forIn_toIterM, IterM.forIn_mapWithPostcondition,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [mapWithPostcondition, forIn_eq_forIn_toIterM, IterM.forIn_mapWithPostcondition]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.forIn_mapM
    [Monad n] [LawfulMonad n] [Monad o] [LawfulMonad o]
@@ -498,8 +500,8 @@ theorem Iter.forIn_filterWithPostcondition
    haveI : MonadLift n o := ⟨monadLift⟩
    forIn (it.filterWithPostcondition f) init g =
      forIn it init (fun out acc => do if (← (f out).run).down then g out acc else return .yield acc) := by
-  simp +instances [filterWithPostcondition, forIn_eq_forIn_toIterM, IterM.forIn_filterWithPostcondition,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [filterWithPostcondition, forIn_eq_forIn_toIterM, IterM.forIn_filterWithPostcondition]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.forIn_filterM
    [Monad n] [LawfulMonad n] [Monad o] [LawfulMonad o]
@@ -508,8 +510,8 @@ theorem Iter.forIn_filterM
    [IteratorLoop α Id o] [LawfulIteratorLoop α Id o]
    {it : Iter (α := α) β} {f : β → n (ULift Bool)} {init : γ} {g : β → γ → o (ForInStep γ)} :
    forIn (it.filterM f) init g = forIn it init (fun out acc => do if (← f out).down then g out acc else return .yield acc) := by
-  simp +instances [filterM, forIn_eq_forIn_toIterM, IterM.forIn_filterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [filterM, forIn_eq_forIn_toIterM, IterM.forIn_filterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.forIn_filter
    [Monad n] [LawfulMonad n]
@@ -550,8 +552,9 @@ theorem Iter.foldM_filterMapM {α β γ δ : Type w}
      it.foldM (init := init) (fun d b => do
          let some c ← f b | pure d
          g d c) := by
-  simp +instances [filterMapM, IterM.foldM_filterMapM, foldM_eq_foldM_toIterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]; rfl
+  simp only [filterMapM, IterM.foldM_filterMapM, foldM_eq_foldM_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  rfl

 theorem Iter.foldM_mapWithPostcondition {α β γ δ : Type w}
    {n : Type w → Type w''} {o : Type w → Type w'''}
@@ -563,8 +566,8 @@ theorem Iter.foldM_mapWithPostcondition {α β γ δ : Type w}
    {f : β → PostconditionT n γ} {g : δ → γ → o δ} {init : δ} {it : Iter (α := α) β} :
    (it.mapWithPostcondition f).foldM (init := init) g =
      it.foldM (init := init) (fun d b => do let c ← (f b).run; g d c) := by
-  simp +instances [mapWithPostcondition, IterM.foldM_mapWithPostcondition, foldM_eq_foldM_toIterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [mapWithPostcondition, IterM.foldM_mapWithPostcondition, foldM_eq_foldM_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.foldM_mapM {α β γ δ : Type w}
    {n : Type w → Type w''} {o : Type w → Type w'''}
@@ -578,8 +581,8 @@ theorem Iter.foldM_mapM {α β γ δ : Type w}
    haveI : MonadLift n o := ⟨MonadLiftT.monadLift⟩
    (it.mapM f).foldM (init := init) g =
      it.foldM (init := init) (fun d b => do let c ← f b; g d c) := by
-  simp +instances [mapM, IterM.foldM_mapM, foldM_eq_foldM_toIterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [mapM, IterM.foldM_mapM, foldM_eq_foldM_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.foldM_filterWithPostcondition {α β δ : Type w}
    {n : Type w → Type w''} {o : Type w → Type w'''}
@@ -591,8 +594,8 @@ theorem Iter.foldM_filterWithPostcondition {α β δ : Type w}
    {f : β → PostconditionT n (ULift Bool)} {g : δ → β → o δ} {init : δ} {it : Iter (α := α) β} :
    (it.filterWithPostcondition f).foldM (init := init) g =
      it.foldM (init := init) (fun d b => do if (← (f b).run).down then g d b else pure d) := by
-  simp +instances [filterWithPostcondition, IterM.foldM_filterWithPostcondition, foldM_eq_foldM_toIterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [filterWithPostcondition, IterM.foldM_filterWithPostcondition, foldM_eq_foldM_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.foldM_filterM {α β δ : Type w}
    {n : Type w → Type w''} {o : Type w → Type w'''}
@@ -605,8 +608,8 @@ theorem Iter.foldM_filterM {α β δ : Type w}
    {f : β → n (ULift Bool)} {g : δ → β → o δ} {init : δ} {it : Iter (α := α) β} :
    (it.filterM f).foldM (init := init) g =
      it.foldM (init := init) (fun d b => do if (← f b).down then g d b else pure d) := by
-  simp +instances [filterM, IterM.foldM_filterM, foldM_eq_foldM_toIterM,
-    instMonadLiftTOfMonadLift_instMonadLiftTOfPure]
+  simp only [filterM, IterM.foldM_filterM, foldM_eq_foldM_toIterM]
+  rw [instMonadLiftTOfMonadLift_instMonadLiftTOfPure]

 theorem Iter.foldM_filterMap {α β γ δ : Type w} {n : Type w → Type w''}
    [Iterator α Id β] [Finite α Id] [Monad n] [LawfulMonad n]
--- a/src/Init/Data/Iterators/Lemmas/Combinators/FlatMap.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/FlatMap.lean
@@ -121,22 +121,22 @@ public theorem Iter.step_flatMapAfterM {α : Type w} {β : Type w} {α₂ : Type
    [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfulMonadAttach m] [Iterator α Id β] [Iterator α₂ m γ]
    {f : β → m (IterM (α := α₂) m γ)} {it₁ : Iter (α := α) β} {it₂ : Option (IterM (α := α₂) m γ)} :
  (it₁.flatMapAfterM f it₂).step = (do
-    match it₂ with
+    match hit : it₂ with
    | none =>
      match it₁.step with
      | .yield it₁' b h =>
        let fx ← MonadAttach.attach (f b)
-        return .deflate (.skip (it₁'.flatMapAfterM f (some fx.val)) (.outerYield_flatMapM_pure h fx.property))
-      | .skip it₁' h => return .deflate (.skip (it₁'.flatMapAfterM f none) (.outerSkip_flatMapM_pure h))
-      | .done h => return .deflate (.done (.outerDone_flatMapM_pure h))
+        return .deflate (.skip (it₁'.flatMapAfterM f (some fx.val)) (hit ▸ .outerYield_flatMapM_pure h fx.property))
+      | .skip it₁' h => return .deflate (.skip (it₁'.flatMapAfterM f it₂) (hit ▸ .outerSkip_flatMapM_pure h))
+      | .done h => return .deflate (.done (hit ▸ .outerDone_flatMapM_pure h))
    | some it₂ =>
      match (← it₂.step).inflate with
      | .yield it₂' out h =>
-        return .deflate (.yield (it₁.flatMapAfterM f (some it₂')) out (.innerYield_flatMapM_pure h))
+        return .deflate (.yield (it₁.flatMapAfterM f (some it₂')) out (hit ▸ .innerYield_flatMapM_pure h))
      | .skip it₂' h =>
-        return .deflate (.skip (it₁.flatMapAfterM f (some it₂')) (.innerSkip_flatMapM_pure h))
+        return .deflate (.skip (it₁.flatMapAfterM f (some it₂')) (hit ▸ .innerSkip_flatMapM_pure h))
      | .done h =>
-        return .deflate (.skip (it₁.flatMapAfterM f none) (.innerDone_flatMapM_pure h))) := by
+        return .deflate (.skip (it₁.flatMapAfterM f none) (hit ▸ .innerDone_flatMapM_pure h))) := by
  simp only [flatMapAfterM, IterM.step_flatMapAfterM, Iter.step_mapWithPostcondition,
    PostconditionT.operation_pure]
  split
@@ -232,7 +232,6 @@ public theorem Iter.toArray_flatMapM {α α₂ β γ : Type w} {m : Type w → T
    (it₁.flatMapM f).toArray = Array.flatten <$> (it₁.mapM fun b => do (← f b).toArray).toArray := by
  simp [flatMapM, toArray_flatMapAfterM]

-set_option backward.isDefEq.respectTransparency false in
 public theorem Iter.toList_flatMapAfter {α α₂ β γ : Type w} [Iterator α Id β] [Iterator α₂ Id γ]
    [Finite α Id] [Finite α₂ Id]
    {f : β → Iter (α := α₂) γ} {it₁ : Iter (α := α) β} {it₂ : Option (Iter (α := α₂) γ)} :
@@ -241,9 +240,9 @@ public theorem Iter.toList_flatMapAfter {α α₂ β γ : Type w} [Iterator α I
      | some it₂ => it₂.toList ++
          (it₁.map fun b => (f b).toList).toList.flatten := by
  simp only [flatMapAfter, Iter.toList, toIterM_toIter, IterM.toList_flatMapAfter]
-  cases it₂ <;> simp [map, IterM.toList_map_eq_toList_mapM, - IterM.toList_map]
+  unfold Iter.toList
+  cases it₂ <;> simp [map]

-set_option backward.isDefEq.respectTransparency false in
 public theorem Iter.toArray_flatMapAfter {α α₂ β γ : Type w} [Iterator α Id β] [Iterator α₂ Id γ]
    [Finite α Id] [Finite α₂ Id]
    {f : β → Iter (α := α₂) γ} {it₁ : Iter (α := α) β} {it₂ : Option (Iter (α := α₂) γ)} :
@@ -252,8 +251,10 @@ public theorem Iter.toArray_flatMapAfter {α α₂ β γ : Type w} [Iterator α
      | some it₂ => it₂.toArray ++
          (it₁.map fun b => (f b).toArray).toArray.flatten := by
  simp only [flatMapAfter, Iter.toArray, toIterM_toIter, IterM.toArray_flatMapAfter]
+  unfold Iter.toArray
  cases it₂ <;> simp [map, IterM.toArray_map_eq_toArray_mapM, - IterM.toArray_map]

+@[cbv_eval]
 public theorem Iter.toList_flatMap {α α₂ β γ : Type w} [Iterator α Id β] [Iterator α₂ Id γ]
    [Finite α Id] [Finite α₂ Id]
    [Iterator α Id β] [Iterator α₂ Id γ] [Finite α Id] [Finite α₂ Id]
@@ -261,6 +262,7 @@ public theorem Iter.toList_flatMap {α α₂ β γ : Type w} [Iterator α Id β]
    (it₁.flatMap f).toList = (it₁.map fun b => (f b).toList).toList.flatten := by
  simp [flatMap, toList_flatMapAfter]

+@[cbv_eval]
 public theorem Iter.toArray_flatMap {α α₂ β γ : Type w} [Iterator α Id β] [Iterator α₂ Id γ]
    [Finite α Id] [Finite α₂ Id]
    [Iterator α Id β] [Iterator α₂ Id γ] [Finite α Id] [Finite α₂ Id]
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Monadic.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Monadic.lean
@@ -6,6 +6,7 @@ Authors: Paul Reichert
 module

 prelude
+public import Init.Data.Iterators.Lemmas.Combinators.Monadic.Append
 public import Init.Data.Iterators.Lemmas.Combinators.Monadic.Attach
 public import Init.Data.Iterators.Lemmas.Combinators.Monadic.FilterMap
 public import Init.Data.Iterators.Lemmas.Combinators.Monadic.FlatMap
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/Append.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/Append.lean
@@ -0,0 +1,107 @@
+/-
+Copyright (c) 2026 Lean FRO, LLC. All rights reserved.
+Released under Apache 2.0 license as described in the file LICENSE.
+Authors: Paul Reichert
+-/
+module
+
+prelude
+public import Init.Data.Iterators.Combinators.Monadic.Append
+public import Init.Data.Iterators.Consumers.Monadic.Collect
+import Init.Data.Iterators.Lemmas.Consumers.Monadic.Collect
+import Init.Data.Iterators.Lemmas.Monadic.Basic
+import Init.Data.List.Lemmas
+import Init.Data.List.ToArray
+
+public section
+
+namespace Std
+open Std.Iterators Std.Iterators.Types
+
+variable {α₁ α₂ β : Type w} {m : Type w → Type w'}
+
+theorem IterM.step_append [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    (it₁.append it₂).step = (do
+      match (← it₁.step).inflate with
+      | .yield it₁' out h =>
+        pure <| .deflate <| .yield (it₁'.append it₂) out (.fstYield h)
+      | .skip it₁' h =>
+        pure <| .deflate <| .skip (it₁'.append it₂) (.fstSkip h)
+      | .done h =>
+        pure <| .deflate <| .skip (IterM.Intermediate.appendSnd α₁ it₂) (.fstDone h)) := by
+  simp only [append, Intermediate.appendSnd, step, Iterator.step]
+  apply bind_congr; intro step
+  cases step.inflate using PlausibleIterStep.casesOn <;> rfl
+
+theorem IterM.Intermediate.step_appendSnd [Monad m] [Iterator α₁ m β] [Iterator α₂ m β]
+    {it₂ : IterM (α := α₂) m β} :
+    (IterM.Intermediate.appendSnd α₁ it₂).step = (do
+      match (← it₂.step).inflate with
+      | .yield it₂' out h =>
+        pure <| .deflate <| .yield (IterM.Intermediate.appendSnd α₁ it₂') out (.sndYield h)
+      | .skip it₂' h =>
+        pure <| .deflate <| .skip (IterM.Intermediate.appendSnd α₁ it₂') (.sndSkip h)
+      | .done h =>
+        pure <| .deflate <| .done (.sndDone h)) := by
+  simp only [Intermediate.appendSnd, step, Iterator.step]
+  apply bind_congr; intro step
+  cases step.inflate using PlausibleIterStep.casesOn <;> rfl
+
+@[simp]
+theorem IterM.toList_appendSnd [Monad m] [LawfulMonad m]
+    [Iterator α₁ m β] [Iterator α₂ m β] [Finite α₁ m] [Finite α₂ m]
+    {it₂ : IterM (α := α₂) m β} :
+    (IterM.Intermediate.appendSnd α₁ it₂).toList = it₂.toList := by
+  induction it₂ using IterM.inductSteps with | step it₂ ihy ihs
+  rw [toList_eq_match_step (it := IterM.Intermediate.appendSnd α₁ it₂),
+      toList_eq_match_step (it := it₂)]
+  simp only [Intermediate.step_appendSnd, bind_assoc]
+  apply bind_congr; intro step
+  cases step.inflate using PlausibleIterStep.casesOn
+  · simp [ihy ‹_›]
+  · simp [ihs ‹_›]
+  · simp
+
+@[simp]
+theorem IterM.toList_append [Monad m] [LawfulMonad m]
+    [Iterator α₁ m β] [Iterator α₂ m β] [Finite α₁ m] [Finite α₂ m]
+    {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    (it₁.append it₂).toList = (do
+      let l₁ ← it₁.toList
+      let l₂ ← it₂.toList
+      pure (l₁ ++ l₂)) := by
+  induction it₁ using IterM.inductSteps with | step it₁ ihy ihs
+  rw [toList_eq_match_step (it := it₁.append it₂), toList_eq_match_step (it := it₁)]
+  simp only [step_append, bind_assoc]
+  apply bind_congr; intro step
+  cases step.inflate using PlausibleIterStep.casesOn
+  · simp [ihy ‹_›, - bind_pure_comp]
+  · simp [ihs ‹_›]
+  · simp [toList_appendSnd, - bind_pure_comp]
+
+@[simp]
+theorem IterM.toListRev_append [Monad m] [LawfulMonad m]
+    [Iterator α₁ m β] [Iterator α₂ m β] [Finite α₁ m] [Finite α₂ m]
+    {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    (it₁.append it₂).toListRev = (do
+      let l₁ ← it₁.toListRev
+      let l₂ ← it₂.toListRev
+      pure (l₂ ++ l₁)) := by
+  rw [toListRev_eq (it := it₁.append it₂), toList_append,
+      toListRev_eq (it := it₁), toListRev_eq (it := it₂)]
+  simp [map_bind, bind_pure_comp, List.reverse_append]
+
+@[simp]
+theorem IterM.toArray_append [Monad m] [LawfulMonad m]
+    [Iterator α₁ m β] [Iterator α₂ m β] [Finite α₁ m] [Finite α₂ m]
+    {it₁ : IterM (α := α₁) m β} {it₂ : IterM (α := α₂) m β} :
+    (it₁.append it₂).toArray = (do
+      let a₁ ← it₁.toArray
+      let a₂ ← it₂.toArray
+      pure (a₁ ++ a₂)) := by
+  rw [← toArray_toList (it := it₁.append it₂), toList_append,
+      ← toArray_toList (it := it₁), ← toArray_toList (it := it₂)]
+  simp [map_bind, - bind_pure_comp, ← List.toArray_appendList, - toArray_toList]
+
+end Std
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/FilterMap.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/FilterMap.lean
@@ -374,7 +374,6 @@ theorem IterM.toList_map_eq_toList_filterMapM {α β γ : Type w} {m : Type w
  simp [toList_map_eq_toList_mapM, toList_mapM_eq_toList_filterMapM]
  congr <;> simp

-set_option backward.whnf.reducibleClassField false in
 /--
 Variant of `toList_filterMapWithPostcondition_filterMapWithPostcondition` that is intended to be
 used with the `apply` tactic. Because neither the LHS nor the RHS determine all implicit parameters,
@@ -395,7 +394,7 @@ private theorem IterM.toList_filterMapWithPostcondition_filterMapWithPostconditi
      (it.filterMapWithPostcondition (n := o) fg).toList := by
  induction it using IterM.inductSteps with | step it ihy ihs
  letI : MonadLift n o := ⟨monadLift⟩
-  haveI : LawfulMonadLift n o := ⟨by simp +instances [this], by simp +instances [this]⟩
+  haveI : LawfulMonadLift n o := ⟨LawfulMonadLiftT.monadLift_pure, LawfulMonadLiftT.monadLift_bind⟩
  rw [toList_eq_match_step, toList_eq_match_step, step_filterMapWithPostcondition,
    bind_assoc, step_filterMapWithPostcondition, step_filterMapWithPostcondition]
  simp only [bind_assoc, liftM_bind]
@@ -602,7 +601,6 @@ theorem IterM.toList_map_mapM {α β γ δ : Type w}
    toList_filterMapM_mapM]
  congr <;> simp

-set_option backward.isDefEq.respectTransparency false in
 @[simp]
 theorem IterM.toList_filterMapWithPostcondition {α β γ : Type w} {m : Type w → Type w'}
    [Monad m] [LawfulMonad m]
@@ -626,7 +624,6 @@ theorem IterM.toList_filterMapWithPostcondition {α β γ : Type w} {m : Type w
  · simp [ihs ‹_›, heq]
  · simp [heq]

-set_option backward.isDefEq.respectTransparency false in
 @[simp]
 theorem IterM.toList_mapWithPostcondition {α β γ : Type w} {m : Type w → Type w'}
    [Monad m] [LawfulMonad m] [Iterator α Id β] [Finite α Id]
@@ -647,25 +644,25 @@ theorem IterM.toList_mapWithPostcondition {α β γ : Type w} {m : Type w → Ty
  · simp [ihs ‹_›, heq]
  · simp [heq]

-set_option backward.isDefEq.respectTransparency false in
 @[simp]
 theorem IterM.toList_filterMapM {α β γ : Type w} {m : Type w → Type w'}
    [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfulMonadAttach m]
    [Iterator α Id β] [Finite α Id]
    {f : β → m (Option γ)} (it : IterM (α := α) Id β) :
    (it.filterMapM f).toList = it.toList.run.filterMapM f := by
-  simp [toList_filterMapM_eq_toList_filterMapWithPostcondition, toList_filterMapWithPostcondition,
-    PostconditionT.attachLift, PostconditionT.run_eq_map, WeaklyLawfulMonadAttach.map_attach]
+  simp only [toList_filterMapM_eq_toList_filterMapWithPostcondition,
+    toList_filterMapWithPostcondition, PostconditionT.run_eq_map]
+  simp [PostconditionT.attachLift, WeaklyLawfulMonadAttach.map_attach]

-set_option backward.isDefEq.respectTransparency false in
 @[simp]
 theorem IterM.toList_mapM {α β γ : Type w} {m : Type w → Type w'}
    [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfulMonadAttach m]
    [Iterator α Id β] [Finite α Id]
    {f : β → m γ} (it : IterM (α := α) Id β) :
    (it.mapM f).toList = it.toList.run.mapM f := by
-  simp [toList_mapM_eq_toList_mapWithPostcondition, toList_mapWithPostcondition,
-    PostconditionT.attachLift, PostconditionT.run_eq_map, WeaklyLawfulMonadAttach.map_attach]
+  simp only [toList_mapM_eq_toList_mapWithPostcondition, toList_mapWithPostcondition,
+    PostconditionT.run_eq_map]
+  simp [PostconditionT.attachLift, WeaklyLawfulMonadAttach.map_attach]

 @[simp]
 theorem IterM.toList_filterMap {α β γ : Type w} {m : Type w → Type w'}
@@ -702,18 +699,16 @@ theorem IterM.toList_map {α β β' : Type w} {m : Type w → Type w'} [Monad m]
    (it : IterM (α := α) m β) :
    (it.map f).toList = (fun x => x.map f) <$> it.toList := by
  rw [← List.filterMap_eq_map, ← toList_filterMap]
-  let t := type_of% (it.map f)
-  let t' := type_of% (it.filterMap (some ∘ f))
+  simp only [map, mapWithPostcondition, InternalCombinators.map, filterMap,
+    filterMapWithPostcondition, InternalCombinators.filterMap]
+  unfold Map
  congr
-  · simp [Map]
-  · simp [Map.instIterator, inferInstanceAs]
+  · simp
+  · rw [Map.instIterator_eq_filterMapInstIterator]
    congr
    simp
-  · simp only [map, mapWithPostcondition, InternalCombinators.map, Function.comp_apply, filterMap,
-    filterMapWithPostcondition, InternalCombinators.filterMap]
-    congr
-    · simp [Map]
-    · simp
+  · simp
+  · simp

 @[simp]
 theorem IterM.toList_filter {α : Type w} {m : Type w → Type w'} [Monad m] [LawfulMonad m]
@@ -1303,7 +1298,6 @@ theorem IterM.forIn_filterMap
  rw [filterMap, forIn_filterMapWithPostcondition]
  simp [PostconditionT.run_eq_map]

-set_option backward.isDefEq.respectTransparency false in
 theorem IterM.forIn_mapWithPostcondition
    [Monad m] [LawfulMonad m] [Monad n] [LawfulMonad n] [Monad o] [LawfulMonad o]
    [MonadLiftT m n] [LawfulMonadLiftT m n] [MonadLiftT n o] [LawfulMonadLiftT n o]
@@ -1314,9 +1308,10 @@ theorem IterM.forIn_mapWithPostcondition
    haveI : MonadLift n o := ⟨monadLift⟩
    forIn (it.mapWithPostcondition f) init g =
      forIn it init (fun out acc => do g (← (f out).run) acc) := by
-  rw [mapWithPostcondition, InternalCombinators.map, ← InternalCombinators.filterMap,
-    ← filterMapWithPostcondition, forIn_filterMapWithPostcondition]
-  simp [PostconditionT.run_eq_map]
+  unfold mapWithPostcondition InternalCombinators.map Map.instIteratorLoop Map
+  rw [Map.instIterator_eq_filterMapInstIterator]
+  rw [← InternalCombinators.filterMap, ← filterMapWithPostcondition, forIn_filterMapWithPostcondition]
+  simp

 theorem IterM.forIn_mapM
    [Monad m] [LawfulMonad m] [Monad n] [LawfulMonad n] [Monad o] [LawfulMonad o]
@@ -1480,7 +1475,7 @@ theorem IterM.foldM_filterM {α β δ : Type w}
  simp [filterM, foldM_filterMapWithPostcondition, PostconditionT.run_attachLift]
  congr 1; ext out acc
  apply bind_congr; intro fx
-  cases fx.down <;> simp [PostconditionT.run_eq_map]
+  cases fx.down <;> simp

 theorem IterM.foldM_filterMap {α β γ δ : Type w} {m : Type w → Type w'} {n : Type w → Type w''}
    [Iterator α m β] [Finite α m] [Monad m] [Monad n] [LawfulMonad m] [LawfulMonad n]
--- a/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/FlatMap.lean
+++ b/src/Init/Data/Iterators/Lemmas/Combinators/Monadic/FlatMap.lean
@@ -21,14 +21,14 @@ open Std.Internal Std.Iterators
 theorem IterM.step_flattenAfter {α α₂ β : Type w} {m : Type w → Type w'} [Monad m]
    [Iterator α m (IterM (α := α₂) m β)] [Iterator α₂ m β]
    {it₁ : IterM (α := α) m (IterM (α := α₂) m β)} {it₂ : Option (IterM (α := α₂) m β)} :
-  (it₁.flattenAfter it₂).step = (do
+  (it₁.flattenAfter it₂).step = (
    match it₂ with
-    | none =>
+    | none => do
      match (← it₁.step).inflate with
      | .yield it₁' it₂' h => return .deflate (.skip (it₁'.flattenAfter (some it₂')) (.outerYield h))
      | .skip it₁' h => return .deflate (.skip (it₁'.flattenAfter none) (.outerSkip h))
      | .done h => return .deflate (.done (.outerDone h))
-    | some it₂ =>
+    | some it₂ => do
      match (← it₂.step).inflate with
      | .yield it₂' out h => return .deflate (.yield (it₁.flattenAfter (some it₂')) out (.innerYield h))
      | .skip it₂' h => return .deflate (.skip (it₁.flattenAfter (some it₂')) (.innerSkip h))
@@ -130,16 +130,16 @@ public theorem IterM.step_flatMapAfterM {α : Type w} {β : Type w} {α₂ : Typ
    {γ : Type w} {m : Type w → Type w'} [Monad m] [MonadAttach m] [LawfulMonad m] [WeaklyLawfulMonadAttach m]
    [Iterator α m β] [Iterator α₂ m γ] {f : β → m (IterM (α := α₂) m γ)} {it₁ : IterM (α := α) m β}
    {it₂ : Option (IterM (α := α₂) m γ)} :
-  (it₁.flatMapAfterM f it₂).step = (do
+  (it₁.flatMapAfterM f it₂).step = (
    match it₂ with
-    | none =>
+    | none => do
      match (← it₁.step).inflate with
      | .yield it₁' b h =>
        let fx ← MonadAttach.attach (f b)
        return .deflate (.skip (it₁'.flatMapAfterM f (some fx.val)) (.outerYield_flatMapM h fx.property))
      | .skip it₁' h => return .deflate (.skip (it₁'.flatMapAfterM f none) (.outerSkip_flatMapM h))
      | .done h => return .deflate (.done (.outerDone_flatMapM h))
-    | some it₂ =>
+    | some it₂ => do
      match (← it₂.step).inflate with
      | .yield it₂' out h => return .deflate (.yield (it₁.flatMapAfterM f (some it₂')) out (.innerYield_flatMapM h))
      | .skip it₂' h => return .deflate (.skip (it₁.flatMapAfterM f (some it₂')) (.innerSkip_flatMapM h))
@@ -171,15 +171,15 @@ public theorem IterM.step_flatMapM {α : Type w} {β : Type w} {α₂ : Type w}
 public theorem IterM.step_flatMapAfter {α : Type w} {β : Type w} {α₂ : Type w}
    {γ : Type w} {m : Type w → Type w'} [Monad m] [LawfulMonad m] [Iterator α m β] [Iterator α₂ m γ]
    {f : β → IterM (α := α₂) m γ} {it₁ : IterM (α := α) m β} {it₂ : Option (IterM (α := α₂) m γ)} :
-  (it₁.flatMapAfter f it₂).step = (do
+  (it₁.flatMapAfter f it₂).step = (
    match it₂ with
-    | none =>
+    | none => do
      match (← it₁.step).inflate with
      | .yield it₁' b h =>
        return .deflate (.skip (it₁'.flatMapAfter f (some (f b))) (.outerYield_flatMap h))
      | .skip it₁' h => return .deflate (.skip (it₁'.flatMapAfter f none) (.outerSkip_flatMap h))
      | .done h => return .deflate (.done (.outerDone_flatMap h))
-    | some it₂ =>
+    | some it₂ => do
      match (← it₂.step).inflate with
      | .yield it₂' out h => return .deflate (.yield (it₁.flatMapAfter f (some it₂')) out (.innerYield_flatMap h))
      | .skip it₂' h => return .deflate (.skip (it₁.flatMapAfter f (some it₂')) (.innerSkip_flatMap h))