Add diff mode

2025-12-18 14:59:51 +11:00 · 2025-12-18 14:59:51 +11:00 · b5d2b99174
commit b5d2b99174
parent 55e50ebf59
6 changed files with 1131 additions and 220 deletions
--- a/README.md
+++ b/README.md
@ -4,217 +4,175 @@
  <img src="https://git.mig5.net/mig5/enroll/raw/branch/main/enroll.svg" alt="Enroll logo" width="240" />
 </div>

-**enroll** inspects a Linux machine (currently Debian-only) and generates Ansible roles for things it finds running on the machine.
+**enroll** inspects a Linux machine (currently Debian-only) and generates Ansible roles/playbooks (and optionally inventory) for what it finds.

 It aims to be **optimistic and noninteractive**:
- Detects packages that have been installed
- Detects Debian package ownership of `/etc` files using dpkg's local database.
+- Detects packages that have been installed.
+- Detects Debian package ownership of `/etc` files using dpkg’s local database.
 - Captures config that has **changed from packaged defaults** (dpkg conffile hashes + package md5sums when available).
 - Also captures **service-relevant custom/unowned files** under `/etc/<service>/...` (e.g. drop-in config includes).
 - Defensively excludes likely secrets (path denylist + content sniff + size caps).
- Captures non-system users that exist on the system, and their SSH public keys
- Captures miscellaneous `/etc` files that it can't attribute to a package, and installs it in an `etc_custom` role
- Avoids trying to start systemd services that were detected as being Inactive during harvest
+- Captures non-system users and their SSH public keys.
+- Captures miscellaneous `/etc` files it can’t attribute to a package and installs them in an `etc_custom` role.
+- Avoids trying to start systemd services that were detected as inactive during harvest.

 ---

-# Two modes: single-site vs multi-site (`--fqdn`)
+## Mental model

-**enroll** has two distinct ways to generate Ansible:
+`enroll` works in two phases:

-## 1) Single-site mode (default: *no* `--fqdn`)
-Use this when you're enrolling **one server** (or you're generating a "golden" role set you intend to reuse).
+1) **Harvest**: collect host facts + relevant files into a harvest bundle (`state.json` + harvested artifacts)
+2) **Manifest**: turn that harvest into Ansible roles/playbooks (and optionally inventory)

-**What you get**
- Config, templates, and defaults are primarily **contained inside each role**.
- Raw config files (when not templated) live in the role's `files/`.
- Template variables (when templated) live in the role's `defaults/main.yml`.
+Additionally:

-**Pros**
- Roles are more **self-contained** and easier to understand.
- Better starting point for **provisioning new servers**, because the role contains most of what it needs.
- Less inventory abstraction/duplication.
+- **Diff**: compare two harvests and report what changed (packages/services/users/files) since the previous snapshot.

-**Cons**
- Less convenient for quickly enrolling multiple hosts with divergent configs (you'll do more manual work to make roles flexible across hosts).
+---

-## 2) Multi-site mode (`--fqdn`)
-Use this when you want to enroll **several existing servers** quickly, especially if they differ.
+## Output modes: single-site vs multi-site (`--fqdn`)

-**What you get**
- Roles are **shared** across hosts, but host-specific data lives in inventory.
- Host inventory drives what's managed:
-  - which files to deploy for that host
-  - which packages are relevant for that host
-  - which services should be enabled/started for that host
- For non-templated config, raw files live in host-specific inventory under `.files/` (per role).
+`enroll manifest` (and `enroll single-shot`) support two distinct output styles.

-**Pros**
- Fastest way to retrofit **multiple servers** into config management.
- Avoids shared-role "host A breaks host B" problems by keeping host-specific state in inventory.
- Better fit when you already have a fleet and want to capture/reflect reality first.
+### Single-site mode (default: *no* `--fqdn`)
+Use when enrolling **one server** (or generating a “golden” role set you intend to reuse).

-**Cons**
- More abstraction: roles become more "data-driven".
- Potential duplication: raw files may exist per-host in inventory (even if identical).
- Harder to use the roles to **provision a brand-new server** without also building an inventory for that new host, because multi-site output assumes the server already exists and is being retrofitted.
+**Characteristics**
+- Roles are more self-contained.
+- Raw config files live in the role’s `files/`.
+- Template variables live in the role’s `defaults/main.yml`.
+
+### Multi-site mode (`--fqdn`)
+Use when enrolling **several existing servers** quickly, especially if they differ.
+
+**Characteristics**
+- Roles are shared, host-specific state lives in inventory.
+- Host inventory drives what gets managed (files/packages/services).
+- Non-templated raw files live per-host under `inventory/host_vars/<fqdn>/<role>/.files/...`.

 **Rule of thumb**
- If your goal is *"make this one server reproducible / provisionable"* → start with **single-site**.
- If your goal is *"get several already-running servers under management quickly"* → use **multi-site**.
+- “Make this one server reproducible/provisionable” → start with **single-site**
+- “Get multiple already-running servers under management quickly” → use **multi-site**

 ---

-# Key concepts
+## Subcommands

-## Harvest
+### `enroll harvest`
+Harvest state about a host and write a harvest bundle.

-**enroll** begins by 'harvesting' known state about your host. This includes detecting what running services exist, what packages have been installed 'manually' (that is, stuff that doesn't come out of the box with the OS), and anything 'custom' in `/etc` that it can't attribute to a specific package.
+**What it captures (high level)**
+- Detected services + service-relevant packages
+- “Manual” packages
+- Changed-from-default config (plus related custom/unowned files under service dirs)
+- Non-system users + SSH public keys
+- Misc `/etc` that can’t be attributed to a package (`etc_custom` role)

-It also detects if any config files have been *changed* from their packaged defaults. If they have, it will attempt to 'harvest' them. If the config file is identical to how it comes with the package, then it doesn't bother harvesting it, because there's little value in config-managing it if it's identical to what you get by simply installing the package!
+**Common flags**
+- Remote harvesting:
+  - `--remote-host`, `--remote-user`, `--remote-port`
+  - `--no-sudo` (if you don’t want/need sudo)
+- Sensitive-data behaviour:
+  - default: tries to avoid likely secrets
+  - `--dangerous`: disables secret-safety checks (see “Sensitive data” below)
+- Encrypt bundles at rest:
+  - `--sops <FINGERPRINT...>`: writes a single encrypted `harvest.tar.gz.sops` instead of a plaintext directory

-The harvest writes a state.json file explaining all the data it harvested and, if it chose not to harvest something, explanations as to why that is the case (see below: sensitive data).
+---

-### Remote harvesting (workstation → remote)
+### `enroll manifest`
+Generate Ansible output from an existing harvest bundle.

-If you'd prefer not to install **enroll** on the target host, you can run the harvest over SSH from your workstation and pull the harvest bundle back locally:
+**Inputs**
+- `--harvest /path/to/harvest` (directory)
+  or `--harvest /path/to/harvest.tar.gz.sops` (if using `--sops`)

-```bash
-enroll harvest --remote-host myhost.example.com --remote-user myuser --out /tmp/enroll-harvest
-```
+**Output**
+- In plaintext mode: an Ansible repo-like directory structure (roles/playbooks, and inventory in multi-site mode).
+- In `--sops` mode: a single encrypted file `manifest.tar.gz.sops` containing the generated output.

- `--remote-port` defaults to `22`
- `--remote-user` defaults to your local `$USER`
+**Common flags**
+- `--fqdn <host>`: enables **multi-site** output style

-This uploads a self-contained `enroll` zipapp to a temporary directory on the remote host, runs `harvest` there, then downloads the resulting harvest bundle to the `--out` directory on your workstation.
+---

-**Privilege note:** A "full" harvest typically needs root access. Remote harvesting assumes the remote user can run `sudo` **without a password prompt** (NOPASSWD) so the harvest can run non-interactively. If you don't want this, pass `--no-sudo` as well.
+### `enroll single-shot`
+Convenience wrapper that runs **harvest → manifest** in one command.
+
+Use this when you want “get me something workable ASAP”.
+
+Supports the same general flags as harvest/manifest, including `--fqdn`, remote harvest flags, and `--sops`.
+
+---
+
+### `enroll diff`
+Compare two harvest bundles and report what changed.
+
+**What it reports**
+- Packages added/removed
+- Services enabled added/removed, plus key state changes
+- Users added/removed, plus field changes (uid/gid/home/shell/groups, etc.)
+- Managed files added/removed/changed (metadata + content hash changes where available)
+
+**Inputs**
+- `--old <harvest>` and `--new <harvest>` (directories or `state.json` paths)
+- `--sops` when comparing SOPS-encrypted harvest bundles
+
+**Output formats**
+- `--format json` (default for webhooks)
+- `--format markdown` / `--format text` (human-oriented)
+
+**Notifications**
+- Webhook:
+  - `--webhook <url>`
+  - `--webhook-format json|markdown|text`
+  - `--webhook-header 'Header-Name: value'` (repeatable)
+- Email (optional):
+  - `--email-to <addr>` (plus optional SMTP/sendmail-related flags, depending on your install)
+
+---

 ## Sensitive data

-**enroll** doesn't make any assumptions about how you might handle sensitive data from your config files, in Ansible. Some people might use SOPS, others might use Vault, others might do something else entirely.
+By default, `enroll` does **not** assume how you handle secrets in Ansible. It will attempt to avoid harvesting likely sensitive data (private keys, passwords, tokens, etc.). This can mean it skips some config files you may ultimately want to manage.

-For this reason, **enroll** will attempt to read config files, and if it detects data that looks like a sensitive SSH/SSL private key, or password, or API key, etc, then it won't harvest it for config management.
+If you opt in to collecting everything:

-This inevitably means that it will deliberately miss some important config files that you probably *want* to manage in Ansible.
+### `--dangerous`
+**WARNING:** disables “likely secret” safety checks. This can copy private keys, TLS key material, API tokens, database passwords, and other credentials into the harvest output **in plaintext**.

-Nonetheless, in the Harvest 'state' file, there should be an explanation of 'excluded files'. You can parse or inspect this file to find what it chose to ignore, and then you know what you might want to augment the results with later, once you 'manifest' the harvest into Ansible configuration.
+If you intend to keep harvests/manifests long-term (especially in git), strongly consider encrypting them at rest.

-Nonetheless, in some cases it may be appropriate to truly grab as much as you can, including secrets. For that, read on for the `--dangerous` flag.
+### Encrypt bundles at rest with `--sops`
+`--sops` encrypts the harvest and/or manifest outputs into a single `.tar.gz.sops` file (GPG). This is for **storage-at-rest**, not for direct “Ansible SOPS inventory” workflows.

-### Opting in to fetching sensitive data: `--dangerous`
-
-**WARNING:** `--dangerous` disables enroll's "likely a secret" safety checks. This can cause private keys, TLS key material, API tokens, database passwords, and other credentials to be copied into your harvest output **in plaintext**.
-
-Only use `--dangerous` if you explicitly want to scoop up sensitive files and you understand where the harvest output is stored, who can read it, and how it will be handled (backups, git commits, etc, as well as risk of using `--out` with a shared `/tmp` location where other users could see the data). We offer no liability if your sensitive data is compromised through the use of this tool!
-
-**Strong recommendation:** If you plan to keep harvested files long-term (especially in git), encrypt secrets at rest. A common approach is to use **SOPS** and then use the **community.sops** Ansible collection to load/decrypt encrypted content during deploy.
-
-Install the collection:
-
-```bash
-ansible-galaxy collection install community.sops
-```
-
-Then you can use the collection's lookup/vars plugins or modules to decrypt or load SOPS-encrypted vars at runtime.
-
-Note the section below **also** talks about SOPS, but this is in the context of simply encrypting the data generated by `enroll` at rest for safe-keeping, **not** for direct integration with Ansible.
-
-
-### Encrypting harvest/manifests at rest with `--sops`
-
-If you want to use `--dangerous` (or you simply want to keep the harvested artifacts private when they're sitting on disk, in git, etc), you can pass `--sops` to `harvest`, `manifest`, or `single-shot`.
-
-To use `--sops`, you will need to have [sops](https://github.com/getsops/sops) installed on your `$PATH`.
-
- `--sops` expects one or more **GPG key fingerprints**. If `sops` is not on the `$PATH`, **enroll** will error.
- `harvest --sops ...` writes a *single* encrypted file (`harvest.tar.gz.sops`) instead of a plaintext directory.
- `manifest --sops ...` (and `single-shot --sops ...`) will:
-  - decrypt the harvest bundle with `sops -d` (if the `--harvest` input is an encrypted file), then generate manifests as normal
-  - bundle the entire generated Ansible output into a *single* encrypted file (`manifest.tar.gz.sops`)
-
-⚠️ **Important:** `manifest --sops` (and `single-shot --sops`) produces **one encrypted file**. It is **not** an Ansible repo you can point `ansible-playbook` at directly. It is **not** the same as using SOPS inventory with the Ansible SOPS collection.
-
-To use the encrypted SOPS manifest, decrypt and extract it first, then run Ansible from inside the extracted `manifest/` directory:
-
-```bash
-sops -d /path/to/manifest.tar.gz.sops | tar -xzvf -
-cd manifest
-ansible-playbook ...
-```
-
-Example:
-
-```bash
-# Harvest (encrypted-at-rest)
-enroll harvest --out /tmp/enroll-harvest --dangerous --sops <FINGERPRINT(s)>
-
-# Manifest (encrypted-at-rest)
-enroll manifest --harvest /tmp/enroll-harvest/harvest.tar.gz.sops --out /tmp/enroll-ansible --sops <FINGERPRINT(s)>
-
-# Decrypt/extract manifest output for inspection / ansible runs
-cd /tmp/enroll-ansible
-sops -d manifest.tar.gz.sops | tar -xzvf -
-cd manifest
-```
-
-(If you want to manually inspect an encrypted harvest bundle, extract it into its own directory, e.g. `mkdir -p harvest && sops -d harvest.tar.gz.sops | tar -xzvf - -C harvest`.)
-
-
-
-## Manifest
-
-The 'manifest' subcommand expects to be given a path to the 'harvest' obtained in the first step. It will then attempt to generate Ansible roles and playbooks (and potentially 'inventory') from that harvest.
-
-Manifesting is the most complex step because a lot of people will have opinions on how Ansible roles and inventory should work. No solution is perfect for everyone. However, **enroll** tries to strike a reasonable balance.
-
-Remember, the purpose of this tool is to save **time** getting your systems into a decently-managed state. It's still up to you to wrangle it into a form that works for you on an ongoing basis.
+⚠️ Important: `manifest --sops` produces one encrypted file. You must decrypt + extract it before running `ansible-playbook`.

 ---

-# Single-shot mode for the impatient sysadmin
+## JinjaTurtle integration (both modes)

-**enroll** has a 'single-shot' subcommand which combines the two other phases (harvest and manifest) into one. Use it to generate both the harvest and then manifest ansible from that harvest all in one go. Perfect if you're in a hurry!
+If [JinjaTurtle](https://git.mig5.net/mig5/jinjaturtle) is installed, `enroll` can generate Jinja2 templates for ini/json/xml/toml-style config.

---
-
-# JinjaTurtle integration (both modes)
-
-If you also have my other tool [JinjaTurtle](https://git.mig5.net/mig5/jinjaturtle) installed, **enroll** will attempt to create Jinja2 templates for any ini/json/xml/toml style configuration that it finds.
-
- Templates live in the **role** (`roles/<role>/templates/...`)
+- Templates live in `roles/<role>/templates/...`
 - Variables live in:
-  - **single-site**: `roles/<role>/defaults/main.yml`
-  - **multi-site** (`--fqdn`): `inventory/host_vars/<fqdn>/<role>.yml`
+  - single-site: `roles/<role>/defaults/main.yml`
+  - multi-site: `inventory/host_vars/<fqdn>/<role>.yml`

-JinjaTurtle will be used automatically if it is detected on the `$PATH`. You can also be explicit and pass `--jinjaturtle`, but this will throw an error if JinjaTurtle is not on the `$PATH`.
-
-If you *do* have JinjaTurtle installed, but *don't* wish to make use of it, you can use `--no-jinjaturtle`, in which case all config files will be kept as 'raw' files.
+You can force it on with `--jinjaturtle` or disable with `--no-jinjaturtle`.

 ---

-# How multi-site avoids "shared role breaks a host"
+## How multi-site avoids “shared role breaks a host”

-In multi-site mode, **roles are data-driven**. The role contains generic tasks like:
-
- "deploy all files listed for this host"
- "install packages listed for this host"
- "apply systemd enable/start state listed for this host"
-
-The host inventory is what decides which files/packages/services apply to that host. This prevents the classic failure mode where host2 adds a config file to a shared role and host1 then fails trying to deploy a file it never had.
-
-Raw non-templated files are stored under:
-
- `inventory/host_vars/<fqdn>/<role>/.files/...`
-
-…and the host's role variables describe which of those files should be deployed.
+In multi-site mode, roles are **data-driven**. The role tasks are generic (“deploy the files listed for this host”, “install the packages listed for this host”, “apply systemd enable/start state listed for this host”). Host inventory decides what applies per-host, avoiding the classic “host2 adds config, host1 breaks” failure mode.

 ---

 # Install

 ## Ubuntu/Debian apt repository
-
 ```bash
 sudo mkdir -p /usr/share/keyrings
 curl -fsSL https://mig5.net/static/mig5.asc | sudo gpg --dearmor -o /usr/share/keyrings/mig5.gpg
@ -224,25 +182,19 @@ sudo apt install enroll
 ```

 ## AppImage
-
-Download the AppImage file from the Releases page (verify with GPG if you wish, my fingerprint is [here](https://mig5.net/static/mig5.asc)),
-then make it executable and run it:
+Download it from my Releases page, then:

 ```bash
 chmod +x Enroll.AppImage
 ./Enroll.AppImage
 ```

-### Pip/PipX
-
+## Pip/PipX
 ```bash
 pip install enroll
 ```

-### Poetry
-
-Clone this repository with git, then:
-
+## Poetry (dev)
 ```bash
 poetry install
 poetry run enroll --help
@ -250,110 +202,110 @@ poetry run enroll --help

 ---

-# Usage
+## Found a bug / have a suggestion?

-## 1. Harvest state/information about the host
+My Forgejo doesn’t currently support federation, so I haven’t opened registration/login for issues.

-On the host (root recommended to harvest as much data as possible):
+Instead, email me (see `pyproject.toml`) or contact me on the Fediverse:

+https://goto.mig5.net/@mig5
+
+---
+
+# Examples
+
+## Harvest
+
+### Local harvest
 ```bash
 enroll harvest --out /tmp/enroll-harvest
 ```
-### Remote harvest over SSH (no enroll install required on the remote host, no need for --out)

+### Remote harvest over SSH
 ```bash
-enroll harvest --remote-host myhost.example.com --remote-user myuser
+enroll harvest --remote-host myhost.example.com --remote-user myuser --out /tmp/enroll-harvest
 ```

-### `--dangerous` (captures potentially sensitive files — read the warning above)
-
+### `--dangerous`
 ```bash
 enroll harvest --out /tmp/enroll-harvest --dangerous
 ```

-Remote + dangerous:
-
+### Remote + dangerous:
 ```bash
 enroll harvest --remote-host myhost.example.com --remote-user myuser --dangerous
 ```

-### `--sops` (encrypt bundles at rest)
-
-`--sops` bundles and encrypts the output as a single SOPS-encrypted `.tar.gz.sops` file (GPG). This is particularly useful if you're using `--dangerous`.
-
+### `--sops` (encrypt at rest)
 ```bash
 # Encrypted harvest bundle (writes /tmp/enroll-harvest/harvest.tar.gz.sops)
 enroll harvest --out /tmp/enroll-harvest --dangerous --sops <FINGERPRINT(s)>
+```

-# Encrypted manifest bundle (writes /tmp/enroll-ansible/manifest.tar.gz.sops)
+---
+
+## Manifest
+
+### Single-site (default: no --fqdn)
+```bash
+enroll manifest --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible
+```
+
+### Multi-site (--fqdn)
+```bash
+enroll manifest --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible --fqdn "$(hostname -f)"
+```
+
+### Manifest with `--sops`
+```bash
+# Generate encrypted manifest bundle (writes /tmp/enroll-ansible/manifest.tar.gz.sops)
 enroll manifest --harvest /tmp/enroll-harvest/harvest.tar.gz.sops --out /tmp/enroll-ansible --sops <FINGERPRINT(s)>

 # Decrypt/extract the manifest bundle, then run Ansible from inside ./manifest/
 cd /tmp/enroll-ansible
 sops -d manifest.tar.gz.sops | tar -xzvf -
 cd manifest
-ansible-playbook ./playbook.yml
 ```

-
-## 2. Generate Ansible manifests (roles/playbook) from that harvest
-
-### Single-site (default: no --fqdn)
-
-Good for one server, or for producing roles you want to reuse to provision new machines:
-
-```bash
-enroll manifest --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible
-```
-
-### Multi-site (--fqdn)
-
-Best when enrolling multiple already-running servers into one repo:
-
-```bash
-enroll manifest --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible --fqdn "$(hostname -f)"
-```
+---

 ## Single-shot

-Alternatively, do both steps in one shot:
-
 ```bash
 enroll single-shot --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible --fqdn "$(hostname -f)"
 ```
-Remote single-shot (run harvest over SSH, then manifest locally):

+Remote single-shot (run harvest over SSH, then manifest locally):
 ```bash
-enroll single-shot --remote-host myhost.example.com --remote-user myuser --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible --fqdn "myhost.example.com"
+enroll single-shot --remote-host myhost.example.com --remote-user myuser   --harvest /tmp/enroll-harvest --out /tmp/enroll-ansible --fqdn "myhost.example.com"
 ```

-In multi-site mode (`--fqdn`), you can run single-shot repeatedly against multiple hosts while reusing the same `--out` directory so each host merges into the existing Ansible repo.
+---

+## Diff

-## 3. Run Ansible
+### Compare two harvest directories
+```bash
+enroll diff --old /path/to/harvestA --new /path/to/harvestB --format json
+```
+
+### Diff + webhook notify
+```bash
+enroll diff   --old /path/to/golden/harvest   --new /path/to/new/harvest   --webhook https://nr.mig5.net/forms/webhooks/xxxx   --webhook-format json   --webhook-header 'X-Enroll-Secret: xxxx'
+```
+
+`diff` mode also supports email sending and text or markdown format, as well as `--exit-code` mode to trigger a return code of 2 (useful for crons or CI)
+
+---
+
+## Run Ansible

 ### Single-site
-
-You can run it however you prefer (local connection or your own inventory). Example:
-
 ```bash
 ansible-playbook -i "localhost," -c local /tmp/enroll-ansible/playbook.yml
 ```

 ### Multi-site (--fqdn)
-
-In multi-site mode, enroll generates an ansible.cfg, `host_vars` inventory, and a host-specific playbook:
-
 ```bash
 ansible-playbook /tmp/enroll-ansible/playbooks/"$(hostname -f)".yml
 ```
-
---
-
-# Found a bug, have a suggestion?
-
-My Forgejo doesn't yet support proper federation, and for that reason I've not opened up registration/login to use the issue queue.
-
-Instead, you can e-mail me (see the pyproject.toml for details) or contact me on the Fediverse:
-
-https://goto.mig5.net/@mig5