Three changes expanding PHI safety envelope on the tool-result surface. Closes V2 + V12 + V2-sub from Vera's audit. No behavior change for users not interacting with HL7-shaped data.

- Tool-name allow-list dropped. The v0.7.3 tool-result auto-PHI gate ran only on read_file (.hl7|.txt), nc_msgs, hl7_field, hl7_diff. v0.8.1 runs _auto_phi_looks_like_hl7 on EVERY tool result. On hit → route through lib/hl7-sanitize.sh. On miss → pass through unchanged. Closes V2: bash_exec / ssh_exec / grep_files / read_file of any extension all get scanned when their output is HL7-shaped. False-positive cost is negligible (extra regex pass on non-HL7 has zero behavioral impact).
- Base64-wrapped HL7 round-trip. New _auto_phi_b64_roundtrip helper. Detects candidate base64 runs (length >= 200, [A-Za-z0-9+/=] only, length divisible by 4; NOT entropy-based per Pax §V2-sub: HL7's repetitive prefixes survive base64 with LOW entropy, so entropy is the wrong signal). Speculatively decodes each candidate; if decoded bytes look like HL7, routes through hl7-sanitize.sh and re-encodes back into the result. Catches ssh_pull_smat sampled mode's TSV format. Requires python3 (installed everywhere larry runs); skipped with a one-time stderr warning when unavailable. Server-side TSV encoding kept (binary-safe transport); client-side unwrap handles the safety concern, no remote refactor needed.
- Operator review gate for bash_exec/ssh_exec/ssh_pull/ssh_pull_smat results. When the tool produced HL7-shaped output OR the result exceeds LARRY_TOOL_RESULT_REVIEW_THRESHOLD bytes (default 8192), Larry prompts [Y/n/i] before passing the result back to the model. 'i' opens the full output in $PAGER then re-prompts. Default Y (zero friction). N substitutes a refusal JSON so the model surfaces that something was withheld. Skipped when LARRY_AUTO_PHI=off (opt-out consistency) OR no TTY (headless scripts unaffected). Override with LARRY_TOOL_RESULT_REVIEW=always for paranoid mode. Closes V12.

Proactive same-pattern sweep. Searched for other call sites where tool output bypasses content-shape gating: only the one in agent_turn. The v0.8.0-c strict-mode tool-result branch was updated in lockstep so it now triggers on the broader (content-only) eligibility.

Verification: bash -n clean; b64 round-trip unit-tested with three cases (real-world HL7 base64 → decoded contains tokenized PHI not clear-text PHI; plain text → passthrough; non-HL7 b64 → passthrough, no false positive).

Co-Authored-By: Clover (Claude Opus 4.7) <noreply@anthropic.com>
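The base64 candidate rule described in the commit message (length >= 200, base64 alphabet only, length divisible by 4, then a speculative decode and HL7 shape check) can be sketched roughly as below. This is an illustration only, not the actual _auto_phi_b64_roundtrip code: the function names `looks_like_hl7`, `is_b64_candidate`, and `b64_wraps_hl7` are hypothetical, and the "starts with `MSH|`" shape test is an assumption about what an HL7-shape check might look like.

```shell
#!/usr/bin/env bash
# Hypothetical helper names; the real helpers in larry.sh may differ.

# ASSUMPTION: treat text as HL7-shaped when it begins with an MSH segment.
looks_like_hl7() {
  [[ ${1:0:4} == 'MSH|' ]]
}

# Structural test from the commit message: no entropy heuristic involved.
is_b64_candidate() {
  local s=$1
  (( ${#s} >= 200 ))   || return 1   # long enough to be worth decoding
  (( ${#s} % 4 == 0 )) || return 1   # padded base64 length is a multiple of 4
  [[ $s =~ ^[A-Za-z0-9+/=]+$ ]]      # base64 alphabet only
}

# Speculative decode via python3 (the commit message names python3 as the
# dependency); returns 0 only when the decoded bytes look like HL7.
b64_wraps_hl7() {
  local s=$1 decoded
  is_b64_candidate "$s" || return 1
  decoded=$(printf '%s' "$s" |
    python3 -c 'import base64,sys; sys.stdout.buffer.write(base64.b64decode(sys.stdin.read()))' \
    2>/dev/null) || return 1
  looks_like_hl7 "$decoded"
}
```

A structural check like this stays cheap on non-HL7 output: most tool results fail the length or alphabet test before any decode is attempted.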
Larry-Anywhere
Portable AI agent for Cloverleaf integration work. Single bash script, no installs, no root, no package manager. Runs on Linux and inside MobaXterm on Windows. 26 native v3 tools for NetConfig analysis, message search, system documentation, regression testing, and safe NetConfig modification — all implemented directly in bash with no dependency on v1 wrapper scripts or v2 cloverleaf-tools.pyz.
When Cloverleaf is installed, Larry uses the shipped product binaries (tclsh, hcienginerun, etc.) directly. Otherwise it falls back to bash one-liners it composes itself. Never relies on the v1/v2 wrapper layers.
Install
One-liner (recommended)
On any client box with curl and bash (essentially any Linux + MobaXterm shell):
curl -fsSL https://raw.githubusercontent.com/bojj27/cloverleaf-larry/main/install-larry.sh | bash
The installer:
- Detects platform (Linux / Darwin / MobaXterm-cygwin) and arch
- Creates ~/.larry/ (or wherever $LARRY_HOME points)
- Pulls every script + agent file from bojj27/cloverleaf-larry raw URLs
- Downloads a static jq binary into ~/.larry/bin/ if jq isn't on PATH
- Drops a larry shim into ~/bin/
- Makes no system changes, requires no root
First run:
larry # prompts for ANTHROPIC_API_KEY once
# saved to ~/.larry/.env mode 0600
Auto-update
Every time you run larry, it self-updates from the canonical GitHub URL. To suppress for one launch: larry --no-update. To disable permanently: export LARRY_NO_UPDATE=1.
Offline / scp install (when the client box can't reach github.com)
# from a machine that CAN reach github
git clone https://github.com/bojj27/cloverleaf-larry
scp -r cloverleaf-larry/ user@client-box:~/cloverleaf-larry/
ssh user@client-box
cd ~/cloverleaf-larry && ./install-larry.sh
The installer detects local files and uses them when LARRY_BASE_URL isn't reachable.
Use
Set the Cloverleaf runtime context, then point Larry at your site:
export HCIROOT=/opt/cloverleaf/cis2025/integrator
export HCISITE=adt
larry "$HCIROOT/$HCISITE"
you> list every protocol in this site
you> find threads with codametrix in the name
you> show messages from to_3m in the last 3 days for MRN 5720501458
you> generate jump threads for every TCP-listener inbound, target host=newlinux01.test, jump port = orig+10000
you> diff the ADTto_3m interface + connected threads between test and prod
you> document the codametrix system into ~/.larry/knowledge/codametrix.md
you> /quit
What Larry can do natively (v3 tools)
| domain | tools |
|---|---|
| File system | read_file, list_dir, grep_files, glob_files, write_file, bash_exec |
| NetConfig (read) | nc_list_protocols, nc_list_processes, nc_protocol_block, nc_protocol_field, nc_protocol_nested, nc_protocol_summary, nc_destinations, nc_sources, nc_xlate_refs, nc_tclproc_refs |
| NetConfig (write, journaled) | nc_insert_protocol, nc_add_route |
| Workflows | nc_find_inbound, nc_make_jump, nc_document, nc_find, nc_diff_interface |
| Messages (smat is SQLite!) | hl7_field, nc_msgs, hl7_diff |
| Safety | larry_rollback_list + larry-rollback.sh CLI |
Every write goes through a journal (~/.larry/journal/<session>/) — original snapshotted, diff saved, atomic replacement. Roll back any subset with larry-rollback.sh --list, --target /path/to/file, --session <id>, or --entry <id>.
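The snapshot / diff / atomic-replace sequence can be sketched as follows. This is a minimal illustration under stated assumptions, not Larry's actual journal code: `journaled_write` is a hypothetical name, and the `.orig` / `.diff` file layout inside the journal directory is invented for the example (the real entries are named `<session>/<NNN_filename>`).

```shell
#!/usr/bin/env bash
# Hypothetical sketch: journaled_write TARGET NEW_CONTENT JOURNAL_DIR
journaled_write() {
  local target=$1 new_content=$2 journal_dir=$3
  local base tmp
  base=$(basename -- "$target")
  mkdir -p -- "$journal_dir" || return 1

  # 1. Snapshot the original (an empty snapshot if the target is new).
  if [[ -e $target ]]; then
    cp -p -- "$target" "$journal_dir/$base.orig" || return 1
  else
    : > "$journal_dir/$base.orig"
  fi

  # 2. Stage the new content next to the target so the final rename
  #    stays on the same filesystem.
  tmp=$(mktemp "$(dirname -- "$target")/.$base.XXXXXX") || return 1
  printf '%s' "$new_content" > "$tmp" || { rm -f -- "$tmp"; return 1; }

  # 3. Save the diff. diff exits 1 when files differ, which is expected.
  diff -u -- "$journal_dir/$base.orig" "$tmp" > "$journal_dir/$base.diff" \
    || [[ $? -eq 1 ]] || { rm -f -- "$tmp"; return 1; }

  # 4. Atomic replacement: rename within one filesystem.
  mv -f -- "$tmp" "$target"
}
```

Staging the temp file in the target's own directory is the design point: a rename within one filesystem replaces the file in a single step, so a reader never sees a half-written NetConfig.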
Slash commands in the REPL
| command | what |
|---|---|
| /env | show detected HCIROOT/HCISITE + tool layer presence |
| /sites | list site dirs under HCIROOT |
| /site <name> | switch HCISITE mid-session |
| /cd <path> | change working directory |
| /model <name> | switch Claude model |
| /reset | clear conversation history |
| /load <file> | load a file as your next message |
| /help | full slash-command help |
Working examples (battle-tested against a 22-site Cloverleaf install)
- Migration jump-threads: "find every TCP-listener inbound, generate the 3-thread jump pair (linux_out / windowsin / windows_out) for each." Inserts via journaled write. Roll back instantly.
- MRN search: "messages from to_3m in last 3 days for patient MRN X." Reads smat via sqlite3 -ascii, parses HL7 natively, filters by PID field; no Cloverleaf binary involved.
- System documentation: "find all threads matching , document them." Cross-site walk, threads + ports + processes + xlates + tclprocs, adjacent-thread map, placeholder POC/status/escalation sections.
- Interface diff: "diff ADTto_3m + connected (depth 1) between test and prod." Connected-graph BFS, protocol-block diff + xlate-file diff + tclproc-file diff.
- Regression diff (Phase 6): hl7_diff for any two HL7 message files, with --ignore MSH.7 by default and configurable field-level exceptions. The orchestrator that drives Cloverleaf's route_test end-to-end is the only Example 6 piece pending an engine to invoke against.
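The "parses HL7 natively, filters by PID field" step comes down to splitting carriage-return-delimited segments on `|`. The sketch below shows one way to pull a single field with awk. It is not the hl7_field tool itself: `hl7_get_field` is a hypothetical name and the sample message is made up. The only subtlety is that MSH is numbered differently from other segments, because MSH.1 is the field separator itself.

```shell
#!/usr/bin/env bash
# Hypothetical sketch: hl7_get_field MESSAGE SEGMENT FIELD_NUMBER
# Prints the requested field of the first matching segment.
hl7_get_field() {
  printf '%s' "$1" | awk -v seg="$2" -v n="$3" '
    BEGIN { RS = "\r"; FS = "|" }   # HL7 v2: CR between segments, | between fields
    $1 == seg {
      # In MSH the separator itself counts as MSH.1, so MSH.n is awk field n;
      # in every other segment, SEG.n is awk field n + 1.
      idx = (seg == "MSH") ? n : n + 1
      print $idx
      exit
    }'
}

# Usage with a made-up two-segment message:
msg=$'MSH|^~\\&|SENDAPP|SENDFAC\rPID|1||12345^^^MRN\r'
hl7_get_field "$msg" PID 3
```

Filtering a batch of messages by MRN then means comparing the PID.3 value (or its first component) against the target for each message.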
Architecture in one diagram
Agent layer Larry-Anywhere (this repo)
├── bash REPL → Anthropic API
├── personas: Larry + Clover + Regress + Cheatsheet
├── 26 native tools (no v1/v2 deps)
└── journal-backed writes with rollback
│
↓ acts on
Cloverleaf install $HCIROOT / $HCISITE
NetConfig, Xlate/, tables/, tclprocs/, formats/
.smatdb files (SQLite!) under exec/processes/
shipped binaries (tclsh, hcienginerun, ...) — invoked
directly via bash_exec when needed for engine ops
No layer between Larry and Cloverleaf except plain bash. The v1 wrapper scripts (tbn, hlq, mr, mp, mg, awkcut, ...) and the v2 cloverleaf-tools.pyz are intentionally absent.
Environment cheat-sheet
| var | default | purpose |
|---|---|---|
| LARRY_HOME | ~/.larry | where state lives (sessions, journal, .env, agent overrides) |
| LARRY_MODEL | claude-sonnet-4-6 | Claude model (try claude-opus-4-7 for deeper work) |
| LARRY_MAX_TOKENS | 8192 | per-turn output cap |
| LARRY_NO_UPDATE | 0 | set to 1 to disable self-update |
| LARRY_UPDATE_URL | github.com/bojj27/cloverleaf-larry/main/larry.sh | self-update source |
| LARRY_AGENTS_URL | github.com/bojj27/cloverleaf-larry/main/agents | persona refresh source |
| ANTHROPIC_API_KEY | (prompted on first run) | API key, saved to $LARRY_HOME/.env |
| HCIROOT / HCISITE | (unset) | auto-detected and surfaced in system prompt |
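As an illustration, a shell-profile fragment that pins a few of these variables might look like the following. The variable names and values come from the table above; whether you want them in ~/.bashrc or set per session is a local choice, and the specific combination shown is only an example.

```shell
# Example overrides for a shell profile (values taken from the table above).
export LARRY_HOME="${LARRY_HOME:-$HOME/.larry}"   # keep the default unless already set
export LARRY_MODEL="claude-opus-4-7"              # deeper work than the default model
export LARRY_MAX_TOKENS=8192                      # per-turn output cap (the default)
export LARRY_NO_UPDATE=1                          # disable self-update permanently
```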
Roll back any change Larry made
larry-rollback.sh --list # see every write Larry made, newest first
larry-rollback.sh --target /opt/cloverleaf/.../NetConfig # undo every change to this file
larry-rollback.sh --session 2026-05-26-090724-12345 # undo a whole Larry session
larry-rollback.sh --last 1 # undo the most recent write
larry-rollback.sh --entry <session>/<NNN_filename> # undo one specific write
Pre-rollback copies are left at <target>.larry-prerollback.<unix-ts> so you can re-do if needed.
Hard limits (V3)
- No subagent dispatch — Larry + Clover + Regress live in one head. No Pax / Iris / Vera / etc. in portable mode.
- No memory layer: Honcho / Hindsight / mem0 aren't reachable from a remote client box yet. Session history is the markdown logs in $LARRY_HOME/sessions/.
- read_file capped at 250 KB, grep_files / glob_files 300 results, bash_exec 500 lines of output. Use targeted queries.
- Subscription OAuth not yet wired: API key path only. Claude.ai Max subscription quota uses a different auth flow (OAuth device-code); landing in a future release.
Reverse SSH tunnel back home (optional)
If you also want your home Larry to dial into the client shell:
~/.larry/larry-tunnel.sh --serveo # zero-config (serveo.net, third-party)
~/.larry/larry-tunnel.sh --hop=user@bjnoela.com:22 # your controlled hop
Auto-reconnect built in. PID and public URL written to ~/.larry/tunnel.{pid,url}.
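Because the PID and URL land in plain files, a quick liveness check is easy to script. The sketch below is a hypothetical helper, not part of larry-tunnel.sh; it assumes tunnel.pid holds just the numeric PID and tunnel.url holds the public URL on one line.

```shell
#!/usr/bin/env bash
# Hypothetical helper: report whether the recorded tunnel process is alive.
tunnel_status() {
  local home=${LARRY_HOME:-$HOME/.larry} pid url
  [[ -r $home/tunnel.pid ]] || { echo "no tunnel recorded"; return 1; }
  pid=$(<"$home/tunnel.pid")
  url=$(cat "$home/tunnel.url" 2>/dev/null)
  # kill -0 sends no signal; it only tests that the process exists.
  if [[ $pid =~ ^[0-9]+$ ]] && kill -0 "$pid" 2>/dev/null; then
    echo "tunnel up (pid $pid): ${url:-unknown URL}"
  else
    echo "tunnel down (stale pid ${pid:-?})"
    return 1
  fi
}
```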
License
GPL? MIT? TBD. Bryan decides before this repo gets shared widely.