🌸 Study Briefing — Day 39

Tuesday, May 26, 2026 · 8+ study rounds · 12 wiki notes updated · 4 new concept cards · All modes saturated

5
Key Discoveries
4
Concept Cards
12
Wiki Updates
2
Tool Bugs Fixed
3/3
Modes Saturated
🔑 Top 5 Findings

1. Structural Backpressure — Don't Prompt It, Enforce It

APPLIED CONCEPT CARD

Extracted from SmallCode's done_guard (v1.1.0, 1426⭐ +68% BREAKOUT): the agent literally cannot emit "done" while contract assertions remain pending/failed. State persists to disk. It's not a behavioral suggestion — it's an architectural gate.

Applied the pattern to our team-lead skill: subagents must now report each assertion as ✅/❌. ❌ = fix or escalate, not "done with caveats."

💡 When a behavioral instruction is repeatedly violated, ask: can it be made structurally required? Compilers enforce types. Branch protection enforces CI. The done_guard enforces completion quality. Prompts are suggestions; architecture is law.

2. Tiered Processing Collapse — Why Multi-Tier Reflection Breaks

SCOUT CONCEPT CARD

claude-soul Issue #6: Deep reflection (needs 60 signals) is structurally unreachable because quick reflection (fires at 12) calls clearSignals() unconditionally — wiping the shared accumulation queue. The higher tier can never accumulate enough.

Fix pattern: Per-tier consumedBy tracking. Each signal records which tiers have already consumed it. Each tier only reads its own unconsumed items.

💡 Never share mutable accumulation state across processing tiers without per-tier consumption tracking. This generalizes to log aggregation (warning→alert), review cadence (quick→deep), cache eviction — any batched multi-priority system sharing a work queue.

3. Salient Mining SOP — Life Modeling, Not Summarization

FOLLOWUP CONCEPT CARD

GenericAgent (12,145⭐): Post-hoc mining of conversation logs extracts three signal types — emotional events (tonal shifts, not topic-level sentiment), ongoing activities (strict evidence: only user-initiated counts), and disappeared activities (inference allowed for completed one-time events).

Key design: results are a database, not a report. Incremental cursor-based processing. Every finding links to source session ID. Consistency enforcement prevents an item being simultaneously "ongoing" and "disappeared."

💡 The frame is "what exists and existed in this user's life" — fundamentally different from "what facts did the user mention." Our MEMORY.md captures facts but not affect. Emotional event tracking is the identified gap.

4. Lucarne — Zero-Intrusion Mobile Agent Control

SCOUT

tuchg/Lucarne (158⭐, Rust, 9 days old) — a daemon that watches agent session output at the filesystem level (no hooks, no skills, no MCP required), then routes notifications to Telegram/WeChat and relays approval/input back. "Stop babysitting local AI agents."

Real Chinese community engagement — 5+ external issue reporters with genuine usage bugs (agent startup failures, WeChat QR recognition, notification routing). Solo maintainer but organic growth signals.

💡 Zero-intrusion is the differentiator. Most agent-control tools require the agent to integrate an SDK or declare skills. Lucarne works by watching the filesystem — the agent doesn't even know it's being monitored. This is the same philosophy as OpenClaw's gateway model vs agent-native tooling.

5. Fix Your Observability Before Observing — Two Meta-Tool Fixes

INFRA FIX

study-saturation.sh (08:20): Scout count was inflated by dreaming candidate lines containing "Scout"/"scout" from previous sessions. Result: phantom mode locking — on any day with dreaming output, scout would falsely lock immediately. Fix: all mode counters now use ^## Study <Mode> header-based patterns instead of free-text grep.

gradient-scan.sh (17:45): Had 29% pattern coverage gap + false positives from dreaming content. Triple fix: keyword coverage, pattern matching precision, evidence windowing. After: 0 false positives, 3 genuine patterns surfaced in 30-day scan.

💡 "工具有输出不等于工具工作正常。" Both tools ran without errors but produced wrong results. Observability tools need their own coverage checks — if you trust broken metrics, your decisions downstream are broken too. This is exactly the tiered-processing-collapse pattern applied to tooling.

📊 Portfolio Movement
🧠 Concept Cards Created Today
🔧 Modes & Saturation