Analyzed ~89 Claude Code sessions across 7 unstructured_org projects to
identify recurring failures and friction points, then applied fixes:
- Fix "ask then die" bug: skill now injects AUTONOMOUS MODE directive so
domain agents work without interactive questions that kill the Agent tool
- Fix git add -A: all 4 domain agents now stage specific files instead of
blindly staging everything (caused accidental commits of scratch files)
- Add pre-commit step: agents run pre-commit before every commit to catch
linting failures before CI (ruff/undersort failures were recurring)
- Add measurement methodology lock: prevents changing profiling flags
mid-experiment which created uninterpretable deltas
- Add branch state verification to router startup (prevents wrong-branch
confusion that wasted multiple sessions)
- Add multi-repo detection to router (original work spanned 4 repos)
- Add library vs application awareness to memory agent (prevents wasting
time on import-time optimizations in library projects)
- Add dependency resilience to setup agent (uv run --with isolation
warning, private PyPI failure guidance)
- Add PR text quality guidelines (sessions showed AI-sounding text that
required multiple user corrections)
- Add chart generation guidelines to pr-preparation.md
- Add context conservation rules (max 2 background tasks, use subagents)
- Add cross-session learnings template for .codeflash/learnings.md
- All domain agents now read learnings.md at startup