codeflash-agent/evals
Kevin Turcios 33faedf427
Add Unstructured report, rewrite statusline, format evals/scripts (#20)
* Add Unstructured engagement report as uv workspace member

Three-tier Plotly Dash app (Executive Brief, Engineering Team, Full
Detail) with data in JSON, theme constants in theme.py, and Dash
production improvements (Google Fonts, clientside callbacks, meta tags).

Also: add .playwright-mcp/ to .gitignore, add reports/* ruff overrides,
remove tracked .codeflash/observability/read-tracker.

* Rewrite statusline to derive context from git state

Detects the active area from changed files (reports, packages, plugin,
.codeflash, case-studies, evals), falls back to the branch-name
convention (perf/*, feat/*, fix/*), and shows a dirty indicator. Uses
whoami for cross-platform user detection.

* Add pre-push lint rule to commit guidelines

* Exclude .codeflash/ from ruff linting

Benchmark and profiling scripts in .codeflash/ are scratch work, not
package source. Excluding them prevents CI failures from ad-hoc scripts.

* Run ruff format across packages, scripts, evals, and plugin refs

* Fix github-app async test failures in CI

Add asyncio_mode = "auto" to root pytest config so async tests
are detected when running from the repo root via uv run pytest packages/.
2026-04-15 03:06:16 -05:00
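
The statusline change described in the commit message above (active area from changed files, then a branch-name fallback, plus a dirty indicator) could be sketched roughly as follows. This is a minimal illustration, not the repository's actual script: the function names `detect_area` and `format_statusline`, the "most changed files wins" tie-breaking, the `*` dirty marker, and the output layout are all assumptions. In practice the inputs would presumably come from `git status --porcelain`, `git rev-parse --abbrev-ref HEAD`, and `whoami`; here they are passed in as plain values so the logic stays self-contained.

```python
from __future__ import annotations

# Top-level directories named in the commit message as recognised areas.
AREAS = ("reports", "packages", "plugin", ".codeflash", "case-studies", "evals")

# Branch-name prefixes named in the commit message as the fallback convention.
BRANCH_PREFIXES = ("perf/", "feat/", "fix/")


def detect_area(changed_files: list[str], branch: str) -> str | None:
    """Return the active area, or None if nothing matches.

    Assumption: the area with the most changed files wins; the real
    statusline may use a different rule.
    """
    counts: dict[str, int] = {}
    for path in changed_files:
        top = path.split("/", 1)[0]  # first path component
        if top in AREAS:
            counts[top] = counts.get(top, 0) + 1
    if counts:
        return max(counts, key=lambda area: counts[area])

    # No recognised area among the changed files: fall back to the branch name.
    for prefix in BRANCH_PREFIXES:
        if branch.startswith(prefix):
            return prefix.rstrip("/")
    return None


def format_statusline(user: str, branch: str, changed_files: list[str]) -> str:
    """Build a one-line status string (hypothetical layout)."""
    area = detect_area(changed_files, branch)
    dirty = "*" if changed_files else ""  # hypothetical dirty marker
    parts = [user, f"{branch}{dirty}"]
    if area is not None:
        parts.append(f"[{area}]")
    return " ".join(parts)


if __name__ == "__main__":
    print(format_statusline("kevin", "feat/report", ["reports/app.py"]))
```

Keeping the detection logic as pure functions, separate from the git calls, makes it straightforward to test without a working tree.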
repos/codeflash-internal-psycopg-serialization   Merge main-teammate branch                                                2026-04-03 17:36:50 -05:00
templates                                        Add Unstructured report, rewrite statusline, format evals/scripts (#20)   2026-04-15 03:06:16 -05:00
.gitignore                                       Merge main-teammate branch                                                2026-04-03 17:36:50 -05:00
baseline-scores.json                             Merge main-teammate branch                                                2026-04-03 17:36:50 -05:00
check-regression.sh                              Merge main-teammate branch                                                2026-04-03 17:36:50 -05:00
run-eval.sh                                      squash                                                                    2026-04-13 14:12:17 -05:00
score-eval.sh                                    Merge main-teammate branch                                                2026-04-03 17:36:50 -05:00
score.py                                         Add Unstructured report, rewrite statusline, format evals/scripts (#20)   2026-04-15 03:06:16 -05:00