# codeflash-agent
A Claude Code plugin for autonomous Python runtime performance optimization. Profiles code, implements optimizations, benchmarks before and after, and iterates until plateau.
## Domains
| Domain | When to use |
|---|---|
| Memory | Peak memory, OOM, memory leaks, RSS reduction |
| Async | Concurrency, event loop blocking, sequential awaits, throughput/latency |
| Data Structures | CPU time, O(n²) loops, wrong containers, algorithmic complexity |
| Structure | Import time, circular deps, module reorganization for performance |
The agent auto-detects which domain(s) apply based on your request.
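As a generic illustration of the Data Structures domain (not agent output), the classic fix is replacing an O(n²) list membership scan with a set lookup:

```python
# O(n^2): each `in` check scans the whole list b
def common_ids_slow(a, b):
    return [x for x in a if x in b]

# O(n): build a set once, then membership checks are O(1)
def common_ids_fast(a, b):
    b_set = set(b)
    return [x for x in a if x in b_set]
```

Both return the same result; only the lookup container changes.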
## Install

Inside Claude Code, run:

```
/plugin marketplace add codeflash-ai/codeflash-agent
/plugin install codeflash-agent@codeflash
```
### Team setup

Add this to your repo's `.claude/settings.json` so everyone on the team gets it automatically:

```json
{
  "extraKnownMarketplaces": {
    "codeflash": {
      "source": {
        "source": "github",
        "repo": "codeflash-ai/codeflash-agent"
      }
    }
  },
  "enabledPlugins": {
    "codeflash-agent@codeflash": true
  }
}
```
### Local (development)

```
git clone https://github.com/codeflash-ai/codeflash-agent.git
claude --plugin-dir ./codeflash-agent
```
## Usage
The agent triggers automatically when you describe a performance problem:
> Our /process endpoint takes 5s but individual calls should only take 500ms each
> test_process_large_file is using 3GB, find ways to reduce it
> process_records is too slow, it's doing O(n²) lookups
Or use the slash command:

```
/codeflash-optimize start    # begin a new session
/codeflash-optimize resume   # continue from where you left off
/codeflash-optimize status   # check progress
```
## How it works
- Discovery — reads project structure, detects package manager, identifies target code
- Baseline — profiles the target before making any changes (mandatory)
- Analysis — ranks bottlenecks by measured impact, not source-reading intuition
- Experiment loop — implements fixes one at a time, re-profiles after each, keeps or discards based on measured improvement
- Plateau detection — stops when gains diminish or stall
Session state persists in HANDOFF.md and results.tsv, so you can resume across conversations.
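The experiment loop above boils down to a measure-change-remeasure discipline. Here is a minimal sketch of that idea using `timeit` (an illustration of the principle, not the agent's actual implementation):

```python
import timeit


def keeps_optimization(old_fn, new_fn, args, number=100, repeats=5):
    """Return (keep, old_time, new_time): keep the candidate only if it
    measurably beats the baseline on the same workload."""
    old_t = min(timeit.repeat(lambda: old_fn(*args), number=number, repeat=repeats))
    new_t = min(timeit.repeat(lambda: new_fn(*args), number=number, repeat=repeats))
    return new_t < old_t, old_t, new_t


# Example: a Python-level loop vs. the closed-form sum
def sum_loop(n):
    total = 0
    for i in range(n):
        total += i
    return total


def sum_formula(n):
    return n * (n - 1) // 2


keep, old_t, new_t = keeps_optimization(sum_loop, sum_formula, (10_000,))
```

Discarding changes that don't beat the measured baseline is what prevents plausible-looking "optimizations" from regressing performance.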
## Repo structure

```
packages/
  codeflash-core/            # shared foundation (models, AI client, telemetry, git)
  codeflash-python/          # Python language CLI — extends core
  codeflash-mcp/             # MCP server (stub)
  codeflash-lsp/             # LSP server (stub)
services/
  github-app/                # GitHub App integration service
plugin/                      # Claude Code plugin (language-agnostic)
  .claude-plugin/            # plugin manifest & marketplace config
  agents/                    # review & research agents
  commands/                  # codex CLI integration commands
  hooks/                     # session lifecycle & review gate hooks
  references/shared/         # shared methodology & benchmarking guides
languages/python/plugin/     # Python-specific plugin content
  agents/                    # router + domain agents (cpu, memory, async, structure)
  references/                # domain-specific deep-dive guides
  skills/                    # /codeflash-optimize, memray profiling
vendor/
  codex/                     # OpenAI Codex runtime (vendored)
evals/                       # eval templates & real-repo scenarios
```