Commit graph

217 commits

Author SHA1 Message Date
Kevin Turcios
5877864fc8 ci: consolidate required checks into single ci.yaml with gate job
Replace 7 individual required-check workflows (unit-tests, mypy,
5 E2E tests) with a single ci.yaml following the astral-sh/ruff
gate pattern:

- determine-changes job uses native git diff (no third-party deps)
- Each test job skipped at job level when paths don't match
- Single required-checks-passed gate job accepts success + skipped
- E2E security preserved: environment gating, author allowlists

This fixes the long-standing issue where workflow-level path filters
leave required checks "Pending" on PRs that don't touch code paths,
blocking merge without admin override.

Estimated savings: ~$1.05/skipped PR ($0.64 unit-tests + $0.01
type-check + $0.40 E2E), ~$50-100/yr in compute, plus eliminating
all admin-merge workarounds.
2026-04-09 05:43:16 -05:00
Kevin Turcios
46e4035b05 ci: bump astral-sh/setup-uv from v6 to v8.0.0
v8 uses immutable releases (no major/minor tags) for supply chain
security. Pinning to exact version tag per upstream recommendation.
2026-04-09 05:16:10 -05:00
Kevin Turcios
07e4566271 ci: remove self-references from workflow path filters
Editing a workflow YAML file should not trigger that same workflow
to run. Removes .github/workflows/<file> from its own paths filter
in mypy.yml, prek.yaml, and unit-tests.yaml.
2026-04-09 05:08:19 -05:00
Kevin Turcios
e2e8557958 ci: remove self-reference from codeflash-optimize paths
Editing the workflow config shouldn't trigger a full optimization run.
2026-04-09 05:04:32 -05:00
Kevin Turcios
b38cfd2186 ci: scope workflow triggers to relevant paths
- codeflash-optimize.yaml: replace paths: ['**'] wildcard with targeted filters
- mypy.yml: add path filters (was firing on every PR/push including docs)
- prek.yaml: add path filters (was firing on every PR)
- unit-tests.yaml: add path filters (was firing on every PR/push)

Docs-only, README, experiment, and LICENSE changes no longer trigger
these workflows. Saves ~20 workflow runs per docs-only PR.
2026-04-09 04:59:24 -05:00
Kevin Turcios
507319066a ci: temporarily disable automatic PR review job
Adds `false &&` guard to the pr-review job condition. The job will
be skipped on all triggers until this is reverted. The @claude mention
job is unaffected.
2026-04-09 04:33:11 -05:00
HeshamHM28
f5777947c6 Merge remote-tracking branch 'origin/main' into cf-java-void-optimization 2026-04-09 08:15:53 +00:00
Kevin Turcios
3dc21bdec3 ci: pin claude-code-action to v1.0.89 to fix Bedrock auth
v1.0.90 broke Bedrock OIDC auth — all Claude Code runs have been
failing with 403 since Apr 8.

Root cause: anthropics/claude-code-action#1196
Pinning to v1.0.89 (last working version) until upstream fix lands.
2026-04-09 02:12:14 -05:00
Kevin Turcios
d344324325 ci: replace wildcard path triggers with targeted filters on E2E tests
All 12 E2E workflows used `paths: ['**']` which triggered on every file
change — docs, configs, experiments, etc. This caused ~140-200 min of
compute per push event (18+ parallel workflows).

Now E2E tests only trigger when relevant source code changes:
- Python E2E: codeflash/**, tests/**, pyproject.toml, uv.lock, workflow files
- JS E2E: same + packages/**
- Java E2E: already had proper path filters (no change needed)

Estimated savings: ~$150-200/mo in CI compute.
2026-04-09 02:06:28 -05:00
HeshamHM28
f42b58bb98 feat: implement bubble sort optimization and corresponding tests in Java 2026-04-03 14:55:09 +02:00
misrasaurabh1
0fc5bba763 fix: e2e java tracer runs on all codeflash changes and validates replay tests + speedups
- Trigger on any codeflash/** or tests/** changes (not just java subset)
- Validate replay test files are discovered per-function
- Already validates: replay test generation, global discovery count,
  optimization success, and minimum speedup percentage

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-19 22:34:56 -07:00
Saurabh Misra
59031a145e
Merge pull request #1874 from codeflash-ai/java-tracer
feat: Java tracing agent with end-to-end optimization pipeline
2026-03-19 15:12:11 -07:00
misrasaurabh1
c699093a26 Initial e2e tracer implementation 2026-03-18 23:03:35 -07:00
Kevin Turcios
f4319aac67 fix: scale review depth to PR size, add scope boundaries
- Triage now classifies PRs as TRIVIAL/SMALL/LARGE based on lines changed
- SMALL PRs: focused correctness check, quick duplicate scan, skip coverage
- LARGE PRs: full review with design checks, deep duplicate detection, coverage
- Optimization PRs: concise correctness verdict instead of long essays
- Added explicit scope rules: only read files in the diff, don't explore broadly
2026-03-18 19:06:40 -06:00
Kevin Turcios
f1cf46ce6b fix: improve Claude GHA review behavior
- Stop closing PRs on unrelated CI failures (check base branch first)
- Remove age-only auto-close rule; require individual evaluation
- Prevent mass-closing without per-PR analysis
- Merge conflicts only close after 3 days (not immediately)
- Resolve stale threads silently (no " Fixed" reply noise)
- Add design review checks (module placement, root cause, config files)
- Flag accidental file inclusions (binaries, auto-generated versions)
- Request tests for untested new public functions
2026-03-18 19:01:18 -06:00
Mohamed Ashraf
fa9d32f1c4 Merge branch 'main' into omni-java
Resolve 7 merge conflicts from main's modular refactoring + JS improvements:

- aiservice.py: combine multi-language metadata (omni-java) with main's structure
- cmd_init.py: adopt main's modular split (init_config, init_auth, github_workflow) + add Java import
- code_replacer.py: main's clean early-return style + omni-java's non-Python single-block fallback
- version.py, test_support_dispatch.py, test_javascript_test_runner.py: take main's versions
- uv.lock: regenerated

Port Java into main's modular structure:
- Fix init_java.py lazy imports to point to new modules (init_config, init_auth, github_workflow)
- Add Java workflow support to github_workflow.py (detection, template, customization)
- Fix broken Java imports (function_optimizer, line_profiler) after main's module moves

Add safety tests for merge-critical functions:
- test_add_language_metadata.py: 10 tests covering per-language payload correctness
- test_code_replacer_matching.py: 8 tests covering fallback chain

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-13 00:15:19 +00:00
Kevin Turcios
3ba68c0288 fix: attempt to fix failing CI on codeflash optimization PRs before closing
Instead of immediately closing optimization PRs when CI fails, Claude
now checks out the branch, inspects failures, and attempts to fix them.
Only closes if unfixable, with a specific explanation of the failures.
2026-03-10 16:34:55 -06:00
Kevin Turcios
89faf1e2fd chore: use default testgen-review-turns in GHA 2026-03-06 17:05:01 -05:00
Kevin Turcios
70a410a656 feat: add --testgen-review-turns CLI flag to control repair cycles 2026-03-06 01:12:39 -05:00
Kevin Turcios
5c34cb994b chore: enable --testgen-review flag in CI optimization workflow 2026-03-06 00:45:23 -05:00
Kevin Turcios
eceac13fc3 Merge remote-tracking branch 'origin/main' into omni-java
# Conflicts:
#	.claude/rules/architecture.md
#	.claude/rules/code-style.md
#	.github/workflows/claude.yml
#	.github/workflows/duplicate-code-detector.yml
#	codeflash/api/aiservice.py
#	codeflash/cli_cmds/console.py
#	codeflash/cli_cmds/logging_config.py
#	codeflash/code_utils/deduplicate_code.py
#	codeflash/discovery/discover_unit_tests.py
#	codeflash/languages/base.py
#	codeflash/languages/code_replacer.py
#	codeflash/languages/javascript/mocha_runner.py
#	codeflash/languages/javascript/support.py
#	codeflash/languages/python/support.py
#	codeflash/optimization/function_optimizer.py
#	codeflash/verification/parse_test_output.py
#	codeflash/verification/verification_utils.py
#	codeflash/verification/verifier.py
#	packages/codeflash/package-lock.json
#	packages/codeflash/package.json
#	tests/languages/javascript/test_support_dispatch.py
#	tests/test_codeflash_capture.py
#	tests/test_languages/test_javascript_test_runner.py
#	tests/test_multi_file_code_replacement.py
2026-03-04 01:52:32 -05:00
Kevin Turcios
bccc02aade merge: incorporate omni-main-java sync work
Merges the omni-main-java branch which synced main into omni-java,
including JavaFunctionOptimizer, removal of is_java()/is_python() guards,
protocol dispatch for parse_test_xml, and deletion of concolic_testing.py.
2026-03-03 23:42:39 -05:00
Kevin Turcios
c7854fb605 chore: remove simplify step from Claude PR review and add prek rule
Drop the /simplify step that caused unprompted refactors and scope
creep in PR reviews. Also add prek pre-commit rule to project config
so the PR bot and all contributors see it.
2026-03-03 22:21:55 -05:00
Kevin Turcios
8f50f31251 fix: add gh pr close permission to pr-review workflow
The merge_optimization_prs step needs to close stale PRs but
gh pr close was missing from the allowed tools list.
2026-03-03 21:04:14 -05:00
Kevin Turcios
dee1175acd chore: remove simplify step from Claude PR review and add prek rule
Drop the /simplify step that caused unprompted refactors and scope
creep in PR reviews. Also add prek pre-commit rule to project config
so the PR bot and all contributors see it.
2026-03-03 20:02:22 -05:00
Mohamed Ashraf
50ba89c6be ci: build codeflash-runtime JAR in unit-tests workflow
The unit-tests workflow relied on a pre-committed JAR binary in
resources/ which could become stale when Comparator.java changes.
Now the workflow builds the JAR from source and installs it to the
local Maven repo, matching what java-e2e and fibonacci-nogit already do.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-03 20:01:40 +00:00
Kevin Turcios
08dad4e6a4 chore: sync docs, rules, and workflows from main 2026-03-02 22:48:28 -05:00
Kevin Turcios
12b18de2d1 chore: limit claude workflow triggers to reduce spam
- Add paths-ignore to skip reviews for docs, config, CI, and non-production files
- Use github.event.sender.login instead of github.actor for reliable bot detection
- Add triage step to early-exit on trivial PRs
2026-03-02 22:01:11 -05:00
aseembits93
7ee1e7bd2f chore: switch claude workflow model from opus to sonnet 4.6
Use claude-sonnet-4-6 instead of claude-opus-4-6 for CI cost/speed savings.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-03 08:08:28 +05:30
Kevin Turcios
ad4ebdb710 fix: merge duplicate detector into claude workflow and add concurrency
Consolidates duplicate-code-detector.yml into claude.yml as a step in
the pr-review job. Adds concurrency groups with cancel-in-progress to
prevent comment spam from racing workflow runs.
2026-03-02 16:36:35 -05:00
Kevin Turcios
a538722ff7 chore: sync docs, CI, config, and rules from main
Pulls latest changes for .claude/rules/, .github/workflows/, CLAUDE.md,
and docs/ — all main-only changes with no omni-java overlap.
2026-03-02 14:56:26 -05:00
Kevin Turcios
29d0d2cb48 fix: rewrite Claude Code PR review prompt for better instruction following
- Use XML structure instead of markdown for clearer step boundaries
- Resolve stale review threads via GraphQL instead of leaving them
- Positive framing instead of negation for instructions
- Replace aggressive language with calm direct instructions
- Add /simplify skill invocation for code quality pass
- Add verification checkpoint at the end
- Auto-close stale codeflash optimization PRs (age, conflicts, CI failures, deleted functions)
- Remove inline comment MCP tool, add Skill tool
2026-03-02 10:53:35 -05:00
aseembits93
efb5cd9987 reverting workflows to original state 2026-02-24 21:54:58 +05:30
aseembits93
bdd22afe20 chore: sync workflow files from main
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-24 21:49:11 +05:30
aseembits93
51028a6edf updating workflows to mains 2026-02-24 21:41:39 +05:30
Kevin Turcios
0c786840e7 rework duplicate-code-detector a bit. 2026-02-23 08:16:43 -05:00
Kevin Turcios
c74782757b Merge commit '6346c740' into sync-main-batch-4
# Conflicts:
#	.github/workflows/windows-unit-tests.yml
#	codeflash/code_utils/config_consts.py
#	codeflash/code_utils/instrument_existing_tests.py
#	codeflash/languages/python/context/unused_definition_remover.py
#	codeflash/languages/python/static_analysis/code_replacer.py
#	codeflash/optimization/function_optimizer.py
#	codeflash/optimization/optimizer.py
#	pyproject.toml
2026-02-19 21:26:23 -05:00
Kevin Turcios
85d1d4fbf6 Merge commit '6020c4fa' into sync-main-batch-3 2026-02-19 20:33:09 -05:00
Kevin Turcios
c66953d110 Merge commit 'd578d996' into sync-main-batch-2
# Conflicts:
#	codeflash/github/PrComment.py
#	codeflash/optimization/function_optimizer.py
#	codeflash/tracer.py
#	codeflash/verification/parse_test_output.py
#	codeflash/verification/verification_utils.py
2026-02-19 20:27:14 -05:00
Kevin Turcios
7d7a2a21c0 Merge commit '3dd19c62' into sync-main-batch-1
# Conflicts:
#	codeflash/optimization/function_optimizer.py
#	codeflash/verification/verification_utils.py
#	codeflash/version.py
2026-02-19 20:10:05 -05:00
Saurabh Misra
67c12bca07
Merge branch 'main' into cf-release-benchmark-0.2.0 2026-02-19 10:31:39 -08:00
Kevin Turcios
81997282d2 ci: consolidate Windows tests into main unit-tests workflow
Remove redundant windows-unit-tests.yml and add Windows Python 3.13 job
to the main unit-tests.yaml workflow. Add PYTHONIOENCODING env var for
Windows compatibility.
2026-02-19 01:36:11 -05:00
Kevin Turcios
d09fa5ceaa ci: add Windows to unit tests matrix
Run unit tests on Windows with Python 3.13 in addition to all Python
versions (3.9-3.14) on Ubuntu. This ensures cross-platform compatibility
is tested while keeping Windows test duration reasonable.
2026-02-19 01:33:52 -05:00
Kevin Turcios
a76f28f196 chore: bump codeflash-benchmark version to 0.3.0 2026-02-18 17:45:55 -05:00
Kevin Turcios
20c956c0e9 feat: add codeflash-benchmark automated release to publish workflow
Extend the publish workflow to handle both codeflash and codeflash-benchmark
releases from a single workflow file, triggered by their respective version
files. Also syncs benchmark __init__.py version to match pyproject.toml.
2026-02-18 17:39:36 -05:00
HeshamHM28
3359c9ab0d add JDK for windows 2026-02-18 03:57:09 +02:00
HeshamHM28
dc1083b3f9 add java jdk 2026-02-18 03:47:42 +02:00
aseembits93
09c026a7b9 fix: use correct Bedrock inference profile ID (no :0 suffix)
The cross-region inference profile for Claude Opus 4.6 on Bedrock is
`us.anthropic.claude-opus-4-6-v1`, not `us.anthropic.claude-opus-4-6-v1:0`.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 20:34:49 +05:30
Aseem Saxena
0b9dc5d872
Merge branch 'main' into chore/claude-workflow-aws-bedrock 2026-02-17 20:04:47 +05:30
aseembits93
e1a45dd0c8 chore: switch Claude workflows from Foundry to AWS Bedrock
Replace Azure Foundry authentication with AWS Bedrock OIDC in all
Claude Code GitHub Actions workflows.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-17 19:02:00 +05:30
ali
066980b06f
Merge branch 'fix/jest30-pnpm-resolution' of github.com:codeflash-ai/codeflash into fix/jest30-pnpm-resolution 2026-02-16 14:36:55 +02:00
ali
2fb4b2dbfd
cleaning up 2026-02-16 14:36:39 +02:00
HeshamHM28
ca4f01f7c5 Add Java end to end tests 2026-02-16 08:43:51 +02:00
Kevin Turcios
02b9a5e226 chore: replace gh-aw duplicate detector with claude-code-action + Serena
gh-aw doesn't support Azure Foundry auth. Use claude-code-action directly
with use_foundry and Serena MCP server for semantic code analysis.
2026-02-14 19:05:47 -05:00
Kevin Turcios
ef661394b7 fix: configure duplicate code detector for Azure Foundry auth
Pass ANTHROPIC_FOUNDRY_API_KEY and ANTHROPIC_FOUNDRY_BASE_URL env vars
so Claude Code CLI authenticates via Azure Foundry instead of direct API.
2026-02-14 18:26:00 -05:00
Kevin Turcios
f819d6061e chore: add gh-aw duplicate code detector workflow
Adds automated duplicate code detection using GitHub Agentic Workflows
with Serena semantic analysis, configured for Python.
2026-02-14 18:14:16 -05:00
Kevin Turcios
1a3dba2574 Update claude.yml 2026-02-12 00:13:47 -05:00
Mohamed Ashraf
9be69106f6 fix: resolve merge conflicts with omni-java base
Merged omni-java base into PR #1279 to resolve conflicts.

Resolution approach:
1. test_discovery.py: Used refactored method call resolution from base
   - New approach uses sophisticated type tracking (jedi-like "goto")
   - Already includes duplicate checking (line 141)
   - Removed old Strategy 3 (class-based fallback) as it's not needed
     and caused single-function optimization issues

2. test_instrumentation.py: Combined both changes
   - Added API key setup from PR #1279
   - Kept FunctionToOptimize imports from base

The refactored code is more accurate and fixes the single-function
optimization issue that existed in the original PR.

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
2026-02-10 14:31:25 +00:00
Kevin Turcios
2847edc719 feat: upgrade Claude GHA to Opus 4.6 2026-02-06 02:30:37 -05:00
Kevin Turcios
91454e27d7 fix: add verification step to CI Claude workflow to prevent hallucinated fixes
The PR review bot was claiming lint issues were fixed without actually
fixing or committing them. Add a mandatory re-run of prek after fixes
and explicit instructions to report unfixed issues honestly.
2026-02-06 01:53:04 -05:00
Kevin Turcios
e9aadb8d14 feat(ci): auto-merge codeflash optimization PRs when CI passes 2026-02-04 06:23:39 -05:00
Kevin Turcios
7cf2e4e67b feat(ci): add mypy checks to Claude PR review workflow
Claude will now run mypy on changed files and fix type annotation issues.
2026-02-04 06:12:21 -05:00
Kevin Turcios
fe9f22b3ad docs: update pre-commit references to prek
Replace outdated pre-commit terminology with prek across documentation
and CI workflow.
2026-02-04 06:09:18 -05:00
Kevin Turcios
8231c60b70 fix(ci): consolidate Claude review into single sticky comment
- Remove instruction to use gh pr comment for summaries
- Add STEP 4 with explicit single-comment policy
- Include instructions to update existing comments
- Add cleanup step to delete duplicate comments
2026-02-04 06:05:12 -05:00
Kevin Turcios
95cc60397d
Merge branch 'main' into omni-java 2026-02-04 03:22:37 -05:00
Kevin Turcios
cb9248e022 feat: add merge/close permissions and secure workflow
- Add git merge/fetch/checkout/branch to allowed tools
- Add gh pr merge/close to allowed tools
- Add allowed_bots for claude[bot] to trigger pr-review
- Restrict @claude mentions to maintainers only (OWNER/MEMBER/COLLABORATOR)
- Block fork PRs from triggering pr-review and claude-mention
2026-02-03 23:54:43 -05:00
Kevin Turcios
831d296052 fix: skip pr-review when triggered by claude bot 2026-02-03 23:40:45 -05:00
Kevin Turcios
d5ec877a78 feat: add coverage analysis to PR review workflow
- Run tests with coverage on changed files
- Compare coverage between PR and main branch
- New files require ≥75% test coverage
- Modified files must have changed lines covered
- Flag coverage regressions in PR comment
2026-02-03 22:57:56 -05:00
Kevin Turcios
6289c5325a feat: improve Claude PR review workflow
- Consolidate claude-code-review.yml into claude.yml with two jobs
- Add auto-fix for safe linting issues (formatting, imports) before review
- Use --from-ref origin/main to only check changed files
- Add smart re-review logic that resolves fixed comments
- Add inline comment support via MCP tool with 5-7 comment limit
2026-02-03 22:51:32 -05:00
Kevin Turcios
9f4776eb2e chore: migrate from pre-commit to prek
Replace pre-commit with prek (faster Rust-based alternative) for linting.
- Add prek to dev dependencies
- Replace pre-commit workflow with prek workflow using setup-uv@v6
- Update Claude workflow allowed tools to use prek
2026-02-03 19:56:58 -05:00
Kevin Turcios
4dc09174da fix: add uv setup to Claude workflow
Install uv and project dependencies before running Claude Code action
so that pre-commit, ruff, and other tools are available.
2026-02-03 17:34:54 -05:00
Kevin Turcios
32b5d468b4 chore: add tool permissions to Claude workflow and strengthen CLAUDE.md guidelines
Add allowedTools for pre-commit, ruff, pytest, mypy, coverage, and git/gh commands
to enable Claude to run linting and testing. Strengthen naming convention guidance
to explicitly forbid leading underscores on functions.
2026-02-03 16:59:28 -05:00
misrasaurabh1
c40798fa73 Merge branch 'omni-java' of github.com:codeflash-ai/codeflash into omni-java 2026-02-02 19:15:37 -08:00
Mohamed Ashraf
131597caa9 fix: add API key for tests and build codeflash-runtime JAR in CI
- Add CODEFLASH_API_KEY for test_instrumentation.py tests that instantiate Optimizer
- Create pom.xml for codeflash-java-runtime with Gson and SQLite JDBC dependencies
- Add CI step to build and install JAR before running tests
- Update .gitignore to allow pom.xml in codeflash-java-runtime
- All 348 Java tests now pass including 5 Comparator JAR integration tests
2026-02-03 02:18:49 +00:00
Saurabh Misra
a00eb39cd2 feat: add Java end-to-end tests and CI workflow
Add comprehensive e2e tests for the Java optimization pipeline:
- Function discovery (BubbleSort, Calculator)
- Code context extraction
- Code replacement
- Test discovery (JUnit 5)
- Project detection (Maven)
- Compilation and test execution

Also add:
- GitHub Actions workflow for Java e2e tests (java-e2e-tests.yml)
- Maven pom.xml for the Java sample project
- .gitignore exception for pom.xml

The e2e tests verify the full Java pipeline works correctly,
from function discovery through code replacement.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-03 01:17:17 +00:00
Saurabh Misra
82d9e435ef
Merge branch 'main' into add_vitest_support_to_js 2026-02-01 12:32:09 -08:00
Kevin Turcios
90aec153d5 more perms to write 2026-02-01 09:12:39 -05:00
Sarthak Agarwal
4745b0c347 reset test strings to strict 2026-01-31 12:32:23 +05:30
Sarthak Agarwal
c56002f287 vitest support add to js/ts project 2026-01-31 01:09:52 +05:30
Kevin Turcios
73039364ee not needed 2026-01-29 13:14:43 -05:00
ali
ecbace9cf8
fix unit tests 2026-01-29 17:11:10 +02:00
Sarthak Agarwal
483fa44f36 package installer to e2e test 2026-01-29 14:31:06 +05:30
Sarthak Agarwal
2cc1fb2809 tests for extractor and replacer 2026-01-29 01:27:19 +05:30
misrasaurabh1
79ae59f399 add gha e2e tests 2026-01-27 20:57:08 -08:00
codeflash-ai[bot]
6e40ff8488
Add CodeFlash GitHub Actions workflow 2026-01-28 01:29:39 +00:00
Sarthak Agarwal
bcdb0ef39d
Merge branch 'main' into multi-language 2026-01-27 22:53:40 +05:30
Kevin Turcios
1b92d11058 Update mypy.yml 2026-01-24 06:51:43 -05:00
Sarthak Agarwal
29e0e190c5 add refinement and cleanup 2026-01-21 05:28:03 +05:30
Kevin Turcios
9fcbd2bb8f Rename secret to AZURE_ANTHROPIC_API_KEY 2025-12-26 13:39:12 -05:00
Kevin Turcios
2a5d303e6f Rename secret to AZURE_ANTHROPIC_ENDPOINT 2025-12-26 13:37:02 -05:00
Kevin Turcios
e1b7425d54 Switch Claude workflows to Azure Foundry 2025-12-26 13:35:56 -05:00
Kevin Turcios
1cc97b7517 Remove qodo-ai pr-agent workflow (replaced by Claude) 2025-12-26 13:22:52 -05:00
Kevin Turcios
09c6ec7cf0 sticky comments 2025-12-26 13:03:29 -05:00
Kevin Turcios
508df04c1e "Claude Code Review workflow" 2025-12-22 20:51:48 -05:00
Kevin Turcios
a88ce7e566 "Claude PR Assistant workflow" 2025-12-22 20:51:47 -05:00
Kevin Turcios
2e34d83c52
remove test_framework from pyproject.toml (#955)
* follow up

* remove requirement

* Delete uv.lock

* refresh uv-lock

* first pass

* cleanup test_framework here

* cleanup

* code_review

* cleanup tests

* fix for E2E

* fix tests dir missing

* one more cleanup

* cancel-in-progress

* Revert "cancel-in-progress"

This reverts commit f4bb9079cb.

* not needed here

* lower threshold and cleanup comments

* debug

* temp

* debug

Revert "debug"

This reverts commit fc3655149486c8b980e245e97b8304232086f08d.

fix(discover): Fix pytest discovery for futurehouse structure

Revert "fix(discover): Fix pytest discovery for futurehouse structure"

This reverts commit 40c48882b7413f5876af0e2e08d8f17a65bab091.

Reapply "debug"

This reverts commit c8297e57fbdca2462a8ca1199657748b8bc225e9.

Revert "not needed here"

This reverts commit dd2c5cdf76.

Revert "lower threshold and cleanup comments"

This reverts commit 0e2f57e292.

Reapply "lower threshold and cleanup comments"

This reverts commit e3b24f4a2967551eca8a19f96bf6647b23acdbbc.

Reapply "not needed here"

This reverts commit aec32103c931ff6d57dfa0d012113c2cec5d37a7.

Revert "Reapply "debug""

This reverts commit 77ab9f34f858a17fb29764c544769a0eb72ce7f0.

Reapply "fix(discover): Fix pytest discovery for futurehouse structure"

This reverts commit 506b94ab4fe17a7c8e0d458253812758cced3f22.

feat(futurehouse): Make futurehouse structure pytest compatible

* Revert "debug"

This reverts commit 271c5a37ec.

* Revert "temp"

This reverts commit b363acda1c.

* Revert "debug"

This reverts commit ac29b6beb3.

* just for now
2025-12-09 02:53:08 -08:00
Kevin Turcios
66ff6f065d apply suggestion 2025-12-07 00:53:32 -06:00
Kevin Turcios
a22adba91a apply suggestion 2025-12-07 00:51:47 -06:00
Kevin Turcios
4cb39b8ab3 cancel-in-progress 2025-12-07 00:46:42 -06:00
Kevin Turcios
33437d39e3
use pytest as the execution engine for all tests (#951)
* first pass

restore

restore this too

Revert "first pass"

This reverts commit b507770b2c79cc948b33222d8877fb784bfe108a.

* continue

* Update uv.lock

* refresh lockfile

* bugfix

* temp

* fix these

* pytest changes

* formatting

* set up test env properly here too

* ruff

* make ruff happy

* Update e2e-bubblesort-unittest.yaml

* with pytest

* bugfix

* oops
2025-12-06 22:40:25 -06:00