Commit graph

1505 commits

Author SHA1 Message Date
Aseem Saxena
0380f9ad0d
Merge branch 'main' into reduce-recompilations 2026-02-25 02:27:47 +05:30
Aseem Saxena
14feee119f
Merge branch 'main' into testgen-jit-iter 2026-02-25 02:27:41 +05:30
claude[bot]
c6e9fc4530 fix: remove duplicate return statement in _find_error_location
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-24 12:57:02 +00:00
mohammed ahmed
f301be093c
Update django/aiservice/aiservice/validators/javascript_validator.py
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
2026-02-24 14:54:56 +02:00
ali
c2eb63eb2e
feat: improve JS/TS validator with markdown support and error locations
Add markdown code block parsing, detailed syntax error locations with
line/col info, and structured logging to the JavaScript/TypeScript
validators.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-24 14:50:50 +02:00
claude[bot]
ae7110491c fix: add type ignore for Django ORM field type mismatch
Update type hints for `add_months_safe` and `get_next_subscription_period`
to accept both datetime.datetime and datetime.date, and add ty:ignore
comment for Django ORM field type that ty cannot infer correctly.

Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>
2026-02-24 10:37:33 +00:00
aseembits93
7f824ce101 fix: eliminate redundant DB queries in middleware and unblock LLM responses
Auth now attaches fetched organization/subscription to the request so
TrackUsageMiddleware reuses them instead of re-querying. RateLimitMiddleware
caches restricted_paths at init and uses async cache methods. LLM call
recording is fire-and-forget via asyncio.create_task to avoid blocking
responses on DB writes.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 20:43:18 +05:30
aseembits93
d4867ef18e refactor: make line profiler JIT handling consistent with regular optimizer
Move JIT instructions appending from the per-call level
(optimize_python_code_line_profiler_single) to the endpoint level
(optimize endpoint), matching the regular optimizer's pattern.
This removes the is_numerical_code parameter threading through
the call chain.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 19:54:03 +05:30
aseembits93
0b523fc367 fix: enforce direct JIT decorator in optimizer prompt for numerical code
When is_numerical_code is true, the LLM sometimes outputs conditional
fallback paths (try/except, if/else) instead of applying the JIT
decorator directly. Add explicit output format instructions to prevent
this behavior.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-02-23 19:49:24 +05:30
Kevin Turcios
033d14ea87
Merge branch 'main' into testgen-jit-iter 2026-02-23 08:56:11 +00:00
Kevin Turcios
f14ff077a6
Merge branch 'main' into reduce-recompilations 2026-02-23 08:55:29 +00:00
claude[bot]
bf4e38c301 fix: add cast to satisfy ty type checker for list covariance
The ty type checker correctly flags that list[str] is not a subtype of list[str | None] due to list invariance. Added explicit cast.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-23 08:42:24 +00:00
Kevin Turcios
16e043883a style: auto-format ranker and test_markdown_utils 2026-02-23 03:39:38 -05:00
Kevin Turcios
85a1c8b183 fix: derive ranker ranking from structured scores instead of LLM array
The JSON parsing path returned the LLM's explicit ranking array,
which sometimes contradicted its own per-dimension scores. Use
_scores_to_ranking() to compute the ranking from weighted scores
when available, falling back to the LLM ranking only when scores
are absent.
2026-02-23 03:37:42 -05:00
Kevin Turcios
20ee6d5b62 fix: penalize local variable caching of globals in ranker prompt
The ranker LLM was rewarding candidates that cache global variables
into locals as a performance win. Add an explicit rule: this is only
relevant on Python ≤3.10; on 3.11+ LOAD_GLOBAL uses adaptive
specialization and is nearly as fast as LOAD_FAST.
2026-02-23 03:37:21 -05:00
Kevin Turcios
c95a36cf38 fix: handle nested code fences in extract_code_block
The non-greedy regex in FIRST_CODE_BLOCK_PATTERN stopped at the first
``` occurrence, even inside triple-quoted strings or nested code fence
blocks. This truncated the extracted code and lost test functions when
LLMs embedded function definitions using ```python:filepath syntax.

Switch to greedy matching and require the closing ``` to be alone on
its line so intermediate backticks are skipped.
2026-02-23 03:36:50 -05:00
Kevin Turcios
ca71d0c8a0 refactor: remove constructor notes preprocessing from testgen pipeline
Full class source is now included in the client-side testgen context,
making the server-side constructor signature extraction redundant.
2026-02-23 03:36:50 -05:00
Kevin Turcios
bfd9f2cd04 fix: respect test_index when creating optimization_features row
The get_or_create defaults passed test lists without positional
indexing, so when a higher test_index created the row first its
content landed at index 0 and was overwritten by the lower index
update, losing a test.
2026-02-23 03:36:50 -05:00
Kevin Turcios
af3185edff fix: handle non-numeric patch suffixes and support Python 3.15 2026-02-23 03:36:50 -05:00
Aseem Saxena
852274e2be
Merge branch 'main' into reduce-recompilations 2026-02-21 00:59:24 +05:30
aseembits93
85c5a2ec82 reduce rcompilations in the tests 2026-02-21 00:57:52 +05:30
Aseem Saxena
8f6d1d0602 fix: improve JIT testgen prompt to avoid error-checking tests
Add explicit guidance to avoid generating tests that check for specific
exception types, since JIT compilers (numba, torch.compile) produce
different error types than uncompiled code. This ensures generated tests
work consistently for both compiled and uncompiled versions.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-20 18:59:04 +00:00
Aseem Saxena
5553b01bc1
Merge branch 'main' into testgen-jit-iter 2026-02-21 00:06:44 +05:30
claude[bot]
4fa972edd3 refactor: remove unused TORCH_TENSOR_FUNCTIONS constant
Co-authored-by: Aseem Saxena <aseembits93@users.noreply.github.com>
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-20 18:33:41 +00:00
Sarthak Agarwal
eb5f4b460e
Migrate to AWS bedrock (#2430)
AWS_ACCESS_KEY_ID=
AWS_SECRET_ACCESS_KEY=
AWS_REGION=us-east-1



Will require these for boto3 authentication
2026-02-20 23:52:48 +05:30
claude[bot]
46da033b05 style: fix ruff formatting and add mypy type annotation 2026-02-20 18:09:05 +00:00
Aseem Saxena
7e1b2a3ade investigate 2026-02-20 18:03:28 +00:00
claude[bot]
1bb1407c6b fix: resolve type checker errors 2026-02-15 12:33:05 +00:00
Kevin Turcios
d6a3c6254f feat: add constructor notes for non-dataclass classes with __init__
The LLM prompt preprocessing now highlights __init__ signatures for
regular classes, not just @dataclass ones, reducing brute-force
constructor guessing and pytest.skip() fallbacks in generated tests.
2026-02-15 07:29:05 -05:00
Kevin Turcios
e5d70443db fix: use positional insertion in log_features to preserve model attribution
log_features() appended test results in call-completion order, causing
model attribution swaps when LLM responses arrived out of order. Pass
test_index through and use positional insertion instead of append.
2026-02-15 03:58:05 -05:00
Kevin Turcios
c13835963c docs: restructure CLAUDE.md files into modular rules
Slim down CLAUDE.md files and move content into path-scoped
.claude/rules/ files to reduce context bloat.
2026-02-14 19:36:21 -05:00
Kevin Turcios
4c3deeb7b8
Restructure CLAUDE.md files and add path-scoped rules for monorepo (#2417)
## Summary

- Restructure CLAUDE.md hierarchy so Claude Code auto-discovers
project-specific instructions
- Delete dead `AGENTS.md` files (referenced non-existent
`.tessl/RULES.md`)
- Rename `django/aiservice/AGENTS.md` → `CLAUDE.md` for auto-discovery
- Create `js/CLAUDE.md` with package commands and gotchas
- Move PR review guidelines to `.claude/rules/pr-review.md` (auto-loaded
rule)
- Move prek workflow to `.claude/skills/fix-prek.md` (on-demand skill)
- Add path-scoped rules for Python and Next.js patterns
- Add domain glossary, service architecture diagram, and per-package
gotchas

## Test plan

- Verify `CLAUDE.md` files exist at root, `django/aiservice/`, and `js/`
- Verify no remaining references to `AGENTS.md` or `.tessl/`
- Verify `.claude/rules/` and `.claude/skills/` files are committed
2026-02-14 17:13:09 -05:00
Kevin Turcios
e26a8ea486
Reorganize top-level feature modules under core/ (#2416)
## Summary

- Move `log_features/` → `core/log_features/` (Django app with
`managed=False` models, no DB impact)
- Move `ranker/`, `workflow_gen/`, `adaptive_optimizer/` →
`core/languages/python/` (Python-focused API modules)
- Update all imports across the codebase (19 files)

## Test plan

- [x] All 548 tests pass
- [x] No stale top-level imports (`from log_features.`, `from ranker.`,
etc.)
- [x] `log_features` AppConfig preserves `label = "log_features"` for
Django app registry compatibility
2026-02-14 17:07:40 -05:00
Kevin Turcios
6caf7469c6
Decouple language modules and remove stale cross-module code (#2415)
## Summary

- Extract testgen and optimizer API routers from
`core/languages/python/` into `core/shared/` with lazy imports,
eliminating cross-module coupling between language modules
- Delete stale JavaScript prompt files left in the Python module after
migration to `js_ts/`
- Remove backward-compat fallback paths for prompt files that already
exist at their new locations
- Remove unused `is_multi_context_any()` and its cross-language imports
- Remove unused `BEGIN_PATCH`/`END_PATCH` constants and stale TODO

## Test plan

- [ ] Verify testgen endpoint dispatches correctly for Python, JS/TS,
and Java
- [ ] Verify optimizer endpoint dispatches correctly for all languages
- [ ] Run existing testgen and optimizer tests
2026-02-14 00:09:44 -05:00
Kevin Turcios
2614393793
Add test_index to LLM call context for observability chat (#2414)
## Summary

- Pass test_index through LLM call context so observability chat can
attribute responses to specific test generation calls
- Fix SSE streaming to send keepalive pings from the start

CF-504
2026-02-13 23:49:20 -05:00
Sarthak Agarwal
c721723971
remove demo test loops (#2412) 2026-02-14 00:43:09 +05:30
Saurabh Misra
198c0c1a4e
codeflash-omni-java (#2335)
# Pull Request Checklist

## Description
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->

---------

Co-authored-by: Claude Opus 4.5 <noreply@anthropic.com>
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
Co-authored-by: HeshamHM28 <HeshamMohamedFathy@outlook.com>
Co-authored-by: Ubuntu <ubuntu@ip-172-31-39-200.ec2.internal>
Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Kevin Turcios <turcioskevinr@gmail.com>
Co-authored-by: Kevin Turcios <106575910+KRRT7@users.noreply.github.com>
2026-02-13 23:26:55 +05:30
Kevin Turcios
ad26be10b8
Fix JS/TS cross-imports from Python module (#2396)
## Problem

The JS/TS language handler (`core/languages/js_ts/`) was importing
models, schemas, config, prompts, and helpers directly from the Python
language handler. This created a confusing architectural dependency and
risked serving wrong language-specific prompt content.

## What Changed

- Created `core/shared/` for genuinely language-agnostic code (optimizer
schemas, models, config, testgen models, context helpers)
- Moved JS/TS-specific prompts and context helpers into
`core/languages/js_ts/`
- Updated all consumers (20+ files) to import from the correct locations
- Removed backwards-compat re-exports from the Python module

## Result

- **Before:** 11 imports from `core.languages.python` in
`core/languages/js_ts/`
- **After:** 0
2026-02-12 22:34:38 -05:00
Kevin Turcios
0df421eccb
Add chat interface to observability timeline (#2395)
## Summary
- Chat panel on the observability timeline that uses Claude to answer
questions about optimization traces
- Tool-based context retrieval (fetches candidates, tests, errors on
demand instead of stuffing everything upfront)
- Uses `@anthropic-ai/sdk` via Azure AI Foundry
- Strengthened testgen prompts to ban mocks/fakes for test inputs
2026-02-12 20:45:33 -05:00
Kevin Turcios
e28642cf22
Fix FTO display showing wrong function for methods with common names (#2391)
Store qualified function name (e.g., HttpInterface.__init__) and
file_path in testgen metadata instead of bare function_name (__init__).
Update the frontend parser to handle qualified names by splitting into
class + method and searching within the correct class using both
tree-sitter and regex. Prioritize the file matching filePath before
searching all files.

# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-02-12 00:30:33 -05:00
Kevin Turcios
db973a0487
fix: relax testgen assertion rule to allow imports from function depe… (#2388)
…ndencies

The old rule ("NOT in libraries such as numpy, pandas etc.") forced LLMs
to reinvent helpers like np.allclose using slow / inaccurate Python
loops. The new rule allows assertions from packages already imported by
the function under test.

# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-02-09 15:05:19 -05:00
Kevin Turcios
629442cc5e
Restructure aiservice to language-first architecture (#2383)
## Summary
- Reorganizes `django/aiservice/` from feature-first layout (separate
`optimizer/`, `testgen/`, `code_repair/` dirs) to language-first layout
under `core/languages/{python,js_ts}/`
- Adds handler/registry/dispatcher pattern for routing requests to
language-specific implementations
- All existing module code preserved via `git mv` for history tracking;
no logic changes to existing modules

## What changed
- New `core/` app with registry, dispatcher, protocols, and error
hierarchy
- `PythonHandler` and `JSTypeScriptHandler` delegate to existing module
functions
- All imports updated across the codebase (views, tests,
adaptive_optimizer, etc.)
- Integration tests for handler registration and dispatch
- 155 files changed, ~880 additions / ~207 deletions (mostly import path
updates and moves)

## Test plan
- [ ] `python manage.py check` passes
- [ ] Integration tests in
`tests/integration/test_handler_integration.py` pass
- [ ] Existing test suite passes with updated import paths
- [ ] Ruff and ty clean on all new infrastructure files

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
2026-02-09 09:15:50 -05:00
Kevin Turcios
b9d318279c
feat: observability improvements and testgen prompt modernization (#2382)
## Summary
- Rewrite testgen system prompts from constraint-heavy to positive-first
structure with chain-of-thought instructions
- Simplify LLM message structure from `[system, user, user, user]` to
`[system, user]` by absorbing plan_content guidelines into system
prompts
- Observability UI: add search to LLM debug dialog, expand timeline view
- Fix data capture: raw LLM responses, all user messages in prompt
column, nested code fences, empty notes handling

## Test plan
- [ ] Verify testgen produces valid test suites with the new prompt
structure
- [ ] Verify observability timeline displays LLM prompts/responses
correctly
- [ ] Check that search works in the LLM debug dialog
2026-02-09 01:20:59 -05:00
Kevin Turcios
752e2504e4
Restructure and improve refinement prompt (#2379)
## Summary
- Restructure the refinement system prompt into clear numbered sections
(Preserve Behavior, Minimize Diff, Revert Anti-Patterns, Maintain
Readability) with an explicit 6-step refinement process
- Extract inline prompt strings into separate markdown files
(`refinement_system_prompt.md`, `refinement_user_prompt.md`), matching
the convention used by other optimizer prompts
- Add `AuthenticatedRequest` type hint to `refine()` endpoint and fix
grammar in tool use section

## Test plan
- [ ] Verify refinement endpoint still works end-to-end with a test
optimization candidate
- [ ] Confirm prompt content is loaded correctly from markdown files at
startup

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
2026-02-08 02:10:20 -05:00
Kevin Turcios
47053591f4
observability v2 toggle (#2378) 2026-02-07 15:50:12 -05:00
Kevin Turcios
f03a06f4e1
Reintroduce enriched obs_context for testgen LLM calls (#2377)
## Summary
- Re-adds the enriched observability context from CF-1041 that was
reverted
- Passes `module_path`, `test_module_path`, `helper_function_names`,
`is_async`, and `function_to_optimize` details to `call_llm` in testgen

## Test plan
- [ ] Verify testgen LLM calls include the enriched context
- [ ] Confirm no regressions in test generation flow
2026-02-07 10:33:13 -05:00
Sarthak Agarwal
98fb2d1579
Revert "CF-1041 observability v2 " need more changes and testing (#2375)
Reverts codeflash-ai/codeflash-internal#2329
2026-02-06 01:18:17 +05:30
Kevin Turcios
07d33edd9f
CF-1041 observability v2 (#2329)
introducing this due to pain points in V1, not a complete rewrite, based
off v1

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
Co-authored-by: Kevin Turcios <KRRT7@users.noreply.github.com>
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
2026-02-05 14:08:02 -05:00
Sarthak Agarwal
08fd1a8787
adding validation for ts in refiner and testgen (#2372)
1. languages/js_ts/testgen.py:
- Updated parse_and_validate_js_output to accept a language parameter
- Uses validate_typescript_syntax when language="typescript", otherwise
uses validate_javascript_syntax
- Updated generate_and_validate_js_test_code to accept and pass the
language parameter
- Updated the call chain to pass language through to the validation
2. optimizer/context_utils/refiner_context.py:
- Added import for validate_typescript_syntax
- Fixed is_valid_refinement method to use correct validator based on
language
- Fixed validate_code_syntax in SingleRefinerContext class
- Fixed validate_code_syntax in MultiRefinerContext class
3. tests/optimizer/test_javascript_validator.py:
- Added test_typescript_type_assertion_valid_in_ts - verifies as unknown
as number is valid TypeScript
- Added test_typescript_type_assertion_invalid_in_js - verifies as
unknown as number is INVALID JavaScript (this would have caught the
original bug)
- Added test_typescript_generic_valid_in_ts - verifies generics are
valid TypeScript
- Added test_typescript_generic_invalid_in_js - verifies generics are
INVALID JavaScript
Files Already Correct (no changes needed):
- languages/js_ts/optimizer.py - already correctly checks language
- languages/js_ts/optimizer_lp.py - already correctly checks language
- optimizer/optimizer_line_profiler.py - already correctly checks
language

---------

Co-authored-by: claude[bot] <41898282+claude[bot]@users.noreply.github.com>
2026-02-04 22:54:44 +00:00
Aseem Saxena
648c95c909
Merge branch 'main' into match-testdiff-schema 2026-02-02 15:07:22 -08:00
Sarthak Agarwal
eb8ad603ff
vitest related changes to prompt (#2366) 2026-02-03 03:29:36 +05:30
Aseem Saxena
90597c52e3
markdown more info 2026-02-02 10:11:44 -08:00
aseembits93
5d0ca8d01b fn var was not used in .format() 2026-02-02 10:00:40 -08:00
Aseem Saxena
019f220c11
cleaning up 2026-02-02 09:45:48 -08:00
Aseem Saxena
3276f9542e
Update django/aiservice/code_repair/code_repair_context.py
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
2026-02-02 09:44:36 -08:00
aseembits93
c2cd6e5e72 minor fix 2026-02-02 09:37:34 -08:00
aseembits93
2e523313b5 prek fixes 2026-02-02 09:25:44 -08:00
Aseem Saxena
1ffdee3000
Fix check for empty test source code section
Ensure sections[diff.test_src_code] is not None before assignment.
2026-02-02 09:24:28 -08:00
Aseem Saxena
a39e155a84
bug: mismatch in cli and internal schema for code repair
Change test_src_code to allow None type
2026-02-02 09:08:17 -08:00
Sarthak Agarwal
b48a8d9a43
Add vitest support in backend (#2363) 2026-02-02 20:51:52 +05:30
Sarthak Agarwal
cbfebf8ee4 fix(js-testgen): escape curly braces in prompt template
The JavaScript test generation prompt contained `{fn}` as part of
example code showing import syntax. However, Python's `.format()`
method interprets this as a placeholder and tries to substitute it,
causing a KeyError.

Fixed by escaping the curly braces as `{{fn}}` so they render as
literal `{fn}` in the final prompt.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-02-01 03:50:05 +05:30
Saurabh Misra
70360436bd fix: strip file extensions from JS/TS import paths in generated tests
LLMs often add .js extensions to TypeScript import paths (e.g.,
`import { func } from '../module.js'`), but TypeScript/Jest module
resolution doesn't require explicit extensions. This causes
"Cannot find module" errors.

This change adds `strip_js_extensions()` function that removes
.js/.ts/.tsx/.jsx/.mjs/.mts extensions from relative import paths
in generated tests. The function handles:
- ES module imports: import { x } from '../path.js'
- CommonJS requires: require('../path.js')
- Jest mocks: jest.mock('../path.js'), jest.doMock(), etc.

External package imports (lodash, react, etc.) are preserved.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 04:22:07 +00:00
Saurabh Misra
b801254d13 fix: strengthen import path extension guidance in prompts
Add more explicit instructions to prevent LLMs from adding .js/.ts
extensions to import paths. The previous guidance was being ignored
by some models.

- Add dedicated "CRITICAL: IMPORT PATH RULES" section with examples
- Show both WRONG and CORRECT patterns explicitly
- Remind to copy the provided import statement exactly

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 02:35:21 +00:00
Saurabh Misra
d59c48426e fix: merge prompt extension fixes and LLM client improvements
- Cherry-pick: Remove .js extension guidance from prompts (from fix/js-import-extension-prompt)
- Add get_llm_client() to create fresh clients per request (fixes event loop issues)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-31 02:32:58 +00:00
Saurabh Misra
8461f71668 fix: use JavaScript identifier regex instead of Python isidentifier()
Python's str.isidentifier() validates Python identifiers, not JavaScript
identifiers. This caused valid JS identifiers like '$handler' to be
rejected (since $ is not valid in Python identifiers).

Changed to use a regex pattern that matches JavaScript identifier rules:
- Can start with letter, underscore, or $
- Can contain letters, digits, underscores, or $

Added tests for $ identifiers to ensure they are correctly handled.
2026-01-31 02:21:02 +00:00
Kevin Turcios
a394db3382 formatting 2026-01-30 20:00:11 -05:00
Saurabh Misra
addbaad370
Merge branch 'main' into fix/class-method-import-syntax 2026-01-30 16:36:03 -08:00
Saurabh Misra
09e6a1710f Address review: add validation for edge cases in import generation
- Add _is_valid_js_identifier() to check for reserved words (module, exports, prototype, etc.)
- Only use class import pattern for single-dot names where class name is valid identifier
- Fall back to module import for:
  - Multiple dots (e.g., Constructor.prototype.method)
  - Reserved words (e.g., module.exports)
- Add comprehensive tests for edge cases
2026-01-31 00:35:10 +00:00
Saurabh Misra
b2fb58eba6 Fix invalid JavaScript import syntax for class methods
When generating test imports for class methods like `Validator.validateRequest`,
the previous code produced invalid JavaScript:
  const { Validator.validateRequest } = require('../middlewares/Validator');

This is invalid because dots are not allowed in destructuring patterns.

The fix:
- Add _generate_import_statement() function to detect class methods (names with dots)
- For class methods: generate `const ClassName = require('...')`
- For simple functions: keep destructuring `const { funcName } = require('...')`
- Update prompt templates to use {import_statement} placeholder

Includes unit tests for the new import generation logic.
2026-01-31 00:35:10 +00:00
Saurabh Misra
289827e5cb
Merge pull request #2337 from codeflash-ai/fix/improve-typescript-validation-error-messages
fix: improve TypeScript/JavaScript validation error messages
2026-01-30 16:03:02 -08:00
Saurabh Misra
d255a29203
Update django/aiservice/aiservice/validators/javascript_validator.py
Co-authored-by: claude[bot] <209825114+claude[bot]@users.noreply.github.com>
2026-01-30 16:00:05 -08:00
Saurabh Misra
8800614d1c Add unit tests for TypeScript/JavaScript validator error reporting
Tests for:
- Error location reporting with line numbers and code snippets
- Markdown code block parsing with various scenarios
- Multiple code blocks with mixed valid/invalid content
- Real-world TypeScript patterns (async, try-catch, template literals)

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 23:53:26 +00:00
Saurabh Misra
07ae9db684 fix: improve TypeScript/JavaScript validation error messages
Add better error diagnostics for TypeScript/JavaScript syntax validation:

- Add line numbers and code snippets to error messages
- Log warnings when markdown parsing finds no code blocks
- Show the actual problematic code in error logs
- Help debug "Invalid syntax" errors by showing exact location

This helps diagnose issues where the API rejects code that tree-sitter
parses correctly on the client side by providing more context in the
error messages and logs.

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
2026-01-30 23:47:39 +00:00
aseembits93
af2935f4f2 0-index finally 2026-01-30 14:42:28 -08:00
Kevin Turcios
c1a25b33e5
Merge branch 'main' into ranker-multidim-scoring 2026-01-30 22:16:08 +00:00
ali
99a7a32b32
safer caching 2026-01-30 19:52:18 +02:00
ali
879aa93967
fix validating js/ts code with markdown syntax 2026-01-30 19:44:31 +02:00
Kevin Turcios
0f5d578d37
Merge branch 'main' into ranker-multidim-scoring 2026-01-29 22:45:18 +00:00
HeshamHM28
c24f350719
Fix Prevent log code for paid org in the optimization feature "AI service " (#2325)
Fixes Cf-1038
2026-01-29 19:28:30 +00:00
Kevin Turcios
04197195e8
Store instrumented performance tests in feature logging (#2330)
## Summary
- Add `instrumented_perf_test` field to `OptimizationFeatures` model
- Update `log_features` function to accept and store performance
instrumented tests

---------

Co-authored-by: Sarthak Agarwal <sarthak.saga@gmail.com>
2026-01-29 03:09:47 -05:00
aseembits93
f1b6fbf737 adding back the instructions 2026-01-28 16:13:28 -08:00
aseembits93
7386dd20b5 1-indexed ranking everywhere 2026-01-28 16:02:24 -08:00
aseembits93
71d397753d Merge remote-tracking branch 'origin/main' into ranker-multidim-scoring 2026-01-28 15:36:30 -08:00
aseembits93
215e6ad390 fixed merge issues 2026-01-28 15:33:11 -08:00
ali
c19d9f4450
fix unit tests 2026-01-28 23:20:48 +02:00
ali
f0480fac39
use treesitter for validating js & ts code syntax 2026-01-28 23:15:30 +02:00
ali
db3f269b37
linting and formatting 2026-01-28 22:41:27 +02:00
ali
ec97ebd4e7
more cleanup 2026-01-28 22:23:54 +02:00
ali
31091350c9
cleanup 2026-01-28 22:19:40 +02:00
ali
dcac02b3f2
abstraction 2026-01-28 21:53:51 +02:00
Kevin Turcios
5ab169076c
Merge branch 'main' into multi-language 2026-01-28 08:19:52 -05:00
Kevin Turcios
8a66c78220
refactor: consolidate CST utilities and simplify add_missing_imports (#2324)
## Summary
- Consolidate shared CST utilities into `aiservice/common/cst_utils.py`
- Simplify `add_missing_imports` by removing redundant abstractions
- Require CST modules instead of strings in postprocessing pipeline
2026-01-28 08:05:45 -05:00
ali
d7e7125220
Merge branch 'multi-language' of github.com:codeflash-ai/codeflash-internal into multi-language 2026-01-28 13:41:55 +02:00
ali
557fb11939
remove jest globals check (client handles it now) 2026-01-28 13:41:06 +02:00
misrasaurabh1
b0a1d6c09f Remove instrumentation of js tests from aiservice and into client 2026-01-27 16:10:35 -08:00
ali
39de6a3bce
Merge branch 'multi-language' of github.com:codeflash-ai/codeflash-internal into multi-language 2026-01-27 23:30:11 +02:00
ali
72cb589948
some instrumentation fixes 2026-01-27 23:30:03 +02:00
Aseem Saxena
7f8ce66c6a multi dimensional scoring and structured json parsing for ranker 2026-01-27 12:24:16 -08:00
codeflash-ai[bot]
238de2b9c0
️ Speed up function _safe_replace_function_calls by 101% in PR #2247 (multi-language) (#2306)
## ️ This pull request contains optimizations for PR #2247
If you approve this dependent PR, these changes will be merged into the
original PR branch `multi-language`.
>This PR will be automatically closed if the original PR is merged.
----
#### 📄 101% (1.01x) speedup for ***`_safe_replace_function_calls` in
`django/aiservice/testgen/instrumentation/javascript/instrument_javascript.py`***

⏱️ Runtime : **`4.63 milliseconds`** **→** **`2.30 milliseconds`** (best
of `5` runs)

#### 📝 Explanation and details


This optimization achieves a **101% speedup** (reducing runtime from
4.63ms to 2.30ms) through three key performance improvements:

**1. Pre-compiled regex pattern (primary speedup driver)**
The original code calls `re.match(pattern, remaining)` inside the main
loop, which recompiles the pattern on every iteration. The line profiler
shows this operation took 3.92 seconds (33.1% of total time). The
optimized version compiles the pattern once upfront (`pattern_re =
re.compile(pattern)`) and reuses it via `pattern_re.match(source, i)`,
reducing this to 1.10 seconds (10% of total time) - a **72% reduction**
in pattern matching overhead.

**2. Eliminated substring creation for pattern matching**
The original creates a new substring `remaining = source[i:]` on every
loop iteration (5.68e8 ns, 4.8% of time). The optimized version uses
`pattern_re.match(source, i)` which matches directly from position `i`
without creating intermediate strings. This saves both allocation and
copying costs, particularly impactful given the loop runs 1.3-1.8
million times per execution.

**3. Fast-forward through comments using `str.find()`**
- **Single-line comments**: Changed from character-by-character loop
(`while i < length and source[i] != "\n"`) to `source.find("\n", i)`,
eliminating repeated indexing operations
- **Multi-line comments**: Changed from character-by-character scanning
to `source.find("*/", i)`, directly jumping to comment end

**Test case analysis:**
The optimization particularly excels on:
- **Large-scale scenarios**: The 500-iteration test shows 125% speedup
(1.24ms → 549μs), demonstrating excellent scalability
- **Comment-heavy code**: Tests with comments show 17-65% speedups as
`str.find()` efficiently skips entire comment blocks
- **Method call filtering**: Tests with `.foo()` patterns show 53-75%
speedups from reduced pattern matching overhead

All 45 test cases pass with improved or comparable performance,
validating that the optimization preserves correctness while delivering
consistent runtime improvements across diverse JavaScript code patterns.



 **Correctness verification report:**

| Test                        | Status            |
| --------------------------- | ----------------- |
| ⚙️ Existing Unit Tests | 🔘 **None Found** |
| 🌀 Generated Regression Tests |  **59 Passed** |
|  Replay Tests | 🔘 **None Found** |
| 🔎 Concolic Coverage Tests | 🔘 **None Found** |
|📊 Tests Coverage       | 100.0% |
<details>
<summary>🌀 Click to see Generated Regression Tests</summary>

```python
from __future__ import annotations

# imports
import re
from collections.abc import Callable

import pytest  # used for our unit tests

from testgen.instrumentation.javascript.instrument_javascript import (
    _safe_replace_function_calls,
)

# unit tests


# Helper replacement function used in many tests: wraps the matched call
def _wrap_with_capture(match: re.Match[str]) -> str:
    # return a wrapper that includes the exact matched text
    return f"codeflash.capture({match.group(0)})"


def test_basic_single_replacement():
    # Basic scenario: a single top-level function call should be replaced.
    src = "var x = foo(1);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 12.5μs -> 9.73μs (28.9% faster)


def test_string_literals_not_replaced():
    # Strings containing the function call should be left alone.
    src = "var s = 'foo(1)'; foo(2);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 14.5μs -> 10.5μs (38.5% faster)


def test_double_quoted_and_escaped_strings():
    # Double-quoted strings with escaped quotes should be preserved.
    src = 'var s = "he said \\"foo(3)\\""; foo(4);'
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 15.7μs -> 11.5μs (36.0% faster)


def test_template_literals_with_expressions_skipped():
    # Template literals should preserve content and skip replacement inside ${...}.
    src = "`start ${ { nested: foo(5) } } end` foo(6);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 12.8μs -> 12.0μs (6.23% faster)


def test_single_line_comment_skipped():
    # Calls inside single-line comments should not be replaced.
    src = " // foo(7)\nfoo(8);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 9.40μs -> 8.02μs (17.1% faster)


def test_multi_line_comment_skipped_and_unclosed_comment_handled():
    # Multi-line comments containing calls should be preserved.
    src = "/* comment foo(9) */ foo(10);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 9.87μs -> 8.17μs (20.9% faster)

    # Unclosed multi-line comment at EOF should not hang or replace inside it
    src_unclosed = "/* unclosed foo(11)"
    codeflash_output = _safe_replace_function_calls(
        src_unclosed, "foo", _wrap_with_capture, pattern
    )
    res_unclosed = codeflash_output  # 3.26μs -> 1.97μs (65.2% faster)


def test_method_call_dot_not_replaced():
    # Calls that are methods (preceded by a dot) should not be replaced.
    src = "obj.foo(12); foo(13);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 14.8μs -> 9.49μs (55.6% faster)


def test_already_wrapped_skipped():
    # If 'codeflash.capture' immediately precedes the call start index, replacement should be skipped.
    # This tests the specific check in the function that avoids double-wrapping.
    src = "codeflash.capturefoo(14);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 19.9μs -> 11.3μs (75.4% faster)


def test_template_literal_with_escaped_backticks_and_expressions():
    # Template literal containing escaped backticks and ${...} should be handled correctly.
    src = r"`backtick \` and ${ { foo: foo(15) } } end` foo(16);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 14.2μs -> 12.8μs (11.1% faster)


def test_overlapping_names_handling():
    # When function name appears adjacent to other letters like 'foofoo(17)',
    # ensure inner call is still found at the correct position.
    src = "foofoo(17); foo(18);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 11.3μs -> 7.92μs (43.2% faster)


def test_escaped_characters_in_regular_strings_do_not_break_scanning():
    # Strings that include escaped backslashes and escaped quotes should not break the scanner.
    src = '"line1\\nfoo(19)\\nline2" foo(20);'
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 10.9μs -> 9.15μs (19.1% faster)


def test_large_scale_multiple_replacements():
    # Large-scale test: many occurrences (but below 1000) to assess scalability.
    count = 500  # keep below 1000 as per instructions
    src_parts = [f"foo({i});" for i in range(count)]
    src = " ".join(src_parts)
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 1.24ms -> 549μs (125% faster)
    # Ensure that every occurrence was wrapped exactly once.
    wrapped_count = res.count("codeflash.capture(")


def test_no_false_positive_when_preceded_by_other_chars():
    # If the character before the function is not a dot but some other char, replacement should proceed.
    src = "afoo(21); .foo(22);"
    pattern = r"foo\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 13.3μs -> 8.96μs (48.5% faster)


def test_pattern_that_matches_functions_with_spaces():
    # Pattern that allows spaces before the opening parenthesis.
    src = "foo (23); foo(24);"
    pattern = r"foo\s*\([^\)]*\)"
    codeflash_output = _safe_replace_function_calls(
        src, "foo", _wrap_with_capture, pattern
    )
    res = codeflash_output  # 10.1μs -> 7.71μs (30.6% faster)


# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

```

```python
import re
from collections.abc import Callable

# imports
import pytest

from testgen.instrumentation.javascript.instrument_javascript import (
    _safe_replace_function_calls,
)


class TestBasicFunctionality:
    """Test cases for basic functionality of _safe_replace_function_calls."""

    def test_simple_function_call_replacement(self):
        """Test replacing a simple function call with a basic pattern."""
        source = "foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 5.38μs -> 5.00μs (7.58% faster)

    def test_function_call_with_arguments(self):
        """Test replacing function calls that have arguments."""
        source = "foo(x, y)"
        pattern = r"foo\([^)]*\)"
        replace_func = lambda m: "bar(x, y)"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 5.25μs -> 4.74μs (10.7% faster)

    def test_multiple_function_calls(self):
        """Test replacing multiple occurrences of the same function."""
        source = "foo(); foo(); foo();"
        pattern = r"foo\(\)"
        call_count = [0]

        def replace_func(m):
            call_count[0] += 1
            return f"replaced_{call_count[0]}()"

        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 12.2μs -> 8.70μs (40.3% faster)

    def test_empty_source(self):
        """Test with an empty source string."""
        source = ""
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 853ns -> 2.04μs (58.1% slower)

    def test_no_matches(self):
        """Test when no function calls match the pattern."""
        source = "let x = 5; y = 10;"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 14.3μs -> 7.73μs (84.8% faster)

    def test_function_call_in_middle_of_code(self):
        """Test replacing function call in the middle of other code."""
        source = "let result = foo(); console.log(result);"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 24.9μs -> 12.9μs (92.5% faster)

    def test_preserve_whitespace(self):
        """Test that whitespace around function calls is preserved."""
        source = "foo(  )  "
        pattern = r"foo\(\s*\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 6.61μs -> 5.85μs (12.8% faster)

    def test_function_call_in_single_quoted_string(self):
        """Test that function calls inside single-quoted strings are NOT replaced."""
        source = "'foo()' bar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.49μs -> 7.27μs (3.07% faster)

    def test_function_call_in_double_quoted_string(self):
        """Test that function calls inside double-quoted strings are NOT replaced."""
        source = '"foo()" bar()'
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.22μs -> 6.47μs (11.5% faster)

    def test_function_call_in_single_line_comment(self):
        """Test that function calls in single-line comments are NOT replaced."""
        source = "// foo() should not be replaced\nbar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 9.27μs -> 6.86μs (35.0% faster)

    def test_function_call_in_multiline_comment(self):
        """Test that function calls in multi-line comments are NOT replaced."""
        source = "/* foo() should not be replaced */ bar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 10.00μs -> 7.47μs (33.7% faster)


class TestEdgeCases:
    """Test cases for edge cases and unusual conditions."""

    def test_escaped_quote_in_string(self):
        """Test handling of escaped quotes inside strings."""
        source = r"'foo\'s bar' foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.99μs -> 7.41μs (7.81% faster)

    def test_escaped_backslash_in_string(self):
        """Test handling of escaped backslashes in strings."""
        source = r"'path\\to\\file' foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 8.41μs -> 7.76μs (8.36% faster)

    def test_method_call_not_replaced(self):
        """Test that method calls (preceded by dot) are NOT replaced."""
        source = "obj.foo() + foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 12.9μs -> 8.37μs (53.6% faster)

    def test_template_literal_simple(self):
        """Test handling of simple template literals."""
        source = "`foo()` bar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.53μs -> 6.86μs (9.73% faster)

    def test_template_literal_with_expression(self):
        """Test handling of template literals with embedded expressions."""
        source = "`Value: ${foo()}` bar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 9.50μs -> 9.01μs (5.51% faster)

    def test_template_literal_with_nested_braces(self):
        """Test handling of template literals with nested braces in expressions."""
        source = "`${obj.map(x => ({value: x}))}` foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 10.5μs -> 10.2μs (2.83% faster)

    def test_codeflash_capture_already_wrapped(self):
        """Test that already wrapped codeflash.capture calls are not double-wrapped."""
        source = "codeflash.capture(foo())"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 16.9μs -> 10.5μs (60.3% faster)

    def test_nested_quotes_in_string(self):
        """Test handling of different quote types inside strings."""
        source = """'string with "quotes"' foo()"""
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 8.46μs -> 8.08μs (4.66% faster)

    def test_multiline_comment_not_closed(self):
        """Test handling of multi-line comment that extends to end of source."""
        source = "foo() /* comment starts here"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 9.20μs -> 6.59μs (39.5% faster)

    def test_function_call_at_end_of_source(self):
        """Test function call replacement at the very end of source."""
        source = "foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "bar()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 4.28μs -> 4.24μs (0.730% faster)

    def test_adjacent_strings(self):
        """Test handling of adjacent string literals."""
        source = "'foo()' \"foo()\" foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 9.13μs -> 8.05μs (13.4% faster)

    def test_comment_after_function_call(self):
        """Test function call followed by a comment."""
        source = "foo() // this is foo\nbar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 10.5μs -> 8.23μs (27.1% faster)

    def test_escaped_character_in_comment(self):
        """Test comment handling with special characters."""
        source = "foo() // comment with \\ backslash\nbar()"
        pattern = r"foo\(\)|bar\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 11.4μs -> 8.42μs (35.6% faster)

    def test_regex_special_chars_in_string(self):
        """Test strings containing regex special characters."""
        source = r"'[a-z].*+?' foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.29μs -> 6.70μs (8.80% faster)

    def test_single_char_string(self):
        """Test handling of single character strings."""
        source = "'' 'a' foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 7.62μs -> 6.79μs (12.2% faster)

    def test_consecutive_escaped_quotes(self):
        """Test multiple consecutive escaped quotes."""
        source = r"'\"\"' foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 6.92μs -> 6.09μs (13.5% faster)

    def test_template_literal_escaped_backtick(self):
        """Test template literal with escaped backtick."""
        source = r"`escaped \` backtick` foo()"
        pattern = r"foo\(\)"
        replace_func = lambda m: "replaced()"
        codeflash_output = _safe_replace_function_calls(
            source, "foo", replace_func, pattern
        )
        result = codeflash_output  # 9.10μs -> 8.80μs (3.43% faster)

```

</details>


To edit these changes `git checkout
codeflash/optimize-pr2247-2026-01-25T08.38.25` and push.


[![Codeflash](https://img.shields.io/badge/Optimized%20with-Codeflash-yellow?style=flat&color=%23ffc428&logo=data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iNDgwIiBoZWlnaHQ9ImF1dG8iIHZpZXdCb3g9IjAgMCA0ODAgMjgwIiBmaWxsPSJub25lIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPgo8cGF0aCBmaWxsLXJ1bGU9ImV2ZW5vZGQiIGNsaXAtcnVsZT0iZXZlbm9kZCIgZD0iTTI4Ni43IDAuMzc4NDE4SDIwMS43NTFMNTAuOTAxIDE0OC45MTFIMTM1Ljg1MUwwLjk2MDkzOCAyODEuOTk5SDk1LjQzNTJMMjgyLjMyNCA4OS45NjE2SDE5Ni4zNDVMMjg2LjcgMC4zNzg0MThaIiBmaWxsPSIjRkZDMDQzIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMzExLjYwNyAwLjM3ODkwNkwyNTguNTc4IDU0Ljk1MjZIMzc5LjU2N0w0MzIuMzM5IDAuMzc4OTA2SDMxMS42MDdaIiBmaWxsPSIjMEIwQTBBIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMzA5LjU0NyA4OS45NjAxTDI1Ni41MTggMTQ0LjI3NkgzNzcuNTA2TDQzMC4wMjEgODkuNzAyNkgzMDkuNTQ3Vjg5Ljk2MDFaIiBmaWxsPSIjMEIwQTBBIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMjQyLjg3MyAxNjQuNjZMMTg5Ljg0NCAyMTkuMjM0SDMxMC44MzNMMzYzLjM0NyAxNjQuNjZIMjQyLjg3M1oiIGZpbGw9IiMwQjBBMEEiLz4KPC9zdmc+Cg==)](https://codeflash.ai)
![Static
Badge](https://img.shields.io/badge/🎯_Optimization_Quality-high-green)

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
Co-authored-by: Kevin Turcios <106575910+KRRT7@users.noreply.github.com>
2026-01-27 00:28:03 -05:00
codeflash-ai[bot]
64b0bb74e0
️ Speed up function _has_test_functions by 18% in PR #2247 (multi-language) (#2308)
## ️ This pull request contains optimizations for PR #2247
If you approve this dependent PR, these changes will be merged into the
original PR branch `multi-language`.
>This PR will be automatically closed if the original PR is merged.
----
#### 📄 18% (0.18x) speedup for ***`_has_test_functions` in
`django/aiservice/testgen/testgen_javascript.py`***

⏱️ Runtime : **`740 microseconds`** **→** **`627 microseconds`** (best
of `76` runs)

#### 📝 Explanation and details


The optimized code achieves an **18% runtime improvement** by
eliminating repeated regex compilation overhead.

**Key optimization:**
- **Precompiled regex pattern**: The pattern
`r"(?:test|it)\s*\(\s*['\"]"` is compiled once at module load time into
`_TEST_FUNC_RE`, rather than being recompiled on every function call. In
Python, `re.search()` with a raw string pattern incurs compilation cost
each time it's invoked.

**Performance breakdown from line profiler:**
- Original: 2.70ms spent in `re.search(test_pattern, code)` (96.5% of
total time)
- Optimized: 862μs for the direct pattern search (100% of total time,
but 3.1x faster overall)
- The pattern string assignment overhead (97μs in original) is
eliminated entirely

**Why this matters for the workload:**
Based on `function_references`, this function is called from
`parse_and_validate_js_output()` during LLM response validation. This is
a **hot path** operation that executes on every test generation request.
The validation flow checks multiple conditions including syntax
validation before checking for test functions, meaning this function
runs repeatedly during normal operations.

**Test case performance:**
- **Small inputs** (single test functions): 50-80% faster (e.g., 2.80μs
→ 1.83μs)
- **Empty/minimal strings**: 130-140% faster (e.g., 1.80μs → 750ns) 
- **Large inputs** (500-1000 lines): 1-8% faster depending on match
location
- **Early matches** benefit most since regex short-circuits on first
match

The optimization is most effective when processing typical-sized
JavaScript test code (dozens to hundreds of lines), which aligns with
the common use case of validating LLM-generated test functions.



 **Correctness verification report:**

| Test                        | Status            |
| --------------------------- | ----------------- |
| ⚙️ Existing Unit Tests | 🔘 **None Found** |
| 🌀 Generated Regression Tests |  **102 Passed** |
|  Replay Tests | 🔘 **None Found** |
| 🔎 Concolic Coverage Tests | 🔘 **None Found** |
|📊 Tests Coverage       | 100.0% |
<details>
<summary>🌀 Click to see Generated Regression Tests</summary>

```python
from __future__ import annotations

import re

# imports
import pytest  # used for our unit tests

from testgen.testgen_javascript import _has_test_functions


def test_basic_test_call_double_quotes():
    # Basic: a standard Jest test call using double quotes should be detected.
    code = 'test("my test name", () => { expect(true).toBe(true); });'
    codeflash_output = _has_test_functions(code)  # 2.83μs -> 1.78μs (59.3% faster)


def test_basic_it_call_single_quotes():
    # Basic: a standard it() call using single quotes should be detected.
    code = "it('does something', function() { /* ... */ });"
    codeflash_output = _has_test_functions(code)  # 2.80μs -> 1.83μs (53.0% faster)


def test_whitespace_and_newlines_between_name_and_paren():
    # Edge: whitespace/newlines between the function name and '(' and between '(' and the quote
    # The regex allows arbitrary whitespace, so this should still match.
    code = "it   \n (\n  'handles newlines'\n )"
    codeflash_output = _has_test_functions(code)  # 2.90μs -> 1.89μs (53.2% faster)


def test_empty_string_returns_false():
    # Edge: empty input must return False (no tests found).
    code = ""
    codeflash_output = _has_test_functions(code)  # 1.80μs -> 750ns (139% faster)


def test_uppercase_function_name_not_matched():
    # Edge: the regex is case-sensitive; 'Test' should NOT match.
    code = "Test('capitalized should not match', () => {});"
    codeflash_output = _has_test_functions(code)  # 3.20μs -> 2.23μs (43.7% faster)


def test_backtick_template_not_matched():
    # Edge: template literals use backticks; pattern looks only for single/double quotes.
    code = "test(`template literal name`, () => {});"
    codeflash_output = _has_test_functions(code)  # 3.34μs -> 2.27μs (47.0% faster)


def test_numeric_first_arg_not_matched():
    # Edge: if the first argument is not a quoted string (e.g., a number), pattern should not match.
    code = "test(123, () => {});"
    codeflash_output = _has_test_functions(code)  # 3.01μs -> 1.86μs (61.7% faster)


def test_test_call_inside_comment_still_matches():
    # Important behavioral note: the function does not ignore comments.
    # A 'test(' occurrence inside a JS comment still matches because the function only does regex search.
    code = "// test('in a single-line comment')\n/* test(\"in block comment\") */"
    # Both comment forms contain test('...') / test("...") which the regex will find.
    codeflash_output = _has_test_functions(code)  # 2.96μs -> 1.84μs (61.1% faster)


def test_substring_in_identifier_matches():
    # The regex is permissive and will match occurrences where 'test' or 'it' appear as suffixes
    # of other identifiers (e.g., 'latesttest(' or 'split('). This test documents that behavior.
    code_latest = "function latesttest(){}\nlatesttest('x')"
    code_split = "const arr = ['a']; arr.split('a');"
    # Both contain the substring "test('..." or "it('...", so they should be considered matches by the implementation.
    codeflash_output = _has_test_functions(
        code_latest
    )  # 3.77μs -> 2.59μs (45.5% faster)
    codeflash_output = _has_test_functions(code_split)  # 1.16μs -> 793ns (46.7% faster)


def test_comment_between_paren_blocks_prevents_match():
    # If there is a non-whitespace token (like a block comment) between '(' and the starting quote,
    # the current regex will not match because it expects only whitespace between '(' and the quote.
    code = "test(/* important note */ 'name in comment')"
    codeflash_output = _has_test_functions(code)  # 3.31μs -> 2.30μs (43.5% faster)


def test_multiple_test_and_it_occurrences():
    # A file with multiple matches should still return True (boolean).
    code = """
    describe('suite', () => {
      it('first case', () => {});
      // some other code
      test("second case", () => {});
    });
    """
    codeflash_output = _has_test_functions(code)  # 3.34μs -> 2.16μs (54.9% faster)


def test_large_scale_no_match_performance():
    # Large-scale: many lines without any test/it(...) occurrences should return False.
    # Keep size under 1000 to respect constraints. We use 900 repeated lines.
    repeated = "const filler = 0;\n" * 900  # 900 lines of filler
    codeflash_output = _has_test_functions(repeated)  # 76.6μs -> 75.5μs (1.47% faster)


def test_large_scale_match_near_end():
    # Large-scale: many lines of filler followed by a single test at the end should return True.
    # This ensures the search scans through large input and finds a late occurrence.
    repeated = "const filler = 0;\n" * 900  # 900 lines of filler
    code = repeated + "  // real test follows\n  test('final case', () => {});"
    codeflash_output = _has_test_functions(code)  # 77.5μs -> 76.3μs (1.59% faster)


def test_it_with_newline_between_name_and_paren():
    # Verify that a newline immediately after 'it' and before '(' is allowed by the regex (\s* covers newline).
    code = "it\n('newline-allowed')"
    codeflash_output = _has_test_functions(code)  # 3.22μs -> 1.94μs (66.0% faster)


def test_quoted_string_with_escaped_quotes_still_matches():
    # Even if the string contains escaped quotes, the regex only checks the opening quote, so it should match.
    code = r'test("contains an escaped quote: \" here", () => {});'
    codeflash_output = _has_test_functions(code)  # 2.90μs -> 1.83μs (58.2% faster)


# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

```

```python
import re

import pytest

from testgen.testgen_javascript import _has_test_functions


def test_simple_test_function_with_single_quotes():
    """Test detection of test() function with single quotes."""
    code = "test('should work', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.78μs -> 2.29μs (65.1% faster)


def test_simple_it_function_with_single_quotes():
    """Test detection of it() function with single quotes."""
    code = "it('should work', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.10μs -> 1.99μs (56.0% faster)


def test_simple_test_function_with_double_quotes():
    """Test detection of test() function with double quotes."""
    code = 'test("should work", () => {})'
    codeflash_output = _has_test_functions(code)  # 3.07μs -> 1.89μs (62.8% faster)


def test_simple_it_function_with_double_quotes():
    """Test detection of it() function with double quotes."""
    code = 'it("should work", () => {})'
    codeflash_output = _has_test_functions(code)  # 3.13μs -> 1.86μs (68.2% faster)


def test_no_test_functions():
    """Test code without any test functions returns False."""
    code = "function myFunction() { return 42; }"
    codeflash_output = _has_test_functions(code)  # 3.08μs -> 1.85μs (66.5% faster)


def test_test_function_with_single_whitespace():
    """Test detection with single space between function name and parenthesis."""
    code = "test ('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.20μs -> 1.96μs (63.0% faster)


def test_it_function_with_single_whitespace():
    """Test detection with single space between function name and parenthesis."""
    code = "it ('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.98μs -> 1.88μs (58.5% faster)


def test_test_function_with_multiple_whitespaces():
    """Test detection with multiple spaces between function name and parenthesis."""
    code = "test   ('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.04μs -> 1.95μs (55.5% faster)


def test_it_function_with_multiple_whitespaces():
    """Test detection with multiple spaces between function name and parenthesis."""
    code = "it   ('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.06μs -> 1.89μs (62.1% faster)


def test_test_function_with_tab_character():
    """Test detection with tab character between function name and parenthesis."""
    code = "test\t('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.05μs -> 1.93μs (57.8% faster)


def test_it_function_with_tab_character():
    """Test detection with tab character between function name and parenthesis."""
    code = "it\t('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.99μs -> 1.85μs (61.6% faster)


def test_test_function_with_newline():
    """Test detection with newline between function name and parenthesis."""
    code = "test\n('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.02μs -> 1.93μs (56.5% faster)


def test_it_function_with_newline():
    """Test detection with newline between function name and parenthesis."""
    code = "it\n('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.95μs -> 1.90μs (55.4% faster)


def test_multiple_test_functions():
    """Test detection with multiple test functions in the same code."""
    code = """
    test('first test', () => {});
    it('second test', () => {});
    """
    codeflash_output = _has_test_functions(code)  # 3.12μs -> 1.92μs (62.3% faster)


def test_test_function_in_multiline_code():
    """Test detection of test function within multiline code."""
    code = """
    const helper = () => {};
    test('actual test', () => {});
    const another = () => {};
    """
    codeflash_output = _has_test_functions(code)  # 3.19μs -> 2.12μs (50.1% faster)


def test_it_function_in_multiline_code():
    """Test detection of it function within multiline code."""
    code = """
    const helper = () => {};
    it('actual test', () => {});
    const another = () => {};
    """
    codeflash_output = _has_test_functions(code)  # 3.46μs -> 2.23μs (55.1% faster)


def test_test_word_in_comment_not_matched():
    """Test that test() in comments is still detected by regex (no comment parsing)."""
    code = "// test('in comment', () => {})"
    # Note: The function uses regex without comment awareness, so it will match
    codeflash_output = _has_test_functions(code)  # 3.10μs -> 1.94μs (60.2% faster)


def test_test_word_in_string_variable():
    """Test that test word in string variable doesn't match pattern."""
    code = 'const description = "this is a test of something";'
    codeflash_output = _has_test_functions(code)  # 3.56μs -> 2.29μs (55.3% faster)


def test_test_as_variable_name_not_matched():
    """Test that 'test' as variable name doesn't match without parenthesis."""
    code = "const test = 5;"
    codeflash_output = _has_test_functions(code)  # 3.11μs -> 2.03μs (53.1% faster)


def test_testing_as_word_not_matched():
    """Test that 'testing' word doesn't match."""
    code = "const testing = 'some value';"
    codeflash_output = _has_test_functions(code)  # 3.17μs -> 2.04μs (55.4% faster)


def test_it_as_pronoun_not_matched():
    """Test that 'it' as pronoun doesn't match without proper pattern."""
    code = "// it is a good day"
    codeflash_output = _has_test_functions(code)  # 3.15μs -> 1.98μs (59.1% faster)


def test_it_as_variable_not_matched():
    """Test that 'it' as variable name doesn't match without parenthesis."""
    code = "const it = 5;"
    codeflash_output = _has_test_functions(code)  # 3.13μs -> 1.92μs (63.7% faster)


def test_empty_string():
    """Test with empty string input."""
    codeflash_output = _has_test_functions("")  # 1.74μs -> 757ns (129% faster)


def test_only_whitespace():
    """Test with only whitespace."""
    codeflash_output = _has_test_functions("   \n\t  ")  # 2.05μs -> 890ns (131% faster)


def test_test_function_with_special_test_name():
    """Test detection with special characters in test name."""
    code = "test('test-name_123!@#', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.95μs -> 1.90μs (55.1% faster)


def test_it_function_with_special_test_name():
    """Test detection with special characters in test name."""
    code = "it('it-name_123!@#', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.04μs -> 1.88μs (62.0% faster)


def test_test_function_with_empty_string_name():
    """Test detection with empty string as test name."""
    code = "test('', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.90μs -> 1.84μs (58.0% faster)


def test_it_function_with_empty_string_name():
    """Test detection with empty string as test name."""
    code = "it('', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.97μs -> 1.86μs (59.7% faster)


def test_test_with_carriage_return():
    """Test detection with carriage return character."""
    code = "test\r('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.02μs -> 1.89μs (60.2% faster)


def test_it_with_carriage_return():
    """Test detection with carriage return character."""
    code = "it\r('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.12μs -> 1.91μs (63.8% faster)


def test_test_with_form_feed():
    """Test detection with form feed character."""
    code = "test\f('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.98μs -> 1.85μs (60.7% faster)


def test_it_with_form_feed():
    """Test detection with form feed character."""
    code = "it\f('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.13μs -> 1.91μs (64.4% faster)


def test_test_with_vertical_tab():
    """Test detection with vertical tab character."""
    code = "test\v('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.08μs -> 1.87μs (65.0% faster)


def test_it_with_vertical_tab():
    """Test detection with vertical tab character."""
    code = "it\v('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.13μs -> 1.83μs (71.4% faster)


def test_test_with_non_breaking_space():
    """Test that non-breaking space might not work depending on whitespace regex."""
    code = "test\u00a0('my test', () => {})"
    # Non-breaking space might not be treated as \s in regex
    codeflash_output = _has_test_functions(code)
    result = codeflash_output  # 3.25μs -> 2.07μs (56.6% faster)


def test_test_with_zero_width_space():
    """Test with zero-width space."""
    code = "test\u200b('my test', () => {})"
    codeflash_output = _has_test_functions(code)
    result = codeflash_output  # 3.82μs -> 2.74μs (39.4% faster)


def test_only_test_keyword():
    """Test with only the word 'test' without parenthesis."""
    code = "test"
    codeflash_output = _has_test_functions(code)  # 2.85μs -> 1.83μs (55.3% faster)


def test_only_it_keyword():
    """Test with only the word 'it' without parenthesis."""
    code = "it"
    codeflash_output = _has_test_functions(code)  # 1.89μs -> 793ns (138% faster)


def test_test_with_parenthesis_but_no_quote():
    """Test function call without string argument."""
    code = "test(variable)"
    codeflash_output = _has_test_functions(code)  # 3.39μs -> 2.12μs (59.8% faster)


def test_it_with_parenthesis_but_no_quote():
    """Test it function call without string argument."""
    code = "it(variable)"
    codeflash_output = _has_test_functions(code)  # 3.24μs -> 2.07μs (56.5% faster)


def test_test_followed_by_string_literal_without_parenthesis():
    """Test with string literal but missing parenthesis."""
    code = "test 'string'"
    codeflash_output = _has_test_functions(code)  # 3.17μs -> 1.93μs (64.5% faster)


def test_it_followed_by_string_literal_without_parenthesis():
    """Test with string literal but missing parenthesis."""
    code = "it 'string'"
    codeflash_output = _has_test_functions(code)  # 3.24μs -> 2.02μs (59.9% faster)


def test_test_with_backtick_quotes():
    """Test with backtick quotes (template literals)."""
    code = "test(`my test`, () => {})"
    codeflash_output = _has_test_functions(code)  # 3.49μs -> 2.34μs (49.5% faster)


def test_it_with_backtick_quotes():
    """Test it with backtick quotes (template literals)."""
    code = "it(`my test`, () => {})"
    codeflash_output = _has_test_functions(code)  # 3.38μs -> 2.23μs (51.4% faster)


def test_describe_function_not_matched():
    """Test that describe() function is not matched."""
    code = "describe('suite', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.13μs -> 1.94μs (61.2% faster)


def test_beforeEach_function_not_matched():
    """Test that beforeEach() function is not matched."""
    code = "beforeEach(() => {})"
    codeflash_output = _has_test_functions(code)  # 1.98μs -> 901ns (119% faster)


def test_afterEach_function_not_matched():
    """Test that afterEach() function is not matched."""
    code = "afterEach(() => {})"
    codeflash_output = _has_test_functions(code)  # 2.68μs -> 1.50μs (78.6% faster)


def test_test_method_on_object():
    """Test with test as method call on object."""
    code = "obj.test('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.12μs -> 1.94μs (61.4% faster)


def test_it_method_on_object():
    """Test with it as method call on object."""
    code = "obj.it('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.10μs -> 2.00μs (55.0% faster)


def test_test_substring_in_longer_identifier():
    """Test when test is part of longer identifier."""
    code = "mytest('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.00μs -> 1.90μs (58.0% faster)


def test_it_substring_in_longer_identifier():
    """Test when it is part of longer identifier."""
    code = "unit('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.07μs -> 1.94μs (58.2% faster)


def test_test_with_unicode_test_name():
    """Test detection with unicode characters in test name."""
    code = "test('\u4e2d\u6587\u6d4b\u8bd5', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.57μs -> 2.31μs (54.4% faster)


def test_it_with_unicode_test_name():
    """Test detection with unicode characters in test name."""
    code = "it('\u4e2d\u6587\u6d4b\u8bd5', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.09μs -> 2.08μs (48.8% faster)


def test_test_with_emoji():
    """Test detection with emoji in test name."""
    code = "test('\u263a emoji test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.14μs -> 2.02μs (55.2% faster)


def test_it_with_emoji():
    """Test detection with emoji in test name."""
    code = "it('\u263a emoji test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.09μs -> 2.00μs (54.0% faster)


def test_very_long_test_name():
    """Test detection with very long test name."""
    long_name = "a" * 5000
    code = f"test('{long_name}', () => {{}})"
    codeflash_output = _has_test_functions(code)  # 3.00μs -> 1.77μs (69.5% faster)


def test_very_long_code_without_tests():
    """Test with very long code but no test functions."""
    code = "const x = 1;\n" * 500
    codeflash_output = _has_test_functions(code)  # 29.7μs -> 28.5μs (4.29% faster)


def test_test_with_escaped_quote():
    """Test with escaped quote in test name."""
    code = "test('test\\'s name', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.15μs -> 2.03μs (55.2% faster)


def test_it_with_escaped_quote():
    """Test with escaped quote in test name."""
    code = "it('it\\'s name', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.97μs -> 1.87μs (59.0% faster)


def test_test_with_double_quote_in_single_quote():
    """Test with double quote inside single quoted test name."""
    code = "test('has \"double\" quotes', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.93μs -> 1.82μs (60.7% faster)


def test_it_with_double_quote_in_single_quote():
    """Test with double quote inside single quoted test name."""
    code = "it('has \"double\" quotes', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.97μs -> 1.83μs (62.6% faster)


def test_test_with_single_quote_in_double_quote():
    """Test with single quote inside double quoted test name."""
    code = "test(\"has 'single' quotes\", () => {})"
    codeflash_output = _has_test_functions(code)  # 3.00μs -> 1.90μs (57.8% faster)


def test_it_with_single_quote_in_double_quote():
    """Test with single quote inside double quoted test name."""
    code = "it(\"has 'single' quotes\", () => {})"
    codeflash_output = _has_test_functions(code)  # 2.97μs -> 1.78μs (66.7% faster)


def test_test_case_sensitive():
    """Test that TEST (uppercase) is not matched."""
    code = "TEST('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.03μs -> 1.81μs (67.1% faster)


def test_it_case_sensitive():
    """Test that IT (uppercase) is not matched."""
    code = "IT('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.81μs -> 1.84μs (52.8% faster)


def test_test_with_mixed_case():
    """Test that TeSt (mixed case) is not matched."""
    code = "TeSt('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 3.05μs -> 1.85μs (65.2% faster)


def test_it_with_mixed_case():
    """Test that It (mixed case) is not matched."""
    code = "It('my test', () => {})"
    codeflash_output = _has_test_functions(code)  # 2.94μs -> 1.91μs (54.3% faster)


def test_code_with_many_non_test_functions():
    """Test performance with many non-test functions."""
    # Build code with 500 non-test function definitions
    code_lines = [f"function func{i}() {{ return {i}; }}" for i in range(500)]
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 77.0μs -> 75.9μs (1.52% faster)


def test_code_with_many_functions_and_one_test():
    """Test detection of single test among many non-test functions."""
    # Build code with 500 non-test functions and 1 test function
    code_lines = [f"function func{i}() {{ return {i}; }}" for i in range(250)]
    code_lines.append("test('the actual test', () => {})")
    code_lines.extend(
        [f"function func{i}() {{ return {i}; }}" for i in range(250, 500)]
    )
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 40.3μs -> 38.9μs (3.41% faster)


def test_code_with_many_test_functions():
    """Test detection with many test functions."""
    # Build code with 100 test functions
    code_lines = [f"test('test {i}', () => {{}})" for i in range(100)]
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 2.94μs -> 1.74μs (69.4% faster)


def test_code_with_many_it_functions():
    """Test detection with many it functions."""
    # Build code with 100 it functions
    code_lines = [f"it('test {i}', () => {{}})" for i in range(100)]
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 2.91μs -> 1.61μs (80.4% faster)


def test_code_with_alternating_test_and_it_functions():
    """Test detection with alternating test and it functions."""
    # Build code with 100 alternating test and it functions
    code_lines = []
    for i in range(50):
        code_lines.append(f"test('test {i}', () => {{}})")
        code_lines.append(f"it('it {i}', () => {{}})")
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 2.93μs -> 1.64μs (78.7% faster)


def test_code_with_many_non_matching_similar_patterns():
    """Test performance with many similar but non-matching patterns."""
    # Build code with 500 similar patterns that don't match
    code_lines = [f"test{i}('name', () => {{}})" for i in range(500)]
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 64.3μs -> 62.9μs (2.27% faster)


def test_large_code_with_test_at_end():
    """Test detection when test function is at end of large code."""
    # Build code with 500 lines and test at the end
    code_lines = [f"const var{i} = {i};" for i in range(500)]
    code_lines.append("test('test at end', () => {})")
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 41.4μs -> 40.3μs (2.65% faster)


def test_large_code_with_it_at_end():
    """Test detection when it function is at end of large code."""
    # Build code with 500 lines and it at the end
    code_lines = [f"const var{i} = {i};" for i in range(500)]
    code_lines.append("it('it at end', () => {})")
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 41.5μs -> 40.3μs (3.00% faster)


def test_large_code_with_test_at_beginning():
    """Test detection when test function is at beginning of large code."""
    # Build code with test at beginning and 500 lines after
    code_lines = ["test('test at beginning', () => {})"]
    code_lines.extend([f"const var{i} = {i};" for i in range(500)])
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 3.09μs -> 1.90μs (62.2% faster)


def test_large_code_with_it_at_beginning():
    """Test detection when it function is at beginning of large code."""
    # Build code with it at beginning and 500 lines after
    code_lines = ["it('it at beginning', () => {})"]
    code_lines.extend([f"const var{i} = {i};" for i in range(500)])
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 3.02μs -> 1.81μs (67.1% faster)


def test_code_with_multiple_tests_scattered():
    """Test detection with multiple test functions scattered throughout large code."""
    # Build code with 20 test functions scattered among 480 non-test lines
    code_lines = []
    for i in range(500):
        if i % 25 == 0:
            code_lines.append(f"test('scattered test {i}', () => {{}})")
        else:
            code_lines.append(f"const var{i} = {i};")
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 2.86μs -> 1.79μs (59.8% faster)


def test_code_with_very_large_test_name():
    """Test performance with very long test name."""
    # Create a test with name of 10000 characters
    long_name = "x" * 10000
    code = f"test('{long_name}', () => {{}})"
    codeflash_output = _has_test_functions(code)  # 2.99μs -> 1.84μs (62.1% faster)


def test_code_with_deeply_nested_structures():
    """Test detection in deeply nested code structures."""
    # Build nested structure with test at bottom
    code = "const nested = { level1: { level2: { level3: { level4: { " * 50
    code += "test('nested test', () => {})"
    code += " } } } } };" * 50
    codeflash_output = _has_test_functions(code)  # 14.6μs -> 13.5μs (8.56% faster)


def test_code_with_many_whitespace_variations():
    """Test detection with many different whitespace patterns."""
    code_lines = []
    for i in range(100):
        if i % 4 == 0:
            code_lines.append(f"test('test {i}', () => {{}})")
        elif i % 4 == 1:
            code_lines.append(f"test ('test {i}', () => {{}})")
        elif i % 4 == 2:
            code_lines.append(f"test  ('test {i}', () => {{}})")
        else:
            code_lines.append(f"test\t('test {i}', () => {{}})")
    code = "\n".join(code_lines)
    codeflash_output = _has_test_functions(code)  # 2.92μs -> 1.71μs (70.7% faster)


def test_code_return_type_is_boolean():
    """Test that return value is always boolean regardless of input size."""
    # Various test inputs
    test_inputs = [
        "",
        "test",
        "test('name', () => {})",
        "const x = 1;" * 100,
        "test('name', () => {})" + "const x = 1;" * 100,
    ]
    for test_input in test_inputs:
        codeflash_output = _has_test_functions(test_input)
        result = codeflash_output  # 11.9μs -> 9.34μs (27.0% faster)


# codeflash_output is used to check that the output of the original code is the same as that of the optimized code.

```

</details>


To edit these changes `git checkout
codeflash/optimize-pr2247-2026-01-25T08.57.25` and push.


[![Codeflash](https://img.shields.io/badge/Optimized%20with-Codeflash-yellow?style=flat&color=%23ffc428&logo=data:image/svg+xml;base64,PHN2ZyB3aWR0aD0iNDgwIiBoZWlnaHQ9ImF1dG8iIHZpZXdCb3g9IjAgMCA0ODAgMjgwIiBmaWxsPSJub25lIiB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciPgo8cGF0aCBmaWxsLXJ1bGU9ImV2ZW5vZGQiIGNsaXAtcnVsZT0iZXZlbm9kZCIgZD0iTTI4Ni43IDAuMzc4NDE4SDIwMS43NTFMNTAuOTAxIDE0OC45MTFIMTM1Ljg1MUwwLjk2MDkzOCAyODEuOTk5SDk1LjQzNTJMMjgyLjMyNCA4OS45NjE2SDE5Ni4zNDVMMjg2LjcgMC4zNzg0MThaIiBmaWxsPSIjRkZDMDQzIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMzExLjYwNyAwLjM3ODkwNkwyNTguNTc4IDU0Ljk1MjZIMzc5LjU2N0w0MzIuMzM5IDAuMzc4OTA2SDMxMS42MDdaIiBmaWxsPSIjMEIwQTBBIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMzA5LjU0NyA4OS45NjAxTDI1Ni41MTggMTQ0LjI3NkgzNzcuNTA2TDQzMC4wMjEgODkuNzAyNkgzMDkuNTQ3Vjg5Ljk2MDFaIiBmaWxsPSIjMEIwQTBBIi8+CjxwYXRoIGZpbGwtcnVsZT0iZXZlbm9kZCIgY2xpcC1ydWxlPSJldmVub2RkIiBkPSJNMjQyLjg3MyAxNjQuNjZMMTg5Ljg0NCAyMTkuMjM0SDMxMC44MzNMMzYzLjM0NyAxNjQuNjZIMjQyLjg3M1oiIGZpbGw9IiMwQjBBMEEiLz4KPC9zdmc+Cg==)](https://codeflash.ai)
![Static
Badge](https://img.shields.io/badge/🎯_Optimization_Quality-high-green)

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
Co-authored-by: Kevin Turcios <106575910+KRRT7@users.noreply.github.com>
2026-01-27 00:14:02 -05:00
Kevin Turcios
bf1d214ece format 2026-01-27 00:00:49 -05:00
Kevin Turcios
764a3f8899
Merge branch 'main' into multi-language 2026-01-26 23:59:03 -05:00
Kevin Turcios
0444e32f77
fix: CST tree handling and testgen pipeline improvements (#2310)
## Summary
- Fix CST tree corruption issues that caused 'NoneType' object has no
attribute 'visit' errors
- Consolidate testgen postprocessing into a single pipeline with
tuple-based pattern
- Improve markdown code extraction to prefer filepath-annotated blocks
- Add diagnostic context to optimization failure logs

## Changes
- Handle empty `SimpleStatementLine` and `StatementHandler` body to
prevent malformed CST
- Add trace_id logging to optimization and import failure paths
- Refactor testgen postprocessing into consolidated pipeline
- Fix code extraction for LLM responses with multiple code blocks

## Test plan
- [x] Added integration tests for full testgen pipeline
- [x] Added tests for markdown extraction with filepath preference
- [x] Existing tests pass

---------

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
2026-01-26 23:57:55 -05:00
ali
580da4cc8e
fix unit tests 2026-01-26 19:57:38 +02:00
ali
e559b6b9b4
handle repair multi-language 2026-01-26 16:41:58 +02:00
ali
7a8661ae31
fix line profiler optimizations 2026-01-26 15:19:24 +02:00
Kevin Turcios
80be7d2dae fix: resolve type errors and linting issues
- Add language parameter to split_markdown_code and group_code for JS/TS support
- Fix callable type annotation in instrument_javascript.py
- Update testgen_javascript.py to use ChatCompletionMessageParam types
- Add None checks before parse_python_version calls
- Add missing None assertions in test files
- Apply ruff auto-fixes for formatting and unused imports
2026-01-25 01:17:44 -05:00
Kevin Turcios
549f99eb97 formatting 2026-01-25 01:04:50 -05:00
Kevin Turcios
f04ac67728 merge: resolve conflicts with main
- Accept consolidated markdown utilities from common module
- Use wrap_code_in_markdown with language parameter for language support
- Remove duplicate split_markdown_code implementation
- Add validation for python_version before parsing
2026-01-25 01:00:56 -05:00
Kevin Turcios
d091cd17fe
refactor: consolidate shared utilities (#2289)
- Consolidate shared utilities into `aiservice/common/`
- Remove profanity filtering from optimization pipeline
- Add unit tests for common module
2026-01-25 00:49:48 -05:00
Kevin Turcios
df9bef20c4
refactor: remove profanity filtering from optimization pipeline (#2288)
## Summary
- Removes `profanity_regex` and `profanity_words` from
`postprocess_constants.py`
- Removes `remove_profanity_from_explanation` from the optimization
pipeline
- Removes associated test
2026-01-24 23:46:03 -05:00
Kevin Turcios
3a304aa00f
Merge branch 'main' into multi-language 2026-01-24 17:42:16 -05:00
Kevin Turcios
7aa4da74f8
feat: improve testgen import resolution and forward reference handling (#2276)
## Summary
- Add forward reference detection and automatic fix with `from
__future__ import annotations`
- Handle aliased imports and chained calls in test instrumentation
- Fix import resolution from correct module in multi-context testgen
- Allow ellipsis in Protocol/abstract method bodies
- Add dataclass constructor notes for LLM about required/positional
arguments
- Add logging to silent exception handlers

## Test plan
- [x] Unit tests added for forward reference detection
- [x] Unit tests added for dataclass constructor notes
- [x] Unit tests added for ellipsis handling in AST
- [x] Unit tests added for chained call instrumentation
- [x] Unit tests extended for add_missing_imports

---------

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
2026-01-24 17:39:38 -05:00
Aseem Saxena
b62c9dceab api veersion 2026-01-23 08:55:05 -08:00
Kevin Turcios
f534926059
more context for asyncs (#2273)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-23 05:25:24 -05:00
Kevin Turcios
8c75f238ed
Fix async crashes (#2270)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-23 00:51:35 -05:00
Kevin Turcios
2d8da15932
Acceptance reason in explanation for async (#2207)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-22 22:28:44 -05:00
Sarthak Agarwal
b095f5d9b8
Merge branch 'main' into multi-language 2026-01-23 03:34:13 +05:30
ali
7f795cf941
remove capture perf looped 2026-01-21 21:49:42 +02:00
Sarthak Agarwal
47ecce6e4f fix refinment add language param 2026-01-21 22:09:58 +05:30
Kevin Turcios
6f6b5e47e6
replace_definition_with_import as early as possible (#2264)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-20 17:42:29 -05:00
ali
3722c3db22
multi-file context support for js/ts 2026-01-20 19:34:11 +02:00
Sarthak Agarwal
56304b5669 line_profiling_changes 2026-01-20 03:58:35 +05:30
Sarthak Agarwal
229f3c8f1a add ts verification for candidate 2026-01-19 21:44:13 +05:30
Kevin Turcios
e793993d12
Merge branch 'main' into cf-1014 2026-01-19 10:58:43 -05:00
Kevin Turcios
3a79f03f0a
fix: improve test generation parsing and validation (#2259)
## Summary
- Remove unicode quote sanitization from test code validation
- Rely on individual test validation to filter out tests with syntax
errors (including unicode characters)

## Test plan
- [x] Existing tests pass
- [x] Tests with unicode quote syntax errors are correctly filtered out
during individual validation
2026-01-19 10:23:26 -05:00
Sarthak Agarwal
de7dffbb11 add debug logs 2026-01-19 18:53:13 +05:30
Kevin Turcios
48b667062b fix: exclude loop variables from test import fallback
Make get_referenced_names_from_source scope-aware by reusing
UndefinedNameCollector, preventing invalid imports like `i` and `v`
from loop variables in AI-generated tests.
2026-01-18 04:53:34 -05:00
Kevin Turcios
53baee3994
test: add normalize_code helper and update test assertions (#2255)
## Summary
- Add `normalize_code` helper in `tests/conftest.py` for comparing code
while ignoring quote style differences
- Update test assertions to use `normalize_code()` wrapper
- Add unit tests for comprehension instrumentation cases
2026-01-18 04:03:21 -05:00
Kevin Turcios
fd16c8b889
Typeagent fixes (#2256)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-18 03:58:51 -05:00
Kevin Turcios
caf48ac5c3
fix: skip instrumentation for function calls inside comprehensions (#2254)
## Summary
- Skip instrumentation for target function calls inside list/set/dict
comprehensions and generator expressions
- Fixes NameError when AI-generated tests use comprehensions like
`[func(x) for x in items]`

---------

Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
2026-01-18 03:58:14 -05:00
Kevin Turcios
b64776e284
use AsyncAzureOpenAI (#2253)
# Pull Request Checklist

## Description
- [ ] **Description of PR**: Clear and concise description of what this
PR accomplishes
- [ ] **Breaking Changes**: Document any breaking changes (if
applicable)
- [ ] **Related Issues**: Link to any related issues or tickets

## Testing
- [ ] **Test cases Attached**: All relevant test cases have been
added/updated
- [ ] **Manual Testing**: Manual testing completed for the changes

## Monitoring & Debugging
- [ ] **Logging in place**: Appropriate logging has been added for
debugging user issues
- [ ] **Sentry will be able to catch errors**: Error handling ensures
Sentry can capture and report errors
- [ ] **Avoid Dev based/Prisma logging**: No development-only or
Prisma-specific logging in production code

## Configuration
- [ ] **Env variables newly added**: Any new environment variables are
documented in .env.example file or mentioned in description
---

## Additional Notes
<!-- Add any additional context, screenshots, or notes for reviewers
here -->
2026-01-17 22:44:12 -05:00
Aseem Saxena
14d64da083
Additional Instructions for writing tests compatible with JIT compiled code (#2204)
Plan is to have it behind a jit flag (False by default, passed from cli)
until we have tested it thoroughly.
2026-01-16 10:07:49 -08:00
misrasaurabh1
d8095c7424 Make it better and work again e2e 2026-01-16 10:03:44 -08:00
misrasaurabh1
4570e219a6 fixes and refactor 2026-01-16 00:44:11 -08:00
aseembits93
1a29879603 fix 2026-01-15 21:41:09 -08:00
misrasaurabh1
eaed8a5749 jest looper 2026-01-15 21:40:57 -08:00
misrasaurabh1
4f053eae49 First e2e working version js optimizer!! 2026-01-15 20:40:03 -08:00
misrasaurabh1
0ae49fdd87 checkpoint 2026-01-15 17:31:41 -08:00
misrasaurabh1
0f2dc957d8 checkpoint 2026-01-15 17:00:01 -08:00
misrasaurabh1
1c80984933 checkpoint 2026-01-15 15:57:46 -08:00
misrasaurabh1
6f6bceb233 checkpoint 2026-01-15 15:24:07 -08:00
aseembits93
51ef6aa86b newlines 2026-01-15 12:36:45 -08:00
Aseem Saxena
be392e7808
newline 2026-01-15 12:35:26 -08:00
aseembits93
bb7b898ee3 ready to review 2026-01-15 12:33:10 -08:00
misrasaurabh1
87e09c222a wip instrumentation and execution 2026-01-15 12:13:36 -08:00
Aseem Saxena
bb7b15b27c
Merge branch 'main' into testgen-jit 2026-01-15 11:24:27 -08:00
aseembits93
fa0689c1a6 cleaning up 2026-01-15 10:52:11 -08:00
misrasaurabh1
dc36bf5c57 wip 2026-01-14 23:58:35 -08:00
misrasaurabh1
1a63515f47 Initial js support in aiservice 2026-01-14 22:15:27 -08:00