Kevin Turcios
74c29b20b1
fix: update tests for multi-round benchmark plugin
The benchmark plugin now runs multiple rounds with calibrated
iterations. Tests need SELECT DISTINCT for row counts and must
extract median_ns from BenchmarkStats before validation.
2026-04-02 07:24:55 -05:00
misrasaurabh1
198487bf81
format and lint all
2026-01-29 01:39:48 -08:00
ali
6c8be653ce
Merge branch 'main' of github.com:codeflash-ai/codeflash into feat/feedback-loop-for-unmatched-test-results
2025-12-11 16:59:38 +02:00
Kevin Turcios
2e34d83c52
remove test_framework from pyproject.toml (#955)
* follow up
* remove requirement
* Delete uv.lock
* refresh uv-lock
* first pass
* cleanup test_framework here
* cleanup
* code_review
* cleanup tests
* fix for E2E
* fix tests dir missing
* one more cleanup
* cancel-in-progress
* Revert "cancel-in-progress"
This reverts commit f4bb9079cb.
* not needed here
* lower threshold and cleanup comments
* debug
* temp
* debug
Revert "debug"
This reverts commit fc3655149486c8b980e245e97b8304232086f08d.
fix(discover): Fix pytest discovery for futurehouse structure
Revert "fix(discover): Fix pytest discovery for futurehouse structure"
This reverts commit 40c48882b7413f5876af0e2e08d8f17a65bab091.
Reapply "debug"
This reverts commit c8297e57fbdca2462a8ca1199657748b8bc225e9.
Revert "not needed here"
This reverts commit dd2c5cdf76.
Revert "lower threshold and cleanup comments"
This reverts commit 0e2f57e292.
Reapply "lower threshold and cleanup comments"
This reverts commit e3b24f4a2967551eca8a19f96bf6647b23acdbbc.
Reapply "not needed here"
This reverts commit aec32103c931ff6d57dfa0d012113c2cec5d37a7.
Revert "Reapply "debug""
This reverts commit 77ab9f34f858a17fb29764c544769a0eb72ce7f0.
Reapply "fix(discover): Fix pytest discovery for futurehouse structure"
This reverts commit 506b94ab4fe17a7c8e0d458253812758cced3f22.
feat(futurehouse): Make futurehouse structure pytest compatible
* Revert "debug"
This reverts commit 271c5a37ec.
* Revert "temp"
This reverts commit b363acda1c.
* Revert "debug"
This reverts commit ac29b6beb3.
* just for now
2025-12-09 02:53:08 -08:00
ali
a7f8816f5e
fix tests
2025-11-27 19:51:55 +02:00
Kevin Turcios
841f55ba62
normalize for trace and replay tests too
2025-09-28 08:14:36 +00:00
Kevin Turcios
16d21e7464
reinsert
2025-07-29 23:49:22 -07:00
Kevin Turcios
41d314ddbf
cleanup benchmark
2025-07-28 16:46:05 -07:00
Alvin Ryanputra
3158f9cc1c
end to end test that proves picklepatcher works. example shown is a socket (which is unpickleable) that's used or not used
2025-04-10 21:43:56 -04:00
Alvin Ryanputra
d653d0dc42
draft of end to end test
2025-04-08 13:32:21 -07:00
Alvin Ryanputra
fe6365262a
basic pickle patch version working
2025-04-08 11:32:44 -07:00