Commit graph

11 commits

Author SHA1 Message Date
Kevin Turcios
74c29b20b1 fix: update tests for multi-round benchmark plugin
The benchmark plugin now runs multiple rounds with calibrated
iterations. Tests need SELECT DISTINCT for row counts and must
extract median_ns from BenchmarkStats before validation.
2026-04-02 07:24:55 -05:00
misrasaurabh1
198487bf81 format and lint all 2026-01-29 01:39:48 -08:00
ali
6c8be653ce
Merge branch 'main' of github.com:codeflash-ai/codeflash into feat/feedback-loop-for-unmatched-test-results 2025-12-11 16:59:38 +02:00
Kevin Turcios
2e34d83c52
remove test_framework from pyproject.toml (#955)
* follow up

* remove requirement

* Delete uv.lock

* refresh uv-lock

* first pass

* cleanup test_framework here

* cleanup

* code_review

* cleanup tests

* fix for E2E

* fix tests dir missing

* one more cleanup

* cancel-in-progress

* Revert "cancel-in-progress"

This reverts commit f4bb9079cb.

* not needed here

* lower threshold and cleanup comments

* debug

* temp

* debug

Revert "debug"

This reverts commit fc3655149486c8b980e245e97b8304232086f08d.

fix(discover): Fix pytest discovery for futurehouse structure

Revert "fix(discover): Fix pytest discovery for futurehouse structure"

This reverts commit 40c48882b7413f5876af0e2e08d8f17a65bab091.

Reapply "debug"

This reverts commit c8297e57fbdca2462a8ca1199657748b8bc225e9.

Revert "not needed here"

This reverts commit dd2c5cdf76.

Revert "lower threshold and cleanup comments"

This reverts commit 0e2f57e292.

Reapply "lower threshold and cleanup comments"

This reverts commit e3b24f4a2967551eca8a19f96bf6647b23acdbbc.

Reapply "not needed here"

This reverts commit aec32103c931ff6d57dfa0d012113c2cec5d37a7.

Revert "Reapply "debug""

This reverts commit 77ab9f34f858a17fb29764c544769a0eb72ce7f0.

Reapply "fix(discover): Fix pytest discovery for futurehouse structure"

This reverts commit 506b94ab4fe17a7c8e0d458253812758cced3f22.

feat(futurehouse): Make futurehouse structure pytest compatible

* Revert "debug"

This reverts commit 271c5a37ec.

* Revert "temp"

This reverts commit b363acda1c.

* Revert "debug"

This reverts commit ac29b6beb3.

* just for now
2025-12-09 02:53:08 -08:00
ali
a7f8816f5e
fix tests 2025-11-27 19:51:55 +02:00
Kevin Turcios
841f55ba62 normalize for trace and replay tests too 2025-09-28 08:14:36 +00:00
Kevin Turcios
16d21e7464 reinsert 2025-07-29 23:49:22 -07:00
Kevin Turcios
41d314ddbf cleanup benchmark 2025-07-28 16:46:05 -07:00
Alvin Ryanputra
3158f9cc1c end to end test that proves picklepatcher works. example shown is a socket (which is unpickleable) that's used or not used 2025-04-10 21:43:56 -04:00
Alvin Ryanputra
d653d0dc42 draft of end to end test 2025-04-08 13:32:21 -07:00
Alvin Ryanputra
fe6365262a basic pickle patch version working 2025-04-08 11:32:44 -07:00