Commit graph

23 commits

Author SHA1 Message Date
Sarthak Agarwal
2f6a91f95d fix: lint issues in experiments folder and format fixes 2026-01-29 19:26:50 +05:30
Kevin Turcios
c3b3f2db9c check 2026-01-29 06:08:46 -05:00
Kevin Turcios
0170ccb361 update assertions 2026-01-29 05:33:16 -05:00
misrasaurabh1
df529b5977 line profiler experiments 2026-01-16 13:09:17 -08:00
misrasaurabh1
6187eb1131 before implementation 2026-01-14 18:29:20 -08:00
Saurabh Misra
7b11f9e5dc move data from cli/experiments and cli/pie_test_set to top-level experiments directory 2025-02-12 22:32:45 -05:00
RD
af714cf0af Merge branch 'main' of github.com:codeflash-ai/codeflash into bootstrapped-benchmarking 2025-01-28 16:39:35 -08:00
Sarthak Agarwal
f7dc4a6498 Update analysis_experiments.ipynb 2025-01-28 00:01:12 +05:30
RD
40c72da59d Byesian analysis implementation 2025-01-17 17:44:24 -08:00
RD
afe094feaa Busy work moving a file. 2024-11-12 11:00:10 -08:00
Saurabh Misra
d455cdee1a Ruff reformat and fix all the python files
Set minimum libcst version to be 1.0.1
move the stub files to dev dependencies
2024-10-25 15:45:44 -07:00
RD
d36e046c3b Changing benchmarking budget from 5 to 10 seconds. 2024-10-14 11:30:56 -07:00
RD
77cccc2b2a Benchmarking analytics notebook 2024-10-13 21:15:57 -07:00
Saurabh Misra
c73f5b40c9 create instrumented tests locally 2024-07-23 15:50:47 -07:00
ihitamandal
2955c33ff0 Add speedup as a comment to each candidate file 2024-06-26 16:35:43 -07:00
ihitamandal
242f0c1e7f Add explanations in the candidate files, and documentation on how to use the script 2024-06-21 14:25:54 -07:00
afik.cohen
b6c397b799 Rm unused wandb stuff 2024-05-07 17:01:44 -07:00
afik.cohen
22c7466e53 Use experiment metadata to see experiment id in metrics_analysis.py 2024-05-07 16:59:34 -07:00
afik.cohen
ec020bc503 Use harmonic mean instead of geometric 2024-05-06 15:01:41 -07:00
afik.cohen
682f5d9fd1 Take an experiment_id as parameter to load_data 2024-05-03 18:22:10 -07:00
afik.cohen
9a2b9ca2a5 Add example wandb notebook 2024-05-02 17:51:43 -07:00
afik.cohen
e6a137536c Add wip performance metrics tests 2024-05-02 17:51:33 -07:00
afik.cohen
01f48b4f11 Add experiment id column, add tests for metrics analysis - validity 2024-05-02 17:30:08 -07:00