feat: add maximum evaluation time limit with PROMPTFOO_MAX_EVAL_TIME_MS #4322

mldangelo · 2025-06-04T23:31:35Z

Adds a new maximum evaluation time limit feature with PROMPTFOO_MAX_EVAL_TIME_MS environment variable and maxEvalTimeMs API option. Includes comprehensive documentation and test coverage. Useful for CI/CD time limits, cost control, and preventing runaway evaluations.

gru-agent · 2025-06-04T23:31:55Z

TestGru Assignment

Summary

Link	CommitId	Status	Reason
Detail	`a168f20`	✅ Finished

Files

File	Pull Request
src/envars.ts	🛑 Cancelled (Canceled by Auto Rebase Detail)

Tip

You can @gru-agent and leave your feedback. TestGru will make adjustments based on your input

greptile-apps

6 file(s) reviewed, no comment(s)

sourcery-ai

Hey @mldangelo - I've reviewed your changes and found some issues that need to be addressed.

Here's what I looked at during the review

🟡 General issues: 1 issue found
🟢 Security: all looks good
🟢 Testing: all looks good
🟢 Documentation: all looks good

site/docs/usage/command-line.md

src/evaluator.ts

test/evaluator.test.ts

gru-agent · 2025-06-04T23:34:04Z

TestGru Assignment

Summary

Link	CommitId	Status	Reason
Detail	`274165e`	✅ Finished

Files

File	Pull Request
src/envars.ts	🛑 Cancelled (Canceled by Auto Rebase Detail)

src/evaluator.ts

gru-agent · 2025-06-04T23:38:31Z

TestGru Assignment

Summary

Link	CommitId	Status	Reason
Detail	`542c031`	✅ Finished

Files

File	Pull Request
src/envars.ts	🛑 Cancelled (Canceled by Auto Rebase Detail)

gru-agent · 2025-06-04T23:48:22Z

TestGru Assignment

Summary

Link	CommitId	Status	Reason
Detail	`a0cd4fb`	✅ Finished

Files

File	Pull Request
src/envars.ts	🟣 Merged #4323

Copilot

Pull Request Overview

Adds a global time‐limit feature for evaluations so users can cap total run time via maxEvalTimeMs or PROMPTFOO_MAX_EVAL_TIME_MS.

Introduces a new maxEvalTimeMs option and corresponding environment variable for overall evaluation timeouts
Implements global abort logic in Evaluator.evaluate, tracks pending steps, and records timeout results
Updates types, schema, documentation, and tests to cover the new feature
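
To make the time-budget idea concrete, here is a minimal sketch, not the code from this PR. It is a simplified serial loop that stops starting new steps once the budget is spent and records a timeout result for each remaining step. The helper name `runWithTimeBudget`, the `StepResult` shape, the injected `now` clock, and the error message are all hypothetical, and treating `0` as "no limit" is an assumption. The real change implements abort logic inside `Evaluator.evaluate`, which is not reproduced here.

```typescript
interface StepResult {
  id: string;
  success: boolean;
  error?: string;
}

// Hypothetical helper: runs steps in order and records a timeout result
// for every step that would start after the budget has been used up.
function runWithTimeBudget(
  steps: Array<{ id: string; run: () => void }>,
  maxEvalTimeMs: number,
  now: () => number, // injected clock so the behavior is deterministic in tests
): StepResult[] {
  const start = now();
  const results: StepResult[] = [];
  for (const step of steps) {
    // Assumption: a limit of 0 disables the check entirely.
    const expired = maxEvalTimeMs > 0 && now() - start >= maxEvalTimeMs;
    if (expired) {
      results.push({
        id: step.id,
        success: false,
        error: `Evaluation exceeded max duration of ${maxEvalTimeMs}ms`,
      });
      continue;
    }
    step.run();
    results.push({ id: step.id, success: true });
  }
  return results;
}

// Usage with a fake clock: each step "takes" 60 ms against a 100 ms budget.
let clock = 0;
const tick = (ms: number) => () => {
  clock += ms;
};
const demoResults = runWithTimeBudget(
  [
    { id: 'a', run: tick(60) },
    { id: 'b', run: tick(60) },
    { id: 'c', run: tick(60) },
  ],
  100,
  () => clock,
);
```

In this sketch a step that has already started is allowed to finish. Cancelling in-flight work would need an abort mechanism, as the overview describes.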

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
test/evaluator.test.ts	Refactored mocks and added a test for aborting when `maxEvalTimeMs` is exceeded
src/types/index.ts	Added `maxEvalTimeMs` to `EvaluateOptionsSchema` and the exported type
src/evaluator.ts	Implemented global timeout/abort logic, tracking, and pending-result handling
src/envars.ts	Introduced `PROMPTFOO_MAX_EVAL_TIME_MS` env var and `getMaxEvalTimeMs` helper
site/static/config-schema.json	Extended JSON schema with `"maxEvalTimeMs"` property
site/docs/usage/troubleshooting.md	Updated troubleshooting guide with triage steps and examples
site/docs/usage/command-line.md	Documented the new `PROMPTFOO_MAX_EVAL_TIME_MS` and updated related tips

Copilot · 2025-06-04T23:59:56Z

src/envars.ts

+ * @returns The max duration in milliseconds, or the default value if not set.
+ */
+export function getMaxEvalTimeMs(defaultValue: number = 0): number {
+  return getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS', defaultValue);


The implementation calls `getEnvInt` with a `defaultValue` argument, but `getEnvInt` only accepts one argument. The `defaultValue` is therefore ignored and may cause a type error. Change to `const val = getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS'); return val != null ? val : defaultValue;`.

Suggested change

-  return getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS', defaultValue);
+  const val = getEnvInt('PROMPTFOO_MAX_EVAL_TIME_MS');
+  return val != null ? val : defaultValue;
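
The fallback pattern in this suggestion can be sketched in isolation. The `getEnvInt` below is a hypothetical one-argument stand-in, not the helper from `src/envars.ts`, whose actual signature is not shown in this thread. The environment is passed in as a plain map (the `Env` type is also an assumption) so the example does not depend on `process.env`.

```typescript
type Env = Record<string, string | undefined>;

// Hypothetical one-argument parser: returns undefined when the variable
// is unset or does not parse as an integer.
function getEnvInt(env: Env, key: string): number | undefined {
  const raw = env[key];
  if (raw === undefined) {
    return undefined;
  }
  const parsed = parseInt(raw, 10);
  return isNaN(parsed) ? undefined : parsed;
}

// Falls back to defaultValue when the variable is missing or invalid.
function getMaxEvalTimeMs(env: Env, defaultValue: number = 0): number {
  const val = getEnvInt(env, 'PROMPTFOO_MAX_EVAL_TIME_MS');
  return val != null ? val : defaultValue;
}
```

With this shape, `getMaxEvalTimeMs({ PROMPTFOO_MAX_EVAL_TIME_MS: '60000' })` yields the parsed value, while an empty map yields the supplied default.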

Copilot · 2025-06-04T23:59:56Z

test/evaluator.test.ts

    const originalReadFileSync = fs.readFileSync;
-    fs.readFileSync = jest.fn().mockImplementation((path) => {
-      if (path.includes('test_file.txt')) {
+    jest.spyOn(fs, 'readFileSync').mockImplementation((path) => {
+      if (typeof path === 'string' && path.includes('test_file.txt')) {
        return '<h1>Sample Report</h1><p>This is a test report with some data for the year 2023.</p>';
      }
      return originalReadFileSync(path);


[nitpick] After using `jest.spyOn` on `fs.readFileSync`, the spy is never restored and could leak into other tests. Consider adding `afterEach(() => jest.restoreAllMocks());` or calling `.mockRestore()` on the spy.
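
The leak this comment warns about comes from replacing a shared function without restoring it. The sketch below shows the save-and-restore pattern using a hand-rolled `spyOn` stand-in rather than Jest itself, so it can run outside a test runner. Here `spyOn`, `fakeFs`, and `withStubbedRead` are hypothetical names for illustration. In a real Jest suite the equivalent would be `.mockRestore()` on the spy or `jest.restoreAllMocks()` in `afterEach`, as suggested above.

```typescript
// Minimal stand-in for a spy: replaces obj[key] and returns a restore function.
function spyOn<T extends object, K extends keyof T>(obj: T, key: K, impl: T[K]): () => void {
  const original = obj[key];
  obj[key] = impl;
  return () => {
    obj[key] = original;
  };
}

// Hypothetical module-like object standing in for `fs`.
const fakeFs = {
  readFileSync: (path: string): string => `real contents of ${path}`,
};

function withStubbedRead(): string {
  const restore = spyOn(fakeFs, 'readFileSync', (path: string) =>
    path.indexOf('test_file.txt') !== -1 ? '<h1>Sample Report</h1>' : `real contents of ${path}`,
  );
  try {
    return fakeFs.readFileSync('test_file.txt');
  } finally {
    // Always restore, even if the body throws, so later callers see the original.
    restore();
  }
}
```

Restoring in `finally` plays the same role as an `afterEach` hook: later code sees the original function regardless of how the stubbed section exits.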

Copilot · 2025-06-04T23:59:57Z

src/evaluator.ts

-          logger.info(
-            `[${numComplete}/${serialRunEvalOptions.length}] Running ${provider} with vars: ${vars}`,
-          );
+    try {


[nitpick] The large try/catch block covers both serial and concurrent evaluation logic, making the method hard to follow. Consider extracting serial and concurrent processing into separate helper functions for clarity.

Co-authored-by: gru-agent[bot] <185149714+gru-agent[bot]@users.noreply.github.com>

mldangelo added 4 commits June 4, 2025 16:11

feat: add maximum evaluation time limit with PROMPTFOO_MAX_EVAL_TIME_MS env var and maxEvalTimeMs option

e348e59

docs: add comprehensive timeout troubleshooting guide and cross-references

cb23800

docs: simplify timeout troubleshooting with progressive disclosure and direct action-oriented guidance

2c334ef

update docs

a168f20

greptile-apps bot reviewed Jun 4, 2025

update assets

274165e

sourcery-ai bot reviewed Jun 4, 2025

site/docs/usage/command-line.md (outdated, resolved)

src/evaluator.ts (outdated, resolved)

test/evaluator.test.ts (resolved)

mldangelo requested a review from Copilot June 4, 2025 23:34

github-advanced-security bot found potential problems Jun 4, 2025

src/evaluator.ts (dismissed)

fix: update broken anchor link in command-line docs after heading change

542c031

refactor: use object destructuring for metrics access per code review feedback

a0cd4fb

mldangelo requested a review from Copilot June 4, 2025 23:56

gru-agent bot mentioned this pull request Jun 4, 2025

test: add unit test for src/envars.ts #4323

Merged

Copilot AI reviewed Jun 4, 2025

test: add unit test for src/envars.ts (#4323)

f4b9dc9

Co-authored-by: gru-agent[bot] <185149714+gru-agent[bot]@users.noreply.github.com>
