fix: increase API timeout default from 900s to 1800s for slow-thinking models by teknium1 · Pull Request #3431 · NousResearch/hermes-agent

teknium1 · 2026-03-27T18:13:51Z

Problem

Models like GLM-5/5.1 can think for 15+ minutes. The previous HERMES_API_TIMEOUT default of 900s (15 min) killed legitimate requests mid-thinking.

Fix

Raise HERMES_API_TIMEOUT default from 900s to 1800s (30 min) in both places that read the env var:

_build_api_kwargs() — non-streaming total timeout
_call_chat_completions() — streaming connection write timeout

Still configurable via HERMES_API_TIMEOUT env var.

Unchanged

Stream per-chunk read timeout (60s) — appropriate for inter-chunk timing
Stale stream detector (180-300s) — already scales for large contexts

Test results

200 passed, 0 failures.

…g models Models like GLM-5/5.1 can think for 15+ minutes. The previous 900s (15 min) default for HERMES_API_TIMEOUT killed legitimate requests. Raised to 1800s (30 min) in both places that read the env var: - _build_api_kwargs() timeout (non-streaming total timeout) - _call_chat_completions() write timeout (streaming connection) The streaming per-chunk read timeout (60s) and stale stream detector (180-300s) are unchanged — those are appropriate for inter-chunk timing.

github-actions · 2026-03-27T19:58:51Z

⚠️ Supply Chain Risk Detected

This PR contains patterns commonly associated with supply chain attacks. This does not mean the PR is malicious — but these patterns require careful human review before merging.

⚠️ WARNING: Install hook files modified

These files can execute code during package installation or interpreter startup.

Files:

hermes_cli/setup.py

Automated scan triggered by supply-chain-audit. If this is a false positive, a maintainer can approve after manual review.

teknium1 force-pushed the hermes/hermes-86f614ec branch from b4391bc to 9ca086c Compare March 27, 2026 18:31

teknium1 force-pushed the hermes/hermes-86f614ec branch from 9ca086c to 20c2aeb Compare March 27, 2026 19:58

teknium1 changed the title ~~fix(streaming): increase read timeout and skip retries for thinking models~~ fix: increase API timeout default from 900s to 1800s for slow-thinking models Mar 27, 2026

teknium1 merged commit fb46a90 into main Mar 27, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: increase API timeout default from 900s to 1800s for slow-thinking models#3431

fix: increase API timeout default from 900s to 1800s for slow-thinking models#3431
teknium1 merged 1 commit intomainfrom
hermes/hermes-86f614ec

teknium1 commented Mar 27, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

teknium1 commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Fix

Unchanged

Test results

Uh oh!

github-actions bot commented Mar 27, 2026

⚠️ Supply Chain Risk Detected

⚠️ WARNING: Install hook files modified

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

teknium1 commented Mar 27, 2026 •

edited

Loading