Add exponential backoff for HTTP 429 rate limiting in scrapers#6731
priv-r8s wants to merge 2 commits into stashapp:develop
Conversation
- Backoff delay = Retry-After + exponential (2s, 4s, 8s, ...)
- If Retry-After exceeds the 60s max, give up immediately
- Respects the Retry-After header as a floor, adds incremental backoff
- Comprehensive unit tests for all backoff paths

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…tness

- rateLimitBackoff returns (time.Duration, bool) instead of a sentinel -1
- Use errors.As instead of a direct type assertion for HTTPError
- TestLoadURL_429ExhaustsRetries now actually tests the retry exhaustion path (asserts *HTTPError with status 429 and the correct attempt count)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
So, after a brief look, the biggest thing nagging me is the hard-coded values. Each site has more or less strict rules around scraping, and hard-coding these values could increase wait times dramatically depending on the situation. I see two possible options and could use some feedback on the best solution:
- Change this up to allow individual scrapers to pass the values (e.g. rate_limit_retries) per scraper. It would require updating all existing scrapers to pass the new values, but it would give users flexibility depending on the site.
- Set them as variables and expose them in the app's scraper settings so users can adjust them as they see fit. We would need to find sane defaults, but it would let users do their own testing against sites and find optimal settings.
It would be best to wait for some feedback from others before implementing either of these.
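For concreteness, the first option could look something like the sketch below. The field names (`rate_limit_retries`, `rate_limit_max_wait`), the struct, and the fallback default of 3 are all hypothetical; nothing here matches an existing stash schema.

```go
package main

import "time"

// scraperConfig sketches hypothetical per-scraper overrides for the
// rate-limit parameters, falling back to globals when unset.
type scraperConfig struct {
	RateLimitRetries int           `yaml:"rate_limit_retries"`
	RateLimitMaxWait time.Duration `yaml:"rate_limit_max_wait"`
}

// retries returns the per-scraper value, or a sane global default
// (hypothetically 3) when the scraper doesn't set one.
func (c scraperConfig) retries() int {
	if c.RateLimitRetries > 0 {
		return c.RateLimitRetries
	}
	return 3 // assumed global default
}

func main() {
	_ = scraperConfig{RateLimitRetries: 5}.retries()
}
```

A zero-value-means-default convention like this would let the two options coexist: app-level settings supply the defaults, and individual scraper YAMLs override them only where needed.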
Off the top of my head, I'm not sure how helpful this would be. I haven't encountered a single yml/json scraper that properly returns a 429 as a rate-limit backoff mechanism.
Other than that, I'm not sure how this would apply to bulk scrapes. Would we need a global cooldown that freezes all HTTP requests, or is this better suited as #2914?
Summary
Test plan
Replaces #6722 (closed due to fork rebuild).