<b>Playbook: audit your status-code distribution</b>
Healthy median: 200s = 91.3%, 3xx = 5.1%, 4xx = 2.8%, 5xx = 0.2% (across 240 crawls). Deviation flags specific failures.
Checklist:
— Step 1: build the full distribution, compare each band to the median above
— Step 2: 3xx above 12% ▓▓▓▓▓▓▓▓░░ → audit redirect chains; median chain length should be 1.0, p90 ≤ 2
— Step 3: 4xx above 6% → pull top broken-link sources; usually 4-5 templates emit 80% of them
— Step 4: any 5xx ↑ vs. baseline → check during your crawler's peak concurrency (often timeout, not real outage)
— Step 5: log the distribution monthly; track 5xx as a leading reliability metric
So what: 5xx rate is the one band where 0.2%→1.0% predicts ranking loss before traffic drops. Watch the delta, not the absolute.
—
В @CrawlBudgetMyths такого crawl budget myths ещё много
Crawl & Render
@CrawlAndRender
<b>Playbook: audit your status-code distribution</b>
Этот пост опубликован в Telegram-канале Crawl & Render. Подписаться можно по ссылке: @CrawlAndRender.