Analysing Robots.txt at scale with HTTP Archive and BigQuery

April 23, 2026

Episode notes

In this episode of Search Off the Record, Martin and Gary turn a simple robots.txt question into a data‑driven deep dive using HTTP Archive, WebPageTest, custom JavaScript metrics, and BigQuery. They explore how millions of real robots.txt files are actually written in 2025–2026, which directives an...