Where content lives
For most sites, the homepage links to category pages (depth 1), category pages link to listings (depth 2), and listings link to detail pages (depth 3). Going deeper rarely reveals new content — you hit pagination, sort variants, and tag clouds. Sizing depth at 3-4 captures the bulk of meaningful URLs without blowing budget on combinatorial junk.
Depth vs budget interaction
Depth and budget interact: a high depth limit on a small budget cuts off mid-traversal; a low depth limit on a large budget wastes capacity. The rule of thumb: set depth to the natural shape of the site (3-4 hops for most), then size budget to "depth × average fanout" with safety margin. A site with 20 categories and 200 items each fits in ~5,000 pages at depth 3.
Per-pattern depth
Advanced crawlers vary depth by URL pattern. Detail pages get depth 0 (do not follow links). Category pages get high depth (you want to discover the items). Pagination links get capped at 50-100 to avoid infinite-calendar traps. This is more work than a single global depth limit but dramatically improves budget efficiency on large sites.
