Why it matters for scraping and privacy
Favicon fingerprinting is a reminder that stateful tracking — recognising you by data left behind on your machine — is not limited to cookies and localStorage (a browser key/value store websites can write to). A scraper that rotates cookies but reuses the same browser profile can still be linked across sessions through the favicon cache and similar leftovers (HSTS pins, HTTP/2 connection coalescing, the disk cache itself — see cache-timing fingerprinting). The practical defence for automation is a genuinely fresh, isolated profile per identity — not just cleared cookies. Major browsers added partitioning and mitigations after the 2021 disclosure, but the broader class of attack (persistent side caches used as supercookies) remains relevant.
Defending against it when scraping
Because the favicon supercookie lives in a cache that survives normal cookie clearing, the defence is isolation: give every scraping session its own browser profile or container, so the favicon cache (like cookies, localStorage and the HTTP cache) starts empty and is thrown away afterward. Reusing one long-lived profile across thousands of requests lets the cache fill up into a stable identifier that ties all your sessions together.
Fresh, disposable profiles also matter for a subtler reason: an empty favicon cache on every single visit is itself slightly odd for a "returning" user — so for crawls that need to look like a real return visit, you want the cache to persist within one session but reset between identities. Managed browser pools and a web scraping API handle this rotation for you, pairing each fresh profile with a matching proxy so the favicon cache, cookies and IP all turn over together.
