  • Published on
    November 24, 2025 (Today)

    After 2025, Can Your Google SERP Crawler Still Survive?

    Web-Scraping, Google-Search, SEO, Anti-bot, AI
    An in-depth look at how AI-driven anti-bot defenses, SERP reshuffling, and legal/ethical constraints are reshaping Google search crawling af...
  • Published on
    November 4, 2024 (1y ago)

    The Infra to handle 10M Requests in 10 Minutes for $0.0116

    Infrastructure, Kubernetes, Terraform, Cloud-Computing, Redis, Distributed-Systems
    A comprehensive guide to setting up a highly efficient, scalable infrastructure to process 10 million requests in 10 minutes at a minimal co...
  • Published on
    October 30, 2024 (1y ago)

    27.6% of the Top 10 Million Sites are Dead

    internet-decay, domain-analysis, inactive-websites, top-domains, web-crawler, kubernetes
    An analysis of the top 10 million websites reveals that over a quarter are inactive, highlighting the web's shifting landscape. Using a high...
  • Published on
    October 14, 2023 (2y ago)

    Web Crawling at Scale: Navigating Billions of URLs with Efficiency

    Kubernetes, Web-Crawler, Golang, Nodejs, Distributed-System
    Dive into the world of distributed web crawling with Golang, Docker, and Redis. Learn the logic behind efficient code, use Bloom filters for...
  • Published on
    October 13, 2023 (2y ago)

    The Architecture of a Web Crawler: Building a Google-Inspired Distributed Web Crawler. Part 1

    Kubernetes, Web-Crawler, Golang, Nodejs, Distributed-System
    Unlock the potential of the web with a Google-inspired distributed web crawler. Explore scalable solutions using Kubernetes, Golang, Python,...
© 2025 Built with 💖 by Tony Wang • With TypeScript, Next.js, Tailwind • Inspired by Leerob