article thumbnail

How to Block Unwanted Bots from Your Website with.htaccess

SoCal Delicious

This article is a practical guide designed to help webmasters who already know what they want to block. How to Identify the Bot You Want to Block Before you can block a bot, you will need to know at least one of two things: the IP address where the bot is coming from or the"User Agent string" that the bot is using. Be careful though.

IP 38
article thumbnail

How to cleverly integrate your own URI shortener

SoCal Delicious

Please note that my design suggestions ain’t black nor white. While handling bots, you should detect search engine crawlers, too: /* lookup your crawler IP database to populate $isCrawler; then, if the IP wasnt identified as search engine crawler: */ if ($isCrawler !== from the IP address range 65.52.0.0

SEO 37
article thumbnail

How brain-amputated developers created the social media plague

SoCal Delicious

ThingFetcher sometimes requests a (shortened) URI 30 times per second, from different IPs. A method to verify the bot All IP addresses used by a bot should resolve to server names having a unique pattern. For example Googlebot comes only from servers named "crawl" + "-" + replace($IP, ".", "-") + ".googlebot.com" Sad but true.