article thumbnail

How to Block Unwanted Bots from Your Website with.htaccess

SoCal Delicious

How to Identify the Bot You Want to Block Before you can block a bot, you will need to know at least one of two things: the IP address where the bot is coming from or the"User Agent string" that the bot is using. Once youve located the entries that belong to the bot, look for the IP address and the user agent string. Be careful though.

IP 38
article thumbnail

How brain-amputated developers created the social media plague

SoCal Delicious

ThingFetcher sometimes requests a (shortened) URI 30 times per second, from different IPs. A method to verify the bot All IP addresses used by a bot should resolve to server names having a unique pattern. For example Googlebot comes only from servers named "crawl" + "-" + replace($IP, ".", "-") + ".googlebot.com"

article thumbnail

How to cleverly integrate your own URI shortener

SoCal Delicious

While handling bots, you should detect search engine crawlers, too: /* lookup your crawler IP database to populate $isCrawler; then, if the IP wasnt identified as search engine crawler: */ if ($isCrawler !== from the IP address range 65.52.0.0 Other rogue bots identify themselves by IP addy, user agent, and/or referrer.

SEO 37