My site has been getting content and image scraped by bb-81-107.018.net.il and bb-153-46.018.net.il, but these two host names do not resolve. Furthermore there is very little on the internet on them. My next step is to ban their complete IP range.
CUST-22.214.171.124.018.net.il 126.96.36.199 2017-jan-23
BB-151-179.018.net.il 188.8.131.52 2017-jan-18
BB-154-107.018.net.il 184.108.40.206 2016-oct-08
If there are 4 octets in the host name, then reverse the octets. If there are only 2 octets then these are the last 2 of the IP. You will need to use the host command and try the first 2 octets of their common ranges.
Fool, it would, an automated anti-bot system, because humans are more intelligent than bots. They are innovative, in their evil genius way. Computer security is all about the arms race. The better the methods, the better the counter measures, and then it repeats. No security measure is foolproof for very long.
IPVNow.com has a slew of host names that when you look them up, resolve successfully and all point to the same IP address, 220.127.116.11. This misdirection is what would fool the anti-bot software, because this IP is real and it points to a valid company, Trellian, which owns IPVNow.com. But banning this single IP does not stop the content scraping. Each host name has its own IP address that uses ISPs Ubiquity and Nobis. These are the IPs you need to ban.
This host name is constantly scraping my site, but when I look it up it does not resolve. Searches on Google reveal that they seem to change their IP address very often. Many other sites are getting spammed and content scraped by this host. I have no alternative than to ban the whole IP range of customer.worldstream.nl.
I read my raw access log and the first column provides me with an IP address or host name. This first column is usually enough to target the specific IP that is errant, and I ban the last IP octet of 256 addresses.
intra.cea.fr content scraped me, so I researched them.
0x667.crypt.gy came back with a host lookup of 18.104.22.168, OVH. I cannot verify this IP address. Research is inconclusive. This guy uses a Microsoft server error code “1639 (0x667). Invalid command line argument” in his hostname.
After an epic fail in selling food, will Canadian Tire now sell lingerie? Say it isn't so...
As a long time Canadian I consider it a tradition to shop at Canadian Tire. Unlike Walfart, Crappy Tire is, after all, Canadian. When I need a tool or something for the house I think of Canadian Tire first. This year, I am finding that the products I need are no longer advertised in the CT weekly flyer, and all the stuff I don’t need has taken its place. This has the result of me not visiting the local Crappy Tire as much as I used to, and therefore I am not spending the family’s cash in their stores. Going to the epitome of insanity, CT’s weekly food specials now gets attached to the weekly flyer. As my daughter often says, this is an EPIC FAIL.
Make no mistake that riding a motorcycle here in Toronto is dangerous. Intentionally or not cages (cars) do dangerous things, endangering the life of the motorcyclist. Talking on the phone is especially hazardous to riders. When anyone makes a mistake and there is contact, the motorcyclist will take a trip to the hospital. An interesting motorcycle strategy that is new to me is called aggressive but cautious. Central to this strategy is to ride 20-30kph over the speed limit.
Drupal 6's Feed Aggregator works well
I really love Drupal 6’s feed aggregator. It has the ability to aggregate numerous RSS (Really Simple Sybndication) news or blog feeds, categorize them, and keep them current, all in the effort to save you the trouble of going to each news source and hunting for relevant news. All you have to do is chose a category, read the summary and click the link for the article you what you want.
This is a preview of
Increasing Efficiency of Drupal 6’s Feed Aggregator
. Read the full post (650 words, 1 image, estimated 2:36 mins reading time)