Category: Tech

Hit by Weight Loss Spambot, Heavy Day for Content Scrapers

Hit I was, by a terribly time wasteful spambot pushing weight loss ads. Yes, my Recaptcha did send them to my spam folder for analysis, but it was still a lot. I just wished they would simply stop. All the comment spammers were pushing weight loss. I’m sure they are telling me something about my slightly widening girth, but I am already making amends. There is no need for added pressure, nor waste of bandwidth and technology.

454a986e.cst.lightpath.net: Research, Ban

454a986e.cst.lightpath.net is a content scraper bot that has been visiting my site, so I would like to remove the welcome mat.

lightpath.net seems to change their front extent many times, as a search on Google did not yield an exact match, but many variants.

Pattern:
Take the numbers before “.cst.lightpath.net” and convert them from hex to decimal, giving you 4 octets.

lightpath.net resolves to 216.2.192.141, Optimum Online or Cablevision Systems, XO Communications (ISP), but they have no website. cablevisionlightpath.org also resolves to the same ip address.

454a986e.cst.lightpath.net Their hex converts to 69.74.152.110, Cablevision Systems.

Black & Decker F1000 Type 1 Iron: Disassembly Tips

Black and Decker Steam Advantage Iron, F1000 type 1, has a terrible reputation for usability and reliability. Diana's iron does not work as the safety features prematurely turn the iron off.

Black and Decker Steam Advantage Iron, F1000 type 1, has a terrible reputation for usability and reliability. Diana’s iron does not work as the safety features prematurely turn the iron off.

She was miffed, friend Diana, in Toronto, Canada, that her newly purchased Black and Decker Steam Advantage Iron, F1000 type 1, was acting up. It was prematurely shutting down, a supposed safety feature that belied the task of actual ironing. This seems similar to PC anti-virus software that so overtaxes the PC such that it cripples even simple and small mouse movements. She asked me to look into it.

Toronto Star Treating All Browsers like a Smartphone = Crap UI

Headscratching, it is, when I browse on the internet and the site treats me like a smartphone. I’m not on a smartphone, have lots of screen space and do not like the experience. Different pieces of hardware should be treated differently. We are all not smartphones.

The Toronto Star recently changed their web site UI so that all browsers are treated like smartphones. While this is great if you use a smartphone, it breaks all the rules if you are using a regular PC, or even a tablet.

IPVNow.com Will Fool Anti-Bot Software

Fool, it would, an automated anti-bot system, because humans are more intelligent than bots. They are innovative, in their evil genius way. Computer security is all about the arms race. The better the methods, the better the counter measures, and then it repeats. No security measure is foolproof for very long.

IPVNow.com has a slew of host names that when you look them up, resolve successfully and all point to the same IP address, 103.224.182.241. This misdirection is what would fool the anti-bot software, because this IP is real and it points to a valid company, Trellian, which owns IPVNow.com. But banning this single IP does not stop the content scraping. Each host name has its own IP address that uses ISPs Ubiquity and Nobis. These are the IPs you need to ban.

customer.worldstream.nl: Banning Content Scraper

This host name is constantly scraping my site, but when I look it up it does not resolve. Searches on Google reveal that they seem to change their IP address very often. Many other sites are getting spammed and content scraped by this host. I have no alternative than to ban the whole IP range of customer.worldstream.nl.

I read my raw access log and the first column provides me with an IP address or host name. This first column is usually enough to target the specific IP that is errant, and I ban the last IP octet of 256 addresses.

Strange Host Names that I Cracked

These host names try hard to evade detection of their IP addresses, in order to scrape content and sometimes break into from web sites. They have specifically scraped mine and so I hunted them down and banished them. Often times the unix host command returns nothing, so research is required. This usually works.

Unexpected Site Visitors: Welcome

Welcome to my web site. If I can break the boredom of your day or put a smile on your face I am happy. These are some visitors that are unexpected but welcome. Most of these are government agencies, because I think it is odd that any government would be interested in my writing. There are lots of educational institutions that visit, but they are not as interesting to me. Note that just because they visit my site does not mean that they are hunting for bad stuff. It usually means that workers are human, looking for general life information and find it on my site for specific topics.

Host Names I have Researched, Flummoxed

intra.cea.fr content scraped me, so I researched them.

is005045.intra.cea.fr 10.0.5.45
archie6420.intra.cea.fr 32.166.1.28

napsaci011.intra.cea.fr 132.166.177.50
napsaci012.intra.cea.fr 132.166.177.51
is151991.intra.cea.fr 132.166.118.1

kalahari.intra.cea.fr 132.167.4.137
aster.intra.cea.fr 132.167.197.147

gre018941.intra.cea.fr 132.168.11.11
gre019465.intra.cea.fr 132.168.11.112
gre045998.intra.cea.fr 132.168.11.183
grecfnimon01.intra.cea.fr 132.168.16.105
gre058496-24.intra.cea.fr 132.168.24.180
gre047417.intra.cea.fr 132.168.28.194
gre033069.intra.cea.fr 132.168.30.141
moises.intra.cea.fr 132.168.37.241
gre022491.intra.cea.fr 132.168.65.0
gre035045-160.intra.cea.fr 132.168.160.31

altairnew.intra.cea.fr 132.169.8.1
717rccair5235b.intra.cea.fr 132.169.13.1
aurel.intra.cea.fr 132.169.33.1
celaeno.intra.cea.fr 132.169.11.129

0x667.crypt.gy came back with a host lookup of 94.23.147.30, OVH. I cannot verify this IP address. Research is inconclusive. This guy uses a Microsoft server error code “1639 (0x667). Invalid command line argument” in his hostname.
server.crypt.gy 188.165.211.48

dailyfeed.co.uk are thieves, stealing my bandwidth

dailyfeed.co.uk is using an image straight from my site and therefore stealing my bandwidth. I have tried to contact them but cannot find an email address.

One of their articles: http://www.dailyfeed.co.uk/2016/03/13-things-found-every-00s-kids-bedroom/6/ takes an image from my site. Why they are so cheap and not store the image on their site I do not know. I have banned them as a referrer, so hope this will stop.

Theft, and no less. Assholes. Alex Taylor, owner of dailyfeed.co.uk, you are an asshole.

I have contacted their ISP, cloudflare.com: