The Daily Static

<Deleted><Deleted>2010-04-25 08:41:21
  Actually, you bring up a good point. by kelli217 2010-04-25 08:23:03
    The way it's been explained before... by Sharku 2010-04-25 09:08:21
...is that spiders aren't really that smart; they'll trigger on anything that matches [[:alpha:]]+(\.[[:alpha:]]+)* at-sign [[:alpha:]]+(\.[[:alpha:]]+)*

I think that's a comprehensive regex for it, not sure though and too lazy to actually check. Anyway, long story short: no, they don't check whether the TLD is a valid one, etc. Mind, I'm not claiming to be authoritative on this, it's just what I remember as being the lore around here wrt email spiders, gleaned from both other UFies and TPTB (Illiad, Myke...).
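
For illustration, here is a rough Python sketch of that kind of naive matcher. It is a guess at the approach described above, not anything taken from a real spider. Python's re module has no POSIX [[:alpha:]] class, so [A-Za-z] stands in for it, and the names NAIVE_ADDRESS and harvest are made up for this example.

```python
import re

# One or more letter-only labels separated by dots.
# Assumption: [A-Za-z] approximates POSIX [[:alpha:]], which Python's re lacks.
LABELS = r"[A-Za-z]+(?:\.[A-Za-z]+)*"

# labels, a literal at-sign, labels. No TLD validation of any kind.
NAIVE_ADDRESS = re.compile(LABELS + "@" + LABELS)


def harvest(text):
    """Return every substring the naive pattern accepts as an address."""
    return NAIVE_ADDRESS.findall(text)


if __name__ == "__main__":
    page = "Mail alice@example.com or bob@host.notatld, not carol at example dot com"
    for address in harvest(page):
        print(address)
```

Note that the made-up TLD gets picked up just like the real one, while the spelled-out "at ... dot" form is skipped entirely.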

It makes sense from a spammer's point of view to keep the matching rules as simple as possible: the goal is mostly high throughput, not so much data validation. Since both email harvesting and spamming are mostly done by botnets anyway, you could argue that they have ample CPU and bandwidth available. But then you have to take into account that, while it's fairly easy for a human to recognize or determine from the context whether something is ROT13 (or ROT1-25, or any other obfuscation method), there's probably no easy way for a computer to do so other than to bruteforce it and check the result against a list of known TLDs. And even that isn't conclusive: a quick check on Wikipedia tells me there are at least two pairs of TLDs that ROT13 into each other. .am (Armenia) ROT13s into .nz (New Zealand), and .mz (Mozambique) into .zm (Zambia).
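
To make the bruteforce idea concrete, here is a minimal Python sketch: try all 26 rotations and keep the ones whose last label is on a TLD list. Everything in it is an assumption for illustration only; KNOWN_TLDS is a tiny made-up list and rot and plausible_shifts are hypothetical helper names.

```python
import string

# Hypothetical, deliberately tiny TLD list. A real harvester would have to
# maintain a complete and current one.
KNOWN_TLDS = {"com", "org", "net", "am", "nz", "mz", "zm"}


def rot(text, n):
    """Rotate ASCII letters by n places; leave every other character alone."""
    lower = string.ascii_lowercase
    shifted = lower[n:] + lower[:n]
    table = str.maketrans(lower + lower.upper(), shifted + shifted.upper())
    return text.translate(table)


def plausible_shifts(candidate):
    """Return (shift, decoded) pairs whose final label is in KNOWN_TLDS."""
    hits = []
    for n in range(26):
        decoded = rot(candidate, n)
        tld = decoded.rsplit(".", 1)[-1].lower()
        if tld in KNOWN_TLDS:
            hits.append((n, decoded))
    return hits


if __name__ == "__main__":
    for shift, decoded in plausible_shifts("bob@example.am"):
        print(shift, decoded)
```

With this list, an address ending in .am passes the check both unrotated and at shift 13, where the ending becomes .nz, so the TLD test alone can't tell the harvester which reading is the real one.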

Finally, checking against a list of TLDs adds a layer of responsibility for the spammer: they have to maintain that list for when new TLDs get added or old ones are deprecated.

Again, not saying I'm authoritative on any of this, just giving a coder's view on it. "How I'd do it if I were a spammer" if you will.
All images, characters, content and text are copyrighted and trademarks of J.D. Frazer except where other ownership applies. Don't do bad things, we have lawyers.
UserFriendly.Org and its operators are not liable for comments or content posted by its visitors, and will cheerfully assist the lawful authorities in hunting down script-kiddies, spammers and other net scum. And if you're really bad, we'll call your mom. (We're not kidding, we've done it before.)