KBD

Keith Devens .com

Wednesday, March 17, 2010 Flag waving
I would rather be an optimist and be wrong than a pessimist who proves to be right. The former sometimes... – "Hoots"
← Bookmark ExplorerI'm running PHP as a CGI? →

Daily link icon Thursday, November 20, 2003

Why I think weblog spam isn't a long term problem

Like I wrote a few days ago, I don't think weblog spam will be as much of a long-term problem as Mark Pilgrim does.

Whereas e-mail has essentially zero cost to send to millions of people, the cost structure for weblog spam is completely different, and the benefits are much smaller.

If someone leaves a comment on my web site, I know they were here personally and actually had to go through the effort to cut and paste stuff into my comment form. Only the few people who visit my site between the time it's left and the time I delete it ever see the spam, and it only takes me seconds to delete it (no matter how many spam comments were left), after which there's no trace. Even if they're doing it to get Google juice, and Google happens to visit my site during those few hours or whatever, they'd only get a temporary benefit, and a small one at that, until Google scans my site again.

Now, my site is a little different from others because I don't use a standard weblogging tool like Movable Type. For common tools, spammers can automate the process somewhat, but the math still doesn't add up. Spammers are up against an army of webloggers deleting the comments they leave. Unlike e-mail, where a spammer can get a list of e-mail addresses, type up a message, and hit "send", spammers have to do more work to hit weblogs, for far fewer people to see their ad. While an e-mail can go out to millions in one shot, most weblogs don't get more than a few thousand unique hits every day, and most of those hits are probably for RSS files that won't even contain the ads.

Also, while my site lists the most recent comment per post over on the right side of my layout, most times someone will only see a spam comment if they happen to visit a particular post.

Anyway, to me, the math just doesn't seem to add up. Sure, spam comments on weblogs are a small nuisance, and they may continue for a while, but I don't see it becoming the epidemic that e-mail spam is. The one difficulty may be if you use a common weblogging tool that doesn't allow you to mass-delete comments. For me, I can do it in one SQL statement.

← Bookmark ExplorerI'm running PHP as a CGI? →

Comments XML gif

Isofarro (http://www.isolani.co.uk/) wrote:

Keith, I wish you are right about this.

Just a couple of points. You say that you check your comments often, so comment spam doesn't last. That really means you've given up the "right" not to have to monitor your comments. Do you really want to be maintaining your blog when you are off on holiday?

Your solution is analogous to a "Just Hit Delete" (JHD) from the email world. In the email world JHD's weakness is that the damage is already done by the time email is delivered. I accept in comment spam terms the damage may have been done, or will happen in future. (The damage being Google visiting and using your PageRank to bolster a comment spammers website "reputation").

I agree with your point that a temporary Google boost from a spammed link on your website is just that - temporary. But, look at the bigger picture. If one person can get temporary links on 1000 blogs - it evens out to maybe something like a permanent link on 50 high profile blogs. (The trough of you removing the link is counteracted by the link still existing on 999 other websites). Then its just a case of Lather. Rinse. Repeat.

Also when comparing returns of email to returns of comment spam, remember email needs only 0.05% to turn a profit. Google requires far less links for a healthy increase in PageRank. If that results in a higher ranking for high-profile keywords the investment in automation is well worth it - and also a resellable spam tool.

Remember, all a comment form is is an HTML form. All a browser does with an HTML form is to url-encode it and send it to a particular URL. Anyone with a marginal knowledge of HTTP and forms can connect directly to the form action URL and bypass your form entirely. (Unless you build in a server-created hidden field type double-check that can't be easily duplicated).

I do hope you are right, otherwise comment spam is going to get very ugly.

∴ Isofarro | 21-Nov-2003 4:12am est | http://www.isolani.co.uk/ | #3322

Mean Dean (http://www.healyourchurchwebsite.com) wrote:

First, as much as I like Mark's technical writings, I'm not so keen on his oft' pessimistic posts on spam and combatting spam. History is replete with examples of how the weak have organized to hammer the strong. What was more unsettling about MP's comments was the fear of reprisal. Like appeasing spammers is going to get them to stop.

I agree. I've made some changes to my own MT system and have radically reduced the spam. Those few that get through find that I make it VERY expensive for them, reporting them to their upstream providers, their affiliation sponsers, the works. I've got a few other things to do just to make sure they get the message.

In other words, I make it expensive. And here's where MP is really wrong. If each of us took action in our own variety of ways, we'd create so many moving and disimilar targets that spammers would go on to find weaker, easer marks.

∴ Mean Dean | 21-Nov-2003 2:16pm est | http://www.healyourchurchwebsite.com | #3324

Feel free to post a comment below. Please see my comment policy.

Formatting Rules (No HTML):

  • **bold**, *italic*, _underlined_, --strikeout--
  • "text"="url" creates a link, and URLs are auto-highlighted
  • Blockquote: Like e-mail, begin paragraph with > (greater-than sign)
  • Lists: begin paragraph with *,-, or + (unordered), or # (ordered)
  • Code block: ?!code:language=perl|php|sql|javascript|etc.{\n}...{\n}?!/code

:
(will be your IP address if blank)
: (optional)
(Will not be shown on site)

: (optional)
:

March 2010
SunMonTueWedThuFriSat
 123456
78910111213
14151617181920
21222324252627
28293031 



RSS feed RSS feed for Keith's Weblog
Atom feed Atom feed for Keith's Weblog
Weblog archive
Recent comments
  on 2 posts

Recent comments XML

new⇒I hate ASP.NET

I hate ASP... I was doing wonders​with PHP, then suddenly one of my​clients...

Johnies: Mar 17, 6:14am

Quantum physics and free will

I knew you were going to say that....

Tom Massey: Mar 15, 9:26pm

Generated in about 0.11s.

(Used 8 db queries)