ACM Queue - A Conversation with Phil Smoot - An engineer at Hotmail discusses the challenges of keeping one of the Web’s largest and oldest Internet services running 24/7 (via Matt). Informative, and has some good quotes. On simplicity:
Most of this experience comes with time. We try to build our bench with folks who understand what mistakes not to make. New hires tend to want to do complex things, but we know complex things break in complex ways. The veterans want simple designs, with simple interfaces and simple constructs that are easy to understand and debug and easy to put back together after they break.
The best advice is just basically to keep everything as simple as possible—simple processes, simple SKUs, simple engineering. These systems get to be very big very fast. I don’t think there’s really any one particularly hard, gnarly problem, but when you add them all up, there are lots and lots of little problems. As long as you can keep each of those pieces simple, that seems to be the key. It’s more of a philosophy, I think, than anything else.
On manageability:
BF Are there scaling reasons to think about the benefits of a command line for managing over a GUI, or are there other things to think about?
PS Our operations group never wants to rely on any sort of user interface. Everything has to be scriptable and run from some sort of command line. That’s the only way you’re going to be able to execute scripts and gather the results over thousands of machines.
On scalability:
BF Is storage going to change enough so that maybe just the next round of disks will be fast enough that you don’t need to worry about that?
PS If you rely on scale up, you’ll probably get killed. You should always be relying on scale out.
In other words, every scaling problem is ultimately a distributed computing problem.
I'm getting dictionary spams to my catch-all, so I have to add some more addresses to my procmail blacklist (code wrapped to not bust my layout).
Here I mine my junk mail folder for addresses the spammers have used:
$ perl -lne "print $1 if /(\w+)\@keithdevens\.com/;"
C:\keith\prog\Thunderbird\ImapMail\keithdevens.com\INBOX.sbd\Junk
| sort | uniq > addresses.txt
Then I (manually) examine addresses to remove ones I don't want to block, add the existing addresses I'm blocking, then regexify them to aid procmail:
$ perl -e "use Regex::PreSuf; print presuf(<>)" addresses.txt
and copy that to my procmail blacklist.
alexking.org: Blog > FeedLounge is Open! Cool, gonna try as soon as they get their "tour" (i.e. trial) up.
Update: Ah heck, it was worth the 5 bucks to me to not wait. But I had to make sure they had the "river of news" view first by checking the screenshots.
Ok, it's taking a while to import my OPML.
Well, that finished in a few minutes, but I closed my browser and now when I try to log in I'm getting a 'Server Error'.
new⇒Court rejects death penalty for raping children - Yahoo! News
:)...
Keith: Jul 4, 11:32am