Scraping with Ruby and FireBug

about 10-20 lines of code..

I like that.

clipped from www.igvita.com

Ruby Screen-Scraper in 60 Seconds

I often find myself trying to automate content extraction from a saved HTML file or a remote server. I’ve tried a number of approaches over the years, but the dynamic duo of Hpricot and Firebug blew me away – this is by far the fastest way to get what you want without compromising flexibility. Hpricot is an extremely powerful ruby-based HTML parser, and Firebug is arguably the best on-the-fly development add-on for Firefox. Now, I said it will take you about 60 seconds. I lied, it should take less. Let’s get right to it.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

%d bloggers like this: