Grab - site scraping framework

Grab can help you:

  • Extract data from websites
  • Work with web service APIs
  • Automate activity on a website
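
To make the first item concrete, here is a minimal sketch of the kind of extraction task meant. It deliberately uses only Python's standard library and does not show Grab's own API; the class name ImageSrcParser, the helper extract, and the sample HTML are all hypothetical and exist only for illustration. It pulls the page title and every img src value out of an HTML string.

```python
from html.parser import HTMLParser


class ImageSrcParser(HTMLParser):
    """Collect the page title and every <img src="..."> value."""

    def __init__(self):
        super().__init__()
        self.title = ""
        self.image_sources = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag == "img":
            # attrs is a list of (name, value) pairs
            src = dict(attrs).get("src")
            if src:
                self.image_sources.append(src)

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False

    def handle_data(self, data):
        # Only text inside <title>...</title> is kept
        if self._in_title:
            self.title += data


def extract(html):
    """Return (title, list of image sources) for an HTML string."""
    parser = ImageSrcParser()
    parser.feed(html)
    parser.close()
    return parser.title.strip(), parser.image_sources


# Hypothetical sample page; a real scraper would download this first.
SAMPLE_HTML = """
<html>
  <head><title>Example shop</title></head>
  <body>
    <img src="/static/logo.png" alt="logo">
    <img src="/static/banner.jpg">
  </body>
</html>
"""

title, images = extract(SAMPLE_HTML)
print(title)
print(images)
```

A scraping framework typically combines this parsing step with downloading the page, handling cookies, and submitting forms, so that one object covers the whole cycle.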

Important information:

If you want to use Grab on Windows, you should download our build of the pycurl library. It fixes a pycurl bug that causes some POST requests to fail. Download link: pycurl-ssl-7.19.0.win32-py2.7.msi

Discussions in python-grab

  • 20 January 21:05: Need advice on parsing img src
  • 20 January 17:32: Re: Is there an alternative to Hammer_mode?
  • 19 January 15:36: Dynamic xpath expression
  • 04 January 05:41: Skype room for discussing site scraping
  • 03 January 11:34: Cloning a Grab object when using a proxy

Documentation

The docs are at docs.grablib.org. They were originally written in Russian, and I am now translating them into English.

The English docs, still incomplete, are here.

How to help the Grab project

  1. Write about Grab on your blog or on a popular discussion board such as Reddit or Hacker News
  2. Report a bug and describe the details
  3. Implement a new feature and submit a pull request
  4. Order a site-scraping project from DataLab
