Grab - site scraping framework

Grab could help you to:

  • Extract data from web site
  • Work with web-service API
  • Automate some activty on the web site

Important information:

If you want to use Grab in Windows OS then you should to download our pycurl library compilation, we have fixed the bug in pycurl library which causes some POST requests to fail. Link to download: pycurl-ssl-7.19.0.win32-py2.7.msi

Discussions in python-grab

  • 31 July 07:00: Javascript URL extraction
  • 29 July 19:17: Ограничение количества асинхронных запросов
  • 28 July 17:01: как узнать, что граб вернул ошибку?
  • 23 July 11:22: как спайдеру задать проксилист?
  • 21 July 14:25: (7, 'Failed connect to yandex.ru:41990; Invalid argument')

Documenation

Docs are here docs.grablib.org. Originally docs were written in Russian. Now I am trying to tranlate documentation into English.

Here is incompleted English docs.

How to help Grab project

  1. Write publication about the Grab in your blog or on some pupular discussion board like reddit or hacker news
  2. Report a bug, describe details
  3. Create new feature and submit pull-request
  4. Order some site-scraping project at DataLab

Development activity


Fork me on GitHub