Grab - site scraping framework

Grab can help you:

  • Extract data from a web site
  • Work with a web-service API
  • Automate activity on a web site
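
As a rough illustration of the first item, here is a minimal sketch of pulling a page title with Grab. It assumes a Grab version that provides `Grab.go()` and the `doc.select()` XPath selector; method names may differ in other releases. The `fetch_title` helper and its optional `client` parameter are hypothetical names introduced only for this example.

```python
def fetch_title(url, client=None):
    """Return the text of the <title> element of the page at `url`.

    `client` is a hypothetical hook: any object with a Grab-like
    `go()` method and `doc.select()` selector can be passed in.
    """
    if client is None:
        # Imported lazily so the helper can be defined without Grab installed.
        from grab import Grab
        client = Grab()
    # Perform the network request.
    client.go(url)
    # Select the <title> node with an XPath query and return its text.
    return client.doc.select("//title").text()
```

Calling `fetch_title("http://example.com/")` with Grab installed would request the page and return its title text; the URL here is only a placeholder.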

Important information:

If you want to use Grab on Windows, you should download our build of the pycurl library. It fixes a pycurl bug that causes some POST requests to fail. Download link: pycurl-ssl-7.19.0.win32-py2.7.msi

Discussions in python-grab

  • 23 July 11:22: How do I set a proxy list for a spider?
  • 21 July 14:25: (7, 'Failed connect to yandex.ru:41990; Invalid argument')
  • 14 July 21:57: Can a selector get the text before a tag?
  • 11 July 13:13: Python 3.x support
  • 06 July 15:32: Free webinar - How to Manage Your Python Open Source

Documentation

The docs are at docs.grablib.org. They were originally written in Russian, and I am now translating them into English.

Here are the incomplete English docs.

How to help the Grab project

  1. Write about Grab on your blog or on a popular discussion board such as reddit or Hacker News
  2. Report a bug and describe the details
  3. Create a new feature and submit a pull request
  4. Order a site-scraping project at DataLab
