All Projects → scrapy → Scrapy

scrapy / Scrapy

Licence: other
Scrapy, a fast high-level web crawling & scraping framework for Python.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Scrapy

Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (-63.31%)
Mutual labels:  crawler, scraping, crawling, framework
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-99.53%)
Mutual labels:  crawler, scraping, crawling, framework
Ferret
Declarative web scraping
Stars: ✭ 4,837 (-88.58%)
Mutual labels:  hacktoberfest, crawler, scraping, crawling
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (-99.32%)
Mutual labels:  crawler, scraping, crawling, framework
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-99.6%)
Mutual labels:  crawler, scraping, crawling
Pastepwn
Python framework to scrape Pastebin pastes and analyze them
Stars: ✭ 87 (-99.79%)
Mutual labels:  hacktoberfest, scraping, framework
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-99.35%)
Mutual labels:  crawler, scraping, crawling
bots-zoo
No description or website provided.
Stars: ✭ 59 (-99.86%)
Mutual labels:  crawler, scraping, crawling
Spidermon
Scrapy Extension for monitoring spiders execution.
Stars: ✭ 309 (-99.27%)
Mutual labels:  hacktoberfest, scraping, crawling
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (-87.89%)
Mutual labels:  crawler, scraping, crawling
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (-98.62%)
Mutual labels:  crawler, scraping, crawling
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-99.76%)
Mutual labels:  crawler, scraping, crawling
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-98.96%)
Mutual labels:  crawler, scraping, crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-98.14%)
Mutual labels:  crawler, scraping, crawling
Webmagic
A scalable web crawler framework for Java.
Stars: ✭ 10,186 (-75.94%)
Mutual labels:  crawler, scraping, framework
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (-97.06%)
Mutual labels:  crawler, scraping
Loadingshimmer
An easy way to add a shimmering effect to any view with just one line of code. It is useful as an unobtrusive loading indicator.
Stars: ✭ 1,180 (-97.21%)
Mutual labels:  hacktoberfest, framework
Truffle
A tool for developing smart contracts. Crafted with the finest cacaos.
Stars: ✭ 11,909 (-71.87%)
Mutual labels:  hacktoberfest, framework
Edxposedmanager
Companion Android application for EdXposed
Stars: ✭ 1,172 (-97.23%)
Mutual labels:  hacktoberfest, framework
Dataengineeringproject
Example end to end data engineering project.
Stars: ✭ 82 (-99.81%)
Mutual labels:  hacktoberfest, scraping

/artwork/scrapy-logo.jpg

Scrapy

PyPI Version Supported Python Versions Ubuntu macOS Windows Wheel Status Coverage report Conda Version

Overview

Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.

Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.

Requirements

  • Python 3.6+
  • Works on Linux, Windows, macOS, BSD

Install

The quick way:

pip install scrapy

See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.

Documentation

Documentation is available online at https://docs.scrapy.org/ and in the docs directory.

Releases

You can check https://docs.scrapy.org/en/latest/news.html for the release notes.

Community (blog, twitter, mail list, IRC)

See https://scrapy.org/community/ for details.

Contributing

See https://docs.scrapy.org/en/master/contributing.html for details.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct (see https://github.com/scrapy/scrapy/blob/master/CODE_OF_CONDUCT.md).

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to [email protected].

Companies using Scrapy

See https://scrapy.org/companies/ for a list.

Commercial Support

See https://scrapy.org/support/ for details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].