All Projects → roach-php → core

roach-php / core

Licence: other
The complete web scraping toolkit for PHP.

Programming Languages

PHP
23972 projects - #3 most used programming language

Projects that are alternatives of or similar to core

Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+184.14%)
Mutual labels:  crawling, web-scraping
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-75.05%)
Mutual labels:  crawling, web-scraping
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-93.87%)
Mutual labels:  crawling, web-scraping
N2h4
네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (-84.05%)
Mutual labels:  crawling
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-82.16%)
Mutual labels:  crawling
pdf-crawler
SimFin's open source PDF crawler
Stars: ✭ 100 (-90.99%)
Mutual labels:  crawling
podcastcrawler
PHP library to find podcasts
Stars: ✭ 40 (-96.4%)
Mutual labels:  crawling
Holiday Cn
📅🇨🇳 中国法定节假日数据 自动每日抓取国务院公告
Stars: ✭ 157 (-85.86%)
Mutual labels:  crawling
lopez
Crawling and scraping the Web for fun and profit
Stars: ✭ 20 (-98.2%)
Mutual labels:  web-scraping
concurrent-web-scraping
Building a Concurrent Web Scraper with Python and Selenium
Stars: ✭ 28 (-97.48%)
Mutual labels:  web-scraping
wayback
⏪ Tools to Work with the Various Internet Archive Wayback Machine APIs
Stars: ✭ 52 (-95.32%)
Mutual labels:  web-scraping
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+1299.55%)
Mutual labels:  crawling
Hi
A Programming language for Web Scraping
Stars: ✭ 14 (-98.74%)
Mutual labels:  web-scraping
Nutch
Apache Nutch is an extensible and scalable web crawler
Stars: ✭ 2,277 (+105.14%)
Mutual labels:  crawling
2017-summer-workshop
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (-97.03%)
Mutual labels:  web-scraping
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-84.59%)
Mutual labels:  crawling
puppet-master
Puppeteer as a service hosted on Saasify.
Stars: ✭ 25 (-97.75%)
Mutual labels:  crawling
UofT-Timetable-Generator
A web application that generates timetables for university students at the University of Toronto
Stars: ✭ 34 (-96.94%)
Mutual labels:  web-scraping
Memorious
Distributed crawling framework for documents and structured data.
Stars: ✭ 248 (-77.66%)
Mutual labels:  crawling
BaiduSpider
项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (-97.39%)
Mutual labels:  crawling

🐴 Roach

Latest Stable Version Total Downloads

A complete web scraping toolkit for PHP

About

Roach is a complete web scraping toolkit for PHP. It is heavily inspired (read: a shameless clone) of the popular Scrapy package for Python.

Installation

Install the package via composer

composer require roach-php/core

Documentation

The full documentation can be found here.

Credits

License

MIT

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].