693 open source projects that are alternatives to or similar to Phpscraper

`scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles and company pages, turning the data into structured JSON.

Stars: ✭ 239 (+61.49%)

Mutual labels: scraper, scraping, web-scraping, web-scraper

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-89.86%)

Mutual labels: scraper, scraping, web-scraping

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+343.24%)

Mutual labels: scraper, web-scraping, web-scraper

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (+213.51%)

Mutual labels: scraping, web-scraping, web-scraper

OLX Scraper

📻 An OLX scraper using Scrapy + MongoDB. It scrapes recent ads posted for the requested product and dumps them to a NoSQL MongoDB database.

Stars: ✭ 15 (-89.86%)

Mutual labels: scraper, web-scraper, web-scraping

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+2654.73%)

Mutual labels: scraper, scraping, web-scraping

top-github-scraper

Scrape top GitHub repositories and users based on keywords

Stars: ✭ 40 (-72.97%)

Mutual labels: scraping, web-scraper, web-scraping

Detect Cms

PHP Library for detecting CMS

Stars: ✭ 78 (-47.3%)

Mutual labels: scraping, web-scraping, web-scraper

Linkedin-Client

Web scraper for grabbing data from LinkedIn profiles or company pages (personal project)

Stars: ✭ 42 (-71.62%)

Mutual labels: scraper, web-scraper, web-scraping

Zillow

Zillow Scraper for Python using Selenium

Stars: ✭ 141 (-4.73%)

Mutual labels: scraper, web-scraping

scrapy facebooker

Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.

Stars: ✭ 22 (-85.14%)

Mutual labels: scraper, scraping

whatsapp-tracking

Scraping the status of WhatsApp contacts

Stars: ✭ 49 (-66.89%)

Mutual labels: scraper, scraping

Seleniumcrawler

An example using Selenium webdrivers for Python and the Scrapy framework to create a web scraper that crawls an ASP site

Stars: ✭ 117 (-20.95%)

Mutual labels: scraper, scraping

document-dl

Command line program to download documents from web portals.

Stars: ✭ 14 (-90.54%)

Mutual labels: scraper, scraping

Html Metadata

MetaData html scraper and parser for Node.js (supports Promises and callback style)

Stars: ✭ 129 (-12.84%)

Mutual labels: web-scraping, web-scraper

AzurLaneWikiScrapers

A console application that can scrape the Azur Lane wiki and export the data to JSON files

Stars: ✭ 12 (-91.89%)

Mutual labels: scraper, web-scraper

scraper

Node.js web scraper. Contains a command line, Docker container, Terraform module and Ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.

Stars: ✭ 37 (-75%)

Mutual labels: scraper, scraping

bots-zoo

No description or website provided.

Stars: ✭ 59 (-60.14%)

Mutual labels: scraper, scraping

Php Curl Class

PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs

Stars: ✭ 2,903 (+1861.49%)

Mutual labels: web-scraping, web-scraper

LinkedIn scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy

Stars: ✭ 309 (+108.78%)

Mutual labels: scraper, scraping

copycat

A PHP Scraping Class

Stars: ✭ 70 (-52.7%)

Mutual labels: scraper, scraping

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Stars: ✭ 3,154 (+2031.08%)

Mutual labels: scraping, web-scraping

Katana

A Python tool for Google hacking

Stars: ✭ 355 (+139.86%)

Mutual labels: scraper, scraping

Sqrape

Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)

Stars: ✭ 144 (-2.7%)

Mutual labels: scraping, web-scraping

Rod

A Devtools driver for web automation and scraping

Stars: ✭ 1,392 (+840.54%)

Mutual labels: scraper, web-scraping

Awesome Crawler

A collection of awesome web crawlers and spiders in different languages

Stars: ✭ 4,793 (+3138.51%)

Mutual labels: scraper, web-scraper

Imagescraper

✂️ High performance, multi-threaded image scraper

Stars: ✭ 630 (+325.68%)

Mutual labels: scraper, scraping

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-64.86%)

Mutual labels: scraper, scraping

Captcha-Tools

All-in-one Python (and now Go!) module to help solve captchas with the Capmonster, 2captcha and Anticaptcha APIs!

Stars: ✭ 23 (-84.46%)

Mutual labels: scraper, scraping

angel.co-companies-list-scraping

No description or website provided.

Stars: ✭ 54 (-63.51%)

Mutual labels: scraper, scraping

Scraper-Projects

🕸 List of mini projects that involve web scraping 🕸

Stars: ✭ 25 (-83.11%)

Mutual labels: scraper, scraping

Sillynium

Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements

Stars: ✭ 100 (-32.43%)

Mutual labels: scraper, web-scraping

sp-subway-scraper

🚆This web scraper builds a dataset for São Paulo subway operation status

Stars: ✭ 24 (-83.78%)

Mutual labels: scraper, web-scraping

browser-pool

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

Stars: ✭ 71 (-52.03%)

Mutual labels: scraping, web-scraping

raspagem-de-dados-fatec

📓 Short course on web data scraping with Python, given at the FATEC Jundiaí Technology Week

Stars: ✭ 22 (-85.14%)

Mutual labels: scraping, web-scraping

Zeiver

A Scraper, Downloader, & Recorder for static open directories.

Stars: ✭ 14 (-90.54%)

Mutual labels: scraper, scraping

facebook-discussion-tk

A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.

Stars: ✭ 33 (-77.7%)

Mutual labels: scraper, scraping

TorScrapper

A Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)

Stars: ✭ 24 (-83.78%)

Mutual labels: scraper, scraping

Basketball reference web scraper

NBA Stats API via Basketball Reference

Stars: ✭ 279 (+88.51%)

Mutual labels: web-scraping, web-scraper

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (+87.16%)

Mutual labels: scraping, web-scraping

Faster Than Requests

Faster requests on Python 3

Stars: ✭ 639 (+331.76%)

Mutual labels: web-scraping, web-scraper

Pypatent

Search for and retrieve US Patent and Trademark Office Patent Data

Stars: ✭ 31 (-79.05%)

Mutual labels: scraper, scraping

Project Tauro

A Router WiFi key recovery/cracking tool with a twist.

Stars: ✭ 52 (-64.86%)

Mutual labels: web-scraping, web-scraper

Ferret

Declarative web scraping

Stars: ✭ 4,837 (+3168.24%)

Mutual labels: scraper, scraping

Headless Chrome Crawler

Distributed crawler powered by Headless Chrome

Stars: ✭ 5,129 (+3365.54%)

Mutual labels: scraper, scraping

Dataflowkit

Extract structured data from websites. Website scraping.

Stars: ✭ 456 (+208.11%)

Mutual labels: scraper, scraping

Lulu

[Unmaintained] A simple and clean video/music/image downloader 👾

Stars: ✭ 789 (+433.11%)

Mutual labels: scraper, scraping

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-54.05%)

Mutual labels: web-scraping, web-scraper

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+591.89%)

Mutual labels: scraper, scraping

Crawly

Crawly, a high-level web crawling & scraping framework for Elixir.

Stars: ✭ 440 (+197.3%)

Mutual labels: scraper, scraping

Cascadia

Go cascadia package command line CSS selector

Stars: ✭ 67 (-54.73%)

Mutual labels: web-scraping, web-scraper

Email Extractor

The main functionality is to extract all the emails from one or several URLs

Stars: ✭ 81 (-45.27%)

Mutual labels: scraper, scraping

Daftlistings

A library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.

Stars: ✭ 86 (-41.89%)

Mutual labels: web-scraping, web-scraper

Geziyor

Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.

Stars: ✭ 1,246 (+741.89%)

Mutual labels: scraper, scraping

Social Media Profile Scrapers

Fetch user's data across social media

Stars: ✭ 60 (-59.46%)

Mutual labels: web-scraping, web-scraper

proxycrawl-python

ProxyCrawl Python library for scraping and crawling