All Projects → Phpscraper → Similar Projects or Alternatives

693 Open source projects that are alternatives of or similar to Phpscraper

Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+61.49%)
Mutual labels:  scraper, scraping, web-scraping, web-scraper
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-89.86%)
Mutual labels:  scraper, scraping, web-scraping
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+343.24%)
Mutual labels:  scraper, web-scraping, web-scraper
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+213.51%)
Mutual labels:  scraping, web-scraping, web-scraper
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-89.86%)
Mutual labels:  scraper, web-scraper, web-scraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+2654.73%)
Mutual labels:  scraper, scraping, web-scraping
top-github-scraper
Scape top GitHub repositories and users based on keywords
Stars: ✭ 40 (-72.97%)
Mutual labels:  scraping, web-scraper, web-scraping
Detect Cms
PHP Library for detecting CMS
Stars: ✭ 78 (-47.3%)
Mutual labels:  scraping, web-scraping, web-scraper
Linkedin-Client
Web scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (-71.62%)
Mutual labels:  scraper, web-scraper, web-scraping
Zillow
Zillow Scraper for Python using Selenium
Stars: ✭ 141 (-4.73%)
Mutual labels:  scraper, web-scraping
scrapy facebooker
Collection of scrapy spiders which can scrape posts, images, and so on from public Facebook Pages.
Stars: ✭ 22 (-85.14%)
Mutual labels:  scraper, scraping
whatsapp-tracking
Scraping the status of WhatsApp contacts
Stars: ✭ 49 (-66.89%)
Mutual labels:  scraper, scraping
Seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Stars: ✭ 117 (-20.95%)
Mutual labels:  scraper, scraping
document-dl
Command line program to download documents from web portals.
Stars: ✭ 14 (-90.54%)
Mutual labels:  scraper, scraping
Html Metadata
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (-12.84%)
Mutual labels:  web-scraping, web-scraper
AzurLaneWikiScrapers
A console application that can scrape the Azur Lane wiki and export the data to Json files
Stars: ✭ 12 (-91.89%)
Mutual labels:  scraper, web-scraper
scraper
Nodejs web scraper. Contains a command line, docker container, terraform module and ansible roles for distributed cloud scraping. Supported databases: SQLite, MySQL, PostgreSQL. Supported headless clients: Puppeteer, Playwright, Cheerio, JSdom.
Stars: ✭ 37 (-75%)
Mutual labels:  scraper, scraping
bots-zoo
No description or website provided.
Stars: ✭ 59 (-60.14%)
Mutual labels:  scraper, scraping
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+1861.49%)
Mutual labels:  web-scraping, web-scraper
Linkedin
Linkedin Scraper using Selenium Web Driver, Chromium headless, Docker and Scrapy
Stars: ✭ 309 (+108.78%)
Mutual labels:  scraper, scraping
copycat
A PHP Scraping Class
Stars: ✭ 70 (-52.7%)
Mutual labels:  scraper, scraping
Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+2031.08%)
Mutual labels:  scraping, web-scraping
Katana
A Python Tool For google Hacking
Stars: ✭ 355 (+139.86%)
Mutual labels:  scraper, scraping
Sqrape
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (-2.7%)
Mutual labels:  scraping, web-scraping
Rod
A Devtools driver for web automation and scraping
Stars: ✭ 1,392 (+840.54%)
Mutual labels:  scraper, web-scraping
Awesome Crawler
A collection of awesome web crawler,spider in different languages
Stars: ✭ 4,793 (+3138.51%)
Mutual labels:  scraper, web-scraper
Imagescraper
✂️ High performance, multi-threaded image scraper
Stars: ✭ 630 (+325.68%)
Mutual labels:  scraper, scraping
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-64.86%)
Mutual labels:  scraper, scraping
Captcha-Tools
All-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-84.46%)
Mutual labels:  scraper, scraping
angel.co-companies-list-scraping
No description or website provided.
Stars: ✭ 54 (-63.51%)
Mutual labels:  scraper, scraping
Scraper-Projects
🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (-83.11%)
Mutual labels:  scraper, scraping
Sillynium
Automate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (-32.43%)
Mutual labels:  scraper, web-scraping
sp-subway-scraper
🚆This web scraper builds a dataset for São Paulo subway operation status
Stars: ✭ 24 (-83.78%)
Mutual labels:  scraper, web-scraping
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (-52.03%)
Mutual labels:  scraping, web-scraping
raspagem-de-dados-fatec
📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí
Stars: ✭ 22 (-85.14%)
Mutual labels:  scraping, web-scraping
Zeiver
A Scraper, Downloader, & Recorder for static open directories.
Stars: ✭ 14 (-90.54%)
Mutual labels:  scraper, scraping
facebook-discussion-tk
A collection of tools to (semi-)automatically collect and analyze data from online discussions on Facebook groups and pages.
Stars: ✭ 33 (-77.7%)
Mutual labels:  scraper, scraping
TorScrapper
A Scraper made 100% in Python using BeautifulSoup and Tor. It can be used to scrape both normal and onion links. Happy Scraping :)
Stars: ✭ 24 (-83.78%)
Mutual labels:  scraper, scraping
Basketball reference web scraper
NBA Stats API via Basketball Reference
Stars: ✭ 279 (+88.51%)
Mutual labels:  web-scraping, web-scraper
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+87.16%)
Mutual labels:  scraping, web-scraping
Faster Than Requests
Faster requests on Python 3
Stars: ✭ 639 (+331.76%)
Mutual labels:  web-scraping, web-scraper
Pypatent
Search for and retrieve US Patent and Trademark Office Patent Data
Stars: ✭ 31 (-79.05%)
Mutual labels:  scraper, scraping
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-64.86%)
Mutual labels:  web-scraping, web-scraper
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+3168.24%)
Mutual labels:  scraper, scraping
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+3365.54%)
Mutual labels:  scraper, scraping
Dataflowkit
Extract structured data from web sites. Web sites scraping.
Stars: ✭ 456 (+208.11%)
Mutual labels:  scraper, scraping
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+433.11%)
Mutual labels:  scraper, scraping
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-54.05%)
Mutual labels:  web-scraping, web-scraper
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (+591.89%)
Mutual labels:  scraper, scraping
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+197.3%)
Mutual labels:  scraper, scraping
Cascadia
Go cascadia package command line CSS selector
Stars: ✭ 67 (-54.73%)
Mutual labels:  web-scraping, web-scraper
Email Extractor
The main functionality is to extract all the emails from one or several URLs - La funcionalidad principal es extraer todos los correos electrónicos de una o varias Url
Stars: ✭ 81 (-45.27%)
Mutual labels:  scraper, scraping
Daftlistings
A library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.
Stars: ✭ 86 (-41.89%)
Mutual labels:  web-scraping, web-scraper
Geziyor
Geziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (+741.89%)
Mutual labels:  scraper, scraping
Social Media Profile Scrapers
Fetch user's data across social media
Stars: ✭ 60 (-59.46%)
Mutual labels:  web-scraping, web-scraper
proxycrawl-python
ProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (-65.54%)
Mutual labels:  scraper, scraping
Hockey Scraper
Python Package for scraping NHL Play-by-Play and Shift data
Stars: ✭ 93 (-37.16%)
Mutual labels:  scraper, web-scraping
Scrapers
A list of scrapers from around the web.
Stars: ✭ 366 (+147.3%)
Mutual labels:  scraper, web-scraper
Scrapy Craigslist
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (-63.51%)
Mutual labels:  web-scraping, web-scraper
Humanoid
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (-40.54%)
Mutual labels:  scraping, web-scraping
1-60 of 693 similar projects