431 open-source projects that are alternatives to or similar to selectorlib

PythonScrapyBasicSetup
Basic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+7.55%)
Mutual labels:  scraping, web-scraping
raspagem-de-dados-fatec
📓 Short course on web data scraping with Python, taught at the FATEC Jundiaí Technology Week
Stars: ✭ 22 (-58.49%)
Mutual labels:  scraping, web-scraping
top-github-scraper
Scrape top GitHub repositories and users based on keywords
Stars: ✭ 40 (-24.53%)
Mutual labels:  scraping, web-scraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+7592.45%)
Mutual labels:  scraping, web-scraping
reapr
🕸→ℹ️ Reap Information from Websites
Stars: ✭ 14 (-73.58%)
Mutual labels:  web-scraping, xpath
Humanoid
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (+66.04%)
Mutual labels:  scraping, web-scraping
Webhere
HTML scraping for Objective-C.
Stars: ✭ 16 (-69.81%)
Mutual labels:  scraping, xpath
Detect Cms
PHP Library for detecting CMS
Stars: ✭ 78 (+47.17%)
Mutual labels:  scraping, web-scraping
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+33.96%)
Mutual labels:  scraping, web-scraping
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+422.64%)
Mutual labels:  scraping, web-scraping
Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+5850.94%)
Mutual labels:  scraping, web-scraping
Xquery
Extract data or evaluate value from HTML/XML documents using XPath
Stars: ✭ 155 (+192.45%)
Mutual labels:  scraping, xpath
Parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
Stars: ✭ 628 (+1084.91%)
Mutual labels:  scraping, xpath
Sqrape
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (+171.7%)
Mutual labels:  scraping, web-scraping
Phpscraper
PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (+179.25%)
Mutual labels:  scraping, web-scraping
codechef-rank-comparator
Web application hosted on the Heroku cloud platform, based on web scraping in Python using the lxml library (XML Path Language).
Stars: ✭ 23 (-56.6%)
Mutual labels:  web-scraping, xpath
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1241.51%)
Mutual labels:  scraping, web-scraping
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-71.7%)
Mutual labels:  scraping, web-scraping
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+775.47%)
Mutual labels:  scraping, web-scraping
Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+350.94%)
Mutual labels:  scraping, web-scraping
ioweb
Web Scraping Framework
Stars: ✭ 31 (-41.51%)
Mutual labels:  scraping, web-scraping
BookingScraper
🌎 🏨 Scrape Booking.com 🏨 🌎
Stars: ✭ 68 (+28.3%)
Mutual labels:  web-scraping
telenium
Automation for Kivy Application
Stars: ✭ 56 (+5.66%)
Mutual labels:  xpath
socials
👨‍👩‍👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-30.19%)
Mutual labels:  scraping
etf4u
📊 Python tool to scrape real-time information about ETFs from the web and mix them together by proportionally distributing their asset allocations
Stars: ✭ 29 (-45.28%)
Mutual labels:  scraping
faexport
The API for Furaffinity you wish existed
Stars: ✭ 61 (+15.09%)
Mutual labels:  web-scraping
scrapy-fieldstats
A Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-67.92%)
Mutual labels:  scraping
Neural-Scam-Artist
Web Scraping, Document Deduplication & GPT-2 Fine-tuning with a newly created scam dataset.
Stars: ✭ 18 (-66.04%)
Mutual labels:  web-scraping
Architeuthis
MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.
Stars: ✭ 35 (-33.96%)
Mutual labels:  scraping
saveddit
Bulk Downloader for Reddit
Stars: ✭ 130 (+145.28%)
Mutual labels:  web-scraping
oversmash
Overwatch API library for player details and career stats
Stars: ✭ 42 (-20.75%)
Mutual labels:  scraping
scrapers
scrapers for building your own image databases
Stars: ✭ 46 (-13.21%)
Mutual labels:  scraping
shorter.recipes
A website dedicated to making recipes from any website easy to read.
Stars: ✭ 27 (-49.06%)
Mutual labels:  scraping
uiautomatorview
Adds XPath waiting to uiautomatorview
Stars: ✭ 45 (-15.09%)
Mutual labels:  xpath
docker-selenium-lambda
The simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+224.53%)
Mutual labels:  scraping
turtle
Instagram Photo Downloader
Stars: ✭ 15 (-71.7%)
Mutual labels:  scraping
diffbot-php-client
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Stars: ✭ 53 (+0%)
Mutual labels:  scraping
NBA-Fantasy-Optimizer
NBA Daily Fantasy Lineup Optimizer for FanDuel Using Python
Stars: ✭ 21 (-60.38%)
Mutual labels:  scraping
double-agent
A test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+132.08%)
Mutual labels:  scraping
Python
Covers basic to advanced Python topics, practice questions, logical problems in Python, and web development using HTML, CSS, Bootstrap, jQuery, DOM, Django 🚀🚀. 💥 🌈
Stars: ✭ 29 (-45.28%)
Mutual labels:  web-scraping
Stock-Market-Predictor
Stock Market Predictor with LSTM network. Web scraping and analyzing tools (ohlc, mean)
Stars: ✭ 28 (-47.17%)
Mutual labels:  web-scraping
gochanges
**[ARCHIVED]** website changes tracker 🔍
Stars: ✭ 12 (-77.36%)
Mutual labels:  scraping
codepen-puppeteer
Use Puppeteer to download pens from Codepen.io as single html pages
Stars: ✭ 22 (-58.49%)
Mutual labels:  web-scraping
RARBG-scraper
With Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (-28.3%)
Mutual labels:  scraping
web-poet
Web scraping Page Objects core library
Stars: ✭ 67 (+26.42%)
Mutual labels:  web-scraping
crawler-chrome-extensions
Chrome extensions commonly used by crawler developers
Stars: ✭ 53 (+0%)
Mutual labels:  scraping
reason-rust-scraper
🦀 Scraping & crawling websites using Rust and ReasonML
Stars: ✭ 21 (-60.38%)
Mutual labels:  scraping
covid19br-pub
Project for monitoring official publications related to COVID-19 in Brazil.
Stars: ✭ 12 (-77.36%)
Mutual labels:  scraping
TikTokDownloader PyWebIO
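🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin|TikTok data scraping tool that supports API calls as well as online batch parsing and downloading.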
Stars: ✭ 919 (+1633.96%)
Mutual labels:  web-scraping
4cat
The 4CAT Capture and Analysis Toolkit provides modular data capture & analysis for a variety of social media platforms.
Stars: ✭ 144 (+171.7%)
Mutual labels:  scraping
core
The complete web scraping toolkit for PHP.
Stars: ✭ 1,110 (+1994.34%)
Mutual labels:  web-scraping
scrape-github-trending
Tutorial for web scraping / crawling with Node.js.
Stars: ✭ 42 (-20.75%)
Mutual labels:  scraping
actor-content-checker
You can use this act to monitor any page's content and get a notification when content changes.
Stars: ✭ 16 (-69.81%)
Mutual labels:  web-scraping
info-bot
🤖 A Versatile Telegram Bot
Stars: ✭ 37 (-30.19%)
Mutual labels:  scraping
crawlzone
Crawlzone is a fast asynchronous internet crawling framework for PHP.
Stars: ✭ 70 (+32.08%)
Mutual labels:  web-scraping
testcafe-vue-selectors
TestCafe selector extensions for Vue.js apps.
Stars: ✭ 103 (+94.34%)
Mutual labels:  selectors
exquery
EXQuery repository
Stars: ✭ 19 (-64.15%)
Mutual labels:  xpath
Goirate
Pillaging the seven seas for torrents, pieces of eight and other bounty.
Stars: ✭ 20 (-62.26%)
Mutual labels:  scraping
dont-waste-your-ducking-time
🐓 An opinionated guide on how to test Redux ducks
Stars: ✭ 28 (-47.17%)
Mutual labels:  selectors
xpath2.js
xpath.js - Open source XPath 2.0 implementation in JavaScript (DOM agnostic)
Stars: ✭ 74 (+39.62%)
Mutual labels:  xpath