194 open source projects that are alternatives to or similar to crawlzone

City Scrapers
Scrape, standardize and share public meetings from local government websites
Stars: ✭ 220 (+214.29%)
Mutual labels:  web-scraping
Pythoncode Tutorials
The Python Code Tutorials
Stars: ✭ 544 (+677.14%)
Mutual labels:  web-scraping
Html Metadata
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (+84.29%)
Mutual labels:  web-scraping
User Agents
A JavaScript library for generating random user agents with data that's updated daily.
Stars: ✭ 485 (+592.86%)
Mutual labels:  web-scraping
concurrent-web-scraping
Building a Concurrent Web Scraper with Python and Selenium
Stars: ✭ 28 (-60%)
Mutual labels:  web-scraping
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+562.86%)
Mutual labels:  web-scraping
30 Days Of Python
Learn Python for the next 30 (or so) Days.
Stars: ✭ 1,748 (+2397.14%)
Mutual labels:  web-scraping
Selectolax
Python binding to Modest engine (fast HTML5 parser with CSS selectors).
Stars: ✭ 368 (+425.71%)
Mutual labels:  web-scraping
Short Jokes Dataset
Python scripts for building 'Short Jokes' dataset, featured on Kaggle
Stars: ✭ 215 (+207.14%)
Mutual labels:  web-scraping
Ache
ACHE is a web crawler for domain-specific search.
Stars: ✭ 320 (+357.14%)
Mutual labels:  web-scraping
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+2065.71%)
Mutual labels:  web-scraping
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+295.71%)
Mutual labels:  web-scraping
Kattis-Demos-Testing
Kattis demo solutions with unit testing
Stars: ✭ 14 (-80%)
Mutual labels:  automated-testing
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+4047.14%)
Mutual labels:  web-scraping
Scrapyd Cluster On Heroku
Set up a free and scalable Scrapyd cluster for distributed web crawling with just a few clicks.
Stars: ✭ 106 (+51.43%)
Mutual labels:  web-scraping
comic-scraper
[Python] Scrapes comics and manga from various websites and creates cbz files from them
Stars: ✭ 16 (-77.14%)
Mutual labels:  web-scraping
Trump Lies
Tutorial: Web scraping in Python with Beautiful Soup
Stars: ✭ 201 (+187.14%)
Mutual labels:  web-scraping
Text-Analysis
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-31.43%)
Mutual labels:  web-scraping
Pulsar
Turn large Web sites into tables and charts using simple SQLs.
Stars: ✭ 100 (+42.86%)
Mutual labels:  web-scraping
wayback
⏪ Tools to Work with the Various Internet Archive Wayback Machine APIs
Stars: ✭ 52 (-25.71%)
Mutual labels:  web-scraping
Webmiddle
Node.js framework for modular web scraping and data extraction
Stars: ✭ 13 (-81.43%)
Mutual labels:  web-scraping
article-summary-deep-learning
📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!
Stars: ✭ 18 (-74.29%)
Mutual labels:  web-scraping
Splashr
💦 Tools to Work with the 'Splash' JavaScript Rendering Service in R
Stars: ✭ 93 (+32.86%)
Mutual labels:  web-scraping
PaperScraper
A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.
Stars: ✭ 63 (-10%)
Mutual labels:  web-scraping
Twitter Intelligence
Twitter Intelligence OSINT project that performs tracking and analysis of Twitter
Stars: ✭ 179 (+155.71%)
Mutual labels:  web-scraping
linkextractor
A Docker tutorial using a link extraction application example
Stars: ✭ 41 (-41.43%)
Mutual labels:  web-scraping
Humanoid
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (+25.71%)
Mutual labels:  web-scraping
halfstaff
🇺🇸 Is the US flag at half-staff?
Stars: ✭ 22 (-68.57%)
Mutual labels:  web-scraping
lopez
Crawling and scraping the Web for fun and profit
Stars: ✭ 20 (-71.43%)
Mutual labels:  web-scraping
investigation-amazon-brands
Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"
Stars: ✭ 56 (-20%)
Mutual labels:  web-scraping
Rvest
Simple web scraping for R
Stars: ✭ 1,253 (+1690%)
Mutual labels:  web-scraping
actor-scraper
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
Stars: ✭ 83 (+18.57%)
Mutual labels:  web-scraping
Web Database Analytics
Web scraping and related analytics using Python tools
Stars: ✭ 175 (+150%)
Mutual labels:  web-scraping
heroshi
Heroshi – open source web crawler.
Stars: ✭ 51 (-27.14%)
Mutual labels:  web-scraping
Reader
Extract clean(er), readable text from web pages via Mercury Web Parser.
Stars: ✭ 75 (+7.14%)
Mutual labels:  web-scraping
tableau-scraping
Tableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+30%)
Mutual labels:  web-scraping
Quora Api
An unofficial API for Quora.
Stars: ✭ 250 (+257.14%)
Mutual labels:  web-scraping
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-78.57%)
Mutual labels:  web-scraping
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-2.86%)
Mutual labels:  web-scraping
India-WhatsAppFakeNews-Dataset
News articles about WhatsApp-related deaths, along with other articles from across India during that period
Stars: ✭ 41 (-41.43%)
Mutual labels:  web-scraping
Scrapy Training
Scrapy Training companion code
Stars: ✭ 157 (+124.29%)
Mutual labels:  web-scraping
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-78.57%)
Mutual labels:  web-scraping
Decapitated
Headless 'Chrome' Orchestration in R
Stars: ✭ 65 (-7.14%)
Mutual labels:  web-scraping
Node-js-functionalities
Useful RESTful APIs and functionality in Node.js, with tutorial code for mastering Node.js. All tutorials have been published on medium.com.
Stars: ✭ 69 (-1.43%)
Mutual labels:  web-scraping
Hi
A Programming language for Web Scraping
Stars: ✭ 14 (-80%)
Mutual labels:  web-scraping
WaWebSessionHandler
(DISCONTINUED) Save WhatsApp Web Sessions as files and open them everywhere!
Stars: ✭ 27 (-61.43%)
Mutual labels:  web-scraping
Instago
Download/access photos, videos, stories, story highlights, postlives, following and followers of Instagram
Stars: ✭ 59 (-15.71%)
Mutual labels:  web-scraping
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+1.43%)
Mutual labels:  web-scraping
Web Scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (+118.57%)
Mutual labels:  web-scraping
iww
AI based web-wrapper for web-content-extraction
Stars: ✭ 61 (-12.86%)
Mutual labels:  web-scraping
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (-25.71%)
Mutual labels:  web-scraping
htmlunit
🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library
Stars: ✭ 39 (-44.29%)
Mutual labels:  web-scraping
Wayback Machine Scraper
A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.
Stars: ✭ 230 (+228.57%)
Mutual labels:  web-scraping
Uc Davis Cs Exams Analysis
📈 Regression and Classification with UC Davis student quiz data and exam data
Stars: ✭ 33 (-52.86%)
Mutual labels:  web-scraping
2017-summer-workshop
Exercises, data, and more for our 2017 summer workshop (funded by the Estes Fund and in partnership with Project Jupyter and Berkeley's D-Lab)
Stars: ✭ 33 (-52.86%)
Mutual labels:  web-scraping
xharness
C# command line tool for running tests on Android / iOS / tvOS devices and simulators
Stars: ✭ 123 (+75.71%)
Mutual labels:  automated-testing
jdi-dark
Powerful Framework for Backend Automation Testing on Java (Rest, Soap, WebSocket)
Stars: ✭ 36 (-48.57%)
Mutual labels:  automated-testing
Docbao
A tool for crawling and analyzing keywords from Vietnamese online news sites
Stars: ✭ 230 (+228.57%)
Mutual labels:  web-scraping
Phpscraper
PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (+111.43%)
Mutual labels:  web-scraping
Letterboxd recommendations
Scraping publicly-accessible Letterboxd data and creating a movie recommendation model with it that can generate recommendations when provided with a Letterboxd username
Stars: ✭ 23 (-67.14%)
Mutual labels:  web-scraping