All Projects → Ache → Similar Projects or Alternatives

183 Open source projects that are alternatives of or similar to Ache

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-95.31%)

Mutual labels: web-crawler, web-scraping

Pulsar

Turn large Web sites into tables and charts using simple SQLs.

Stars: ✭ 100 (-68.75%)

Mutual labels: web-scraping, web-crawler

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (+105%)

Mutual labels: web-scraping, web-crawler

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-13.44%)

Mutual labels: web-scraping, web-crawler

flink-crawler

Continuous scalable web crawler built on top of Flink and crawler-commons

Stars: ✭ 48 (-85%)

Mutual labels: web-crawler

top-github-scraper

Scape top GitHub repositories and users based on keywords

Stars: ✭ 40 (-87.5%)

Mutual labels: web-scraping

proxi

Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.

Stars: ✭ 32 (-90%)

Mutual labels: web-crawler

IMDB-Scraper

Scrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.

Stars: ✭ 37 (-88.44%)

Mutual labels: web-scraping

comic-scraper

[Python] Scraps comics and manga from various websites and creates cbz files from them

Stars: ✭ 16 (-95%)

Mutual labels: web-scraping

linkextractor

A Docker tutorial using a link extraction application example

Stars: ✭ 41 (-87.19%)

Mutual labels: web-scraping

Node-js-functionalities

This repository contains very useful restful API's and functionalities in node-js containing many important tutorial code for mastering node-js, all tutorials have been published on medium.com, tutorials link is given below

Stars: ✭ 69 (-78.44%)

Mutual labels: web-scraping

restaurant-finder-featureReviews

Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).

Stars: ✭ 21 (-93.44%)

Mutual labels: web-scraping

Movie-Recommendation-System-with-Sentiment-Analysis

Content based movie recommendation system with sentiment analysis

Stars: ✭ 44 (-86.25%)

Mutual labels: web-scraping

Springboard-Data-Science-Immersive

No description or website provided.

Stars: ✭ 52 (-83.75%)

Mutual labels: web-scraping

ComicBookMaker

Script to fetch webcomics and use them to create ebooks.

Stars: ✭ 27 (-91.56%)

Mutual labels: web-crawler

scraping-ebay

Scraping Ebay's products using Scrapy Web Crawling Framework

Stars: ✭ 79 (-75.31%)

Mutual labels: web-scraping

PaperScraper

A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.

Stars: ✭ 63 (-80.31%)

Mutual labels: web-scraping

siteshooter

📷 Automate full website screenshots and PDF generation with multiple viewport support.

Stars: ✭ 63 (-80.31%)

Mutual labels: web-crawler

Php Curl Class

PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs

Stars: ✭ 2,903 (+807.19%)

Mutual labels: web-scraping

SchweizerMesser

🎯Python 3 网络爬虫实战、数据分析合集 | 当当 | 网易云音乐 | unsplash | 必胜客 | 猫眼 |

Stars: ✭ 89 (-72.19%)

Mutual labels: web-crawler

WebCrawler

Just a simple web crawler which return crawled links as IObservable using reactive extension and async await.

Stars: ✭ 55 (-82.81%)

Mutual labels: web-crawler

rreddit

𝐫⟋ Get Reddit data

Stars: ✭ 49 (-84.69%)

Mutual labels: web-scraping

extractnet

A Dragnet that also extract author, headline, date, keywords from context

Stars: ✭ 52 (-83.75%)

Mutual labels: web-scraping

Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Stars: ✭ 48 (-85%)

Mutual labels: web-scraping

evine

Interactive CLI Web Crawler

Stars: ✭ 140 (-56.25%)

Mutual labels: web-crawler

learncpp-download

Scrape bot, to get you an offline copy of tutorials

Stars: ✭ 23 (-92.81%)

Mutual labels: web-crawler

actor-scraper

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

Stars: ✭ 83 (-74.06%)

Mutual labels: web-scraping

comp thinking social science

Computational Thinking for Social Scientists book project

Stars: ✭ 42 (-86.87%)

Mutual labels: web-scraping

heroshi

Heroshi – open source web crawler.

Stars: ✭ 51 (-84.06%)

Mutual labels: web-scraping

UnChain

A tool to find redirection chains in multiple URLs

Stars: ✭ 77 (-75.94%)

Mutual labels: web-crawler

tableau-scraping

Tableau scraper python library. R and Python scripts to scrape data from Tableau viz

Stars: ✭ 91 (-71.56%)

Mutual labels: web-scraping

article-summary-deep-learning

📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!

Stars: ✭ 18 (-94.37%)

Mutual labels: web-scraping

text-mining-corona-articles

Text Mining for Indonesian Online News Articles About Corona

Stars: ✭ 15 (-95.31%)

Mutual labels: web-scraping

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Stars: ✭ 3,154 (+885.63%)

Mutual labels: web-scraping

Mimo-Crawler

A web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.

Stars: ✭ 22 (-93.12%)

Mutual labels: web-crawler

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-95.31%)

Mutual labels: web-scraping

India-WhatsAppFakeNews-Dataset

WhatsApps related deaths News Articles along with other articles across India during that period

Stars: ✭ 41 (-87.19%)

Mutual labels: web-scraping

Competitive Programming Score API

API to get user details for competitive coding platforms - Codeforces, Codechef, SPOJ, Interviewbit

Stars: ✭ 118 (-63.12%)

Mutual labels: web-scraping

audiobooker

Audio Book scrapper

Stars: ✭ 14 (-95.62%)

Mutual labels: web-scraping

htmlunit

🕸🧰☕️Tools to Scrape Dynamic Web Content via the 'HtmlUnit' Java Library

Stars: ✭ 39 (-87.81%)

Mutual labels: web-scraping

automation-scripts

Simple scripts that I'm using to automate the boring things.

Stars: ✭ 14 (-95.62%)

Mutual labels: web-scraping

Basketball reference web scraper

NBA Stats API via Basketball Reference

Stars: ✭ 279 (-12.81%)

Mutual labels: web-scraping

leetcode-compensation

Compensation analysis on the posts scraped from leetcode.com/discuss/compensation. At present, the reports have been generated only for Indian cities.

Stars: ✭ 83 (-74.06%)

Mutual labels: web-scraping

sp-subway-scraper

🚆This web scraper builds a dataset for São Paulo subway operation status

Stars: ✭ 24 (-92.5%)

Mutual labels: web-scraping

WaWebSessionHandler

(DISCONTINUED) Save WhatsApp Web Sessions as files and open them everywhere!

Stars: ✭ 27 (-91.56%)

Mutual labels: web-scraping

Stock-Fundamental-data-scraping-and-analysis

Project on building a web crawler to collect the fundamentals of the stock and review their performance in one go

Stars: ✭ 40 (-87.5%)

Mutual labels: web-scraping

browser-pool

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

Stars: ✭ 71 (-77.81%)

Mutual labels: web-scraping

halfstaff

🇺🇸 Is the US flag at half-staff?

Stars: ✭ 22 (-93.12%)

Mutual labels: web-scraping

iww

AI based web-wrapper for web-content-extraction

Stars: ✭ 61 (-80.94%)

Mutual labels: web-scraping

Spidy

The simple, easy to use command line web crawler.

Stars: ✭ 257 (-19.69%)

Mutual labels: web-crawler

Linkedin-Client

Web scraper for grabing data from Linkedin profiles or company pages (personal project)

Stars: ✭ 42 (-86.87%)

Mutual labels: web-scraping

pyCreeper

一个用来快速提取网页内容的信息采集（爬虫）框架，实现了对网页的动态加载与控制。

Stars: ✭ 25 (-92.19%)

Mutual labels: web-crawler

raspagem-de-dados-fatec

📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí

Stars: ✭ 22 (-93.12%)

Mutual labels: web-scraping

grailer

web scraping tool for grailed.com

Stars: ✭ 30 (-90.62%)

Mutual labels: web-scraping

codechef-rank-comparator

Web application hosted on Heroku cloud platform based on web scraping in python using lxml library (XML Path Language).

Stars: ✭ 23 (-92.81%)

Mutual labels: web-scraping

bolsa

Biblioteca feita em Python com o objetivo de facilitar o acesso a dados de seus investimentos na bolsa de valores(B3/CEI) através do Portal CEI.

Stars: ✭ 46 (-85.62%)

Mutual labels: web-crawler

Supercrawler

A web crawler. Supercrawler automatically crawls websites. Define custom handlers to parse content. Obeys robots.txt, rate limits and concurrency limits.

Stars: ✭ 306 (-4.37%)

Mutual labels: web-crawler

investigation-amazon-brands

Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"

Stars: ✭ 56 (-82.5%)

Mutual labels: web-scraping

Data-Wrangling-with-Python

Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices

Stars: ✭ 90 (-71.87%)

Mutual labels: web-scraping

Lagoujob

Job data mining repo for lagou.com

Stars: ✭ 256 (-20%)

Mutual labels: web-crawler

1-60 of 183 similar projects

›