Apify actor that crawls Google Search result pages (SERPs) and extracts a list of organic results, ads, related queries and more. It supports selection of custom country, language and location.

Stars: ✭ 38 (-96.97%)

Mutual labels: web-scraping

restaurant-finder-featureReviews

Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).

Stars: ✭ 21 (-98.32%)

Mutual labels: web-scraping

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+225.38%)

Mutual labels: web-scraping

Competitive Programming Score API

API to get user details for competitive coding platforms - Codeforces, Codechef, SPOJ, Interviewbit

Stars: ✭ 118 (-90.58%)

Mutual labels: web-scraping

IMDB-Scraper

Scrapy project for scraping data from IMDB with Movie Dataset including 58,623 movies' data.

Stars: ✭ 37 (-97.05%)

Mutual labels: web-scraping

Spidr

A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.

Stars: ✭ 656 (-47.65%)

Mutual labels: web-scraping

raspagem-de-dados-fatec

📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí

Stars: ✭ 22 (-98.24%)

Mutual labels: web-scraping

Scrapy Craigslist

Web Scraping Craigslist's Engineering Jobs in NY with Scrapy

Stars: ✭ 54 (-95.69%)

Mutual labels: web-scraping

papercut

Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.

Stars: ✭ 15 (-98.8%)

Mutual labels: web-scraping

Scrapy Fake Useragent

Random User-Agent middleware based on fake-useragent

Stars: ✭ 520 (-58.5%)

Mutual labels: web-scraping

sp-subway-scraper

🚆This web scraper builds a dataset for São Paulo subway operation status

Stars: ✭ 24 (-98.08%)

Mutual labels: web-scraping

Cascadia

Go cascadia package command line CSS selector

Stars: ✭ 67 (-94.65%)

Mutual labels: web-scraping

GSoC-Data-Analyser

Simple search for organisations participating/participated in the GSoC

Stars: ✭ 29 (-97.69%)

Mutual labels: web-scraping

Awesome Web Scraping

List of libraries, tools and APIs for web scraping and data processing.

Stars: ✭ 4,510 (+259.94%)

Mutual labels: web-scraping

top-github-scraper

Scape top GitHub repositories and users based on keywords

Stars: ✭ 40 (-96.81%)

Mutual labels: web-scraping

Snoop

Snoop — инструмент разведки на основе открытых данных (OSINT world)

Stars: ✭ 886 (-29.29%)

Mutual labels: web-scraping

scraping-ebay

Scraping Ebay's products using Scrapy Web Crawling Framework

Stars: ✭ 79 (-93.7%)

Mutual labels: web-scraping

Basketball reference web scraper

NBA Stats API via Basketball Reference

Stars: ✭ 279 (-77.73%)

Mutual labels: web-scraping

Php Curl Class

PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs

Stars: ✭ 2,903 (+131.68%)

Mutual labels: web-scraping

OLX Scraper

📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.

Stars: ✭ 15 (-98.8%)

Mutual labels: web-scraping

Youtube tutorials

Collection of scripts corresponding to LucidProgramming YouTube tutorials

Stars: ✭ 769 (-38.63%)

Mutual labels: web-scraping

comic-scraper

[Python] Scraps comics and manga from various websites and creates cbz files from them

Stars: ✭ 16 (-98.72%)

Mutual labels: web-scraping

Instago

Download/access photos, videos, stories, story highlights, postlives, following and followers of Instagram

Stars: ✭ 59 (-95.29%)

Mutual labels: web-scraping

Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Stars: ✭ 48 (-96.17%)

Mutual labels: web-scraping

Faster Than Requests

Faster requests on Python 3

Stars: ✭ 639 (-49%)

Mutual labels: web-scraping

comp thinking social science

Computational Thinking for Social Scientists book project

Stars: ✭ 42 (-96.65%)

Mutual labels: web-scraping

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-94.57%)

Mutual labels: web-scraping

article-summary-deep-learning

📖 Using deep learning and scraping to analyze/summarize articles! Just drop in any URL!

Stars: ✭ 18 (-98.56%)

Mutual labels: web-scraping

Pythoncode Tutorials

The Python Code Tutorials

Stars: ✭ 544 (-56.58%)

Mutual labels: web-scraping

PaperScraper

A web scraping tool to systematically extract the text of scientific papers and corresponding metadata from university accessible journals.

Stars: ✭ 63 (-94.97%)

Mutual labels: web-scraping

Project Tauro

A Router WiFi key recovery/cracking tool with a twist.

Stars: ✭ 52 (-95.85%)

Mutual labels: web-scraping

linkextractor

A Docker tutorial using a link extraction application example

Stars: ✭ 41 (-96.73%)

Mutual labels: web-scraping

User Agents

A JavaScript library for generating random user agents with data that's updated daily.

Stars: ✭ 485 (-61.29%)

Mutual labels: web-scraping

halfstaff

🇺🇸 Is the US flag at half-staff?

Stars: ✭ 22 (-98.24%)

Mutual labels: web-scraping

Reader

Extract clean(er), readable text from web pages via Mercury Web Parser.

Stars: ✭ 75 (-94.01%)

Mutual labels: web-scraping

investigation-amazon-brands

Materials to reproduce our findings in our stories, "Amazon Puts Its Own 'Brands' First Above Better-Rated Products" and "When Amazon Takes the Buy Box, it Doesn’t Give it up"

Stars: ✭ 56 (-95.53%)

Mutual labels: web-scraping

Scrapple

A framework for creating semi-automatic web content extractors

Stars: ✭ 464 (-62.97%)

Mutual labels: web-scraping

actor-scraper

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

Stars: ✭ 83 (-93.38%)

Mutual labels: web-scraping

Uc Davis Cs Exams Analysis

📈 Regression and Classification with UC Davis student quiz data and exam data

Stars: ✭ 33 (-97.37%)

Mutual labels: web-scraping

heroshi

Heroshi – open source web crawler.

Stars: ✭ 51 (-95.93%)

Mutual labels: web-scraping

Selectolax

Python binding to Modest engine (fast HTML5 parser with CSS selectors).

Stars: ✭ 368 (-70.63%)

Mutual labels: web-scraping

tableau-scraping

Tableau scraper python library. R and Python scripts to scrape data from Tableau viz

Stars: ✭ 91 (-92.74%)

Mutual labels: web-scraping

Decapitated

Headless 'Chrome' Orchestration in R

Stars: ✭ 65 (-94.81%)

Mutual labels: web-scraping

text-mining-corona-articles

Text Mining for Indonesian Online News Articles About Corona

Stars: ✭ 15 (-98.8%)

Mutual labels: web-scraping

Ache

ACHE is a web crawler for domain-specific search.

Stars: ✭ 320 (-74.46%)

Mutual labels: web-scraping

India-WhatsAppFakeNews-Dataset

WhatsApps related deaths News Articles along with other articles across India during that period

Stars: ✭ 41 (-96.73%)

Mutual labels: web-scraping

Webmiddle

Node.js framework for modular web scraping and data extraction

Stars: ✭ 13 (-98.96%)

Mutual labels: web-scraping

Gopa

[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn

Stars: ✭ 277 (-77.89%)

Mutual labels: web-scraping

Detect Cms

PHP Library for detecting CMS

Stars: ✭ 78 (-93.77%)

Mutual labels: web-scraping

Ping Sm

Receive an email or Telegram message as soon as Migros Sanalmarket is available for delivery in your neighborhood.

Stars: ✭ 71 (-94.33%)

Mutual labels: web-scraping

Social Media Profile Scrapers

Fetch user's data across social media

Stars: ✭ 60 (-95.21%)

Mutual labels: web-scraping

Letterboxd recommendations

Scraping publicly-accessible Letterboxd data and creating a movie recommendation model with it that can generate recommendations when provided with a Letterboxd username

Stars: ✭ 23 (-98.16%)

Mutual labels: web-scraping

Apify Js

Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.

Stars: ✭ 3,154 (+151.72%)

Mutual labels: web-scraping

1-60 of 134 similar projects

›