All Projects → top-github-scraper → Similar Projects or Alternatives

622 Open source projects that are alternatives of or similar to top-github-scraper

Phpscraper
PHP Scraper - an highly opinionated web-interface for PHP
Stars: ✭ 148 (+270%)
Mutual labels:  scraping, web-scraper, web-scraping
Scrape Linkedin Selenium
`scrape_linkedin` is a python package that allows you to scrape personal LinkedIn profiles & company pages - turning the data into structured json.
Stars: ✭ 239 (+497.5%)
Mutual labels:  scraping, web-scraper, web-scraping
Scrapple
A framework for creating semi-automatic web content extractors
Stars: ✭ 464 (+1060%)
Mutual labels:  scraping, web-scraper, web-scraping
Detect Cms
PHP Library for detecting CMS
Stars: ✭ 78 (+95%)
Mutual labels:  scraping, web-scraper, web-scraping
Scrapy Craigslist
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Stars: ✭ 54 (+35%)
Mutual labels:  web-scraper, web-scraping
Daftlistings
A library that enables programmatic interaction with daft.ie. Daft.ie has nationwide coverage and contains about 80% of the total available properties in Ireland.
Stars: ✭ 86 (+115%)
Mutual labels:  web-scraper, web-scraping
OLX Scraper
📻 An OLX Scraper using Scrapy + MongoDB. It Scrapes recent ads posted regarding requested product and dumps to NOSQL MONGODB.
Stars: ✭ 15 (-62.5%)
Mutual labels:  web-scraper, web-scraping
Spidr
A versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+1540%)
Mutual labels:  web-scraper, web-scraping
Php Curl Class
PHP Curl Class makes it easy to send HTTP requests and integrate with web APIs
Stars: ✭ 2,903 (+7157.5%)
Mutual labels:  web-scraper, web-scraping
Humanoid
Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (+120%)
Mutual labels:  scraping, web-scraping
Basketball reference web scraper
NBA Stats API via Basketball Reference
Stars: ✭ 279 (+597.5%)
Mutual labels:  web-scraper, web-scraping
papercut
Papercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-62.5%)
Mutual labels:  scraping, web-scraping
Html Metadata
MetaData html scraper and parser for Node.js (supports Promises and callback style)
Stars: ✭ 129 (+222.5%)
Mutual labels:  web-scraper, web-scraping
Social Media Profile Scrapers
Fetch user's data across social media
Stars: ✭ 60 (+50%)
Mutual labels:  web-scraper, web-scraping
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+592.5%)
Mutual labels:  scraping, web-scraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+10092.5%)
Mutual labels:  scraping, web-scraping
Project Tauro
A Router WiFi key recovery/cracking tool with a twist.
Stars: ✭ 52 (+30%)
Mutual labels:  web-scraper, web-scraping
raspagem-de-dados-fatec
📓 Minicurso de raspagem de dados web com Python ministrado na Semana de Tecnologia da FATEC Jundiaí
Stars: ✭ 22 (-45%)
Mutual labels:  scraping, web-scraping
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1677.5%)
Mutual labels:  scraping, web-scraping
Cascadia
Go cascadia package command line CSS selector
Stars: ✭ 67 (+67.5%)
Mutual labels:  web-scraper, web-scraping
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (+70%)
Mutual labels:  web-scraper, web-scraping
Apify Js
Apify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+7785%)
Mutual labels:  scraping, web-scraping
Web Scraping
Detailed web scraping tutorials for dummies with financial data crawlers on Reddit WallStreetBets, CME (both options and futures), US Treasury, CFTC, LME, SHFE and news data crawlers on BBC, Wall Street Journal, Al Jazeera, Reuters, Financial Times, Bloomberg, CNN, Fortune, The Economist
Stars: ✭ 153 (+282.5%)
Mutual labels:  web-scraper, web-scraping
PythonScrapyBasicSetup
Basic setup with random user agents and IP addresses for Python Scrapy Framework.
Stars: ✭ 57 (+42.5%)
Mutual labels:  scraping, web-scraping
Sqrape
Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (+260%)
Mutual labels:  scraping, web-scraping
browser-pool
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Stars: ✭ 71 (+77.5%)
Mutual labels:  scraping, web-scraping
Faster Than Requests
Faster requests on Python 3
Stars: ✭ 639 (+1497.5%)
Mutual labels:  web-scraper, web-scraping
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+32.5%)
Mutual labels:  scraping, web-scraping
ioweb
Web Scraping Framework
Stars: ✭ 31 (-22.5%)
Mutual labels:  scraping, web-scraping
Linkedin-Client
Web scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (+5%)
Mutual labels:  web-scraper, web-scraping
github-markdown-render
Display Markdown formatted documents on your local web server using GitHub's Markdown rendering API and CSS to mimic the visuals of GitHub itself.
Stars: ✭ 18 (-55%)
Mutual labels:  github-api
ferenda
Transform unstructured document collections to structured Linked Data
Stars: ✭ 22 (-45%)
Mutual labels:  scraping
chesf
CHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages
Stars: ✭ 18 (-55%)
Mutual labels:  scraping
neo
A Discord bot built to satisfy a multitude of needs
Stars: ✭ 16 (-60%)
Mutual labels:  github-api
git-down-repo
Download git-repo for any url
Stars: ✭ 50 (+25%)
Mutual labels:  github-api
Recent-Commits-on-Repository
Find a github repository an its recent commits
Stars: ✭ 12 (-70%)
Mutual labels:  github-api
org-stats
Get the contributor stats summary from all repos of any given organization
Stars: ✭ 151 (+277.5%)
Mutual labels:  github-api
Instagram-Giveaways-Winner
Instagram Bot which when given a post url will spam mentions to increase the chances of winning. Win Instagram Giveaways!
Stars: ✭ 95 (+137.5%)
Mutual labels:  web-scraper
proxi
Proxy pool. Finds and checks proxies with rest api for querying results. Can find over 25k proxies in under 5 minutes.
Stars: ✭ 32 (-20%)
Mutual labels:  scraping
go-scrapy
Web crawling and scraping framework for Golang
Stars: ✭ 17 (-57.5%)
Mutual labels:  scraping
tableau-scraping
Tableau scraper python library. R and Python scripts to scrape data from Tableau viz
Stars: ✭ 91 (+127.5%)
Mutual labels:  web-scraping
feedsearch-crawler
Crawl sites for RSS, Atom, and JSON feeds.
Stars: ✭ 23 (-42.5%)
Mutual labels:  scraping
journalist
App to write journal digitally. Simple as that.
Stars: ✭ 23 (-42.5%)
Mutual labels:  github-api
VideoRecognition-realtime-autotrainer-alerts
State of the art object detection in real-time using YOLOV3 algorithm. Augmented with a process that allows easy training of the classifier as a plug & play solution . Provides alert if an item in an alert list is detected.
Stars: ✭ 36 (-10%)
Mutual labels:  web-scraper
automation-scripts
Simple scripts that I'm using to automate the boring things.
Stars: ✭ 14 (-65%)
Mutual labels:  web-scraping
internet-affordability
🌍 Dataset that shows the Internet affordability by country (a shocking reality!)
Stars: ✭ 13 (-67.5%)
Mutual labels:  scraping
zulipbot
GitHub workflow-optimizing bot by @zulip
Stars: ✭ 70 (+75%)
Mutual labels:  github-api
gh-notify
GitHub CLI extension to display GitHub notifications
Stars: ✭ 66 (+65%)
Mutual labels:  github-api
Captcha-Tools
All-in-one Python (And now Go!) module to help solve captchas with Capmonster, 2captcha and Anticaptcha API's!
Stars: ✭ 23 (-42.5%)
Mutual labels:  scraping
larry
Larry 🐦 is a really simple Twitter bot generator that tweets random repositories from Github built in Go
Stars: ✭ 64 (+60%)
Mutual labels:  github-api
GithubApp-android-architecture
Let's learn a deep look at the Android architecture
Stars: ✭ 16 (-60%)
Mutual labels:  github-api
ezprofile
🚀 Create an automatic portfolio based on GitHub profile.
Stars: ✭ 344 (+760%)
Mutual labels:  github-api
github-admin
vue和element-ui搭建一個後台管理系統,使用github提供的api搞事情。輸入您的github賬號名自動幫你生成基本的github信息哦😯
Stars: ✭ 15 (-62.5%)
Mutual labels:  github-api
Triton
GitHub notifications tracker for Telegram. Pushes GitHub notifications to Telegram.
Stars: ✭ 12 (-70%)
Mutual labels:  github-api
GithubClient
Github iOS Client based on Github REST V3 API and GraphQL V4 API
Stars: ✭ 42 (+5%)
Mutual labels:  github-api
subscene scraper
Library to download subtitles from subscene.com
Stars: ✭ 14 (-65%)
Mutual labels:  scraping
search-github-starred
Full-Text Search the readme, description, homepage and URL of your GitHub starred repository. Use GitHub OAuth 2, React, Redux, Golang (server side), Elasticsearch, Redis.
Stars: ✭ 15 (-62.5%)
Mutual labels:  github-api
fixtures
Fixtures for all the octokittens
Stars: ✭ 82 (+105%)
Mutual labels:  github-api
get-sauce
A command line program to download hentai videos and images from multiple websites
Stars: ✭ 40 (+0%)
Mutual labels:  web-scraper
Node-js-functionalities
This repository contains very useful restful API's and functionalities in node-js containing many important tutorial code for mastering node-js, all tutorials have been published on medium.com, tutorials link is given below
Stars: ✭ 69 (+72.5%)
Mutual labels:  web-scraping
1-60 of 622 similar projects