Embed: Get info from any web service or page
Stars: ✭ 1,808 (+7132%)
Awesome Puppeteer: A curated list of awesome Puppeteer resources.
Stars: ✭ 1,728 (+6812%)
Jikan Rest: The REST API for Jikan
Stars: ✭ 200 (+700%)
Shadow Useragent: Pick the most common user-agents on the Internet 👻
Stars: ✭ 147 (+488%)
Dotnetcrawler: DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output, based on .NET Core. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible to fit your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (+300%)
Thal: Getting started with Puppeteer and Chrome Headless for Web Scraping
Stars: ✭ 2,345 (+9280%)
Torchbear: 🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity
Stars: ✭ 128 (+412%)
Webmagic: A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+40644%)
Juriscraper: An API to scrape American court websites for metadata.
Stars: ✭ 194 (+676%)
Xquery: Extract data or evaluate values from HTML/XML documents using XPath
Stars: ✭ 155 (+520%)
Pastepwn: Python framework to scrape Pastebin pastes and analyze them
Stars: ✭ 87 (+248%)
Goose Parser: Universal scraping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (+744%)
Fantasy Basketball: Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with a genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (+484%)
Memorious: Distributed crawling framework for documents and structured data.
Stars: ✭ 248 (+892%)
Panther: A browser testing and web crawling library for PHP and Symfony
Stars: ✭ 2,480 (+9820%)
Htmlsql: htmlSQL is an experimental PHP library which allows you to access HTML values with an SQL-like syntax.
Stars: ✭ 120 (+380%)
algoexpert: AlgoExpert is an online platform that helps software engineers prepare for coding and technical interviews.
Stars: ✭ 8 (-68%)
Seleniumcrawler: An example using Selenium webdrivers for Python and the Scrapy framework to create a web scraper that crawls an ASP site
Stars: ✭ 117 (+368%)
Antch: Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+692%)
D4n155: OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (+320%)
Scrape Linkedin Selenium: `scrape_linkedin` is a Python package that allows you to scrape personal LinkedIn profiles & company pages, turning the data into structured JSON.
Stars: ✭ 239 (+856%)
Nintendeals: Library with a set of tools for scraping information about Nintendo games and their prices across all regions (NA, EU and JP).
Stars: ✭ 94 (+276%)
Linkedin Profile Scraper: 🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (+584%)
Secret Agent: The web browser that's built for scraping.
Stars: ✭ 151 (+504%)
Search Engine Parser: Lightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (+764%)
Serpscrap: SEO Python scraper to extract data from major search engine result pages. Extract data like URL, title, snippet, rich snippet and the type from search results for given keywords. Detect ads or make automated screenshots. You can also fetch the text content of URLs provided in the search results or supplied on your own. It's useful for SEO and business-related research tasks.
Stars: ✭ 153 (+512%)
Musoq: Use SQL on various data sources
Stars: ✭ 252 (+908%)
Phpscraper: PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (+492%)
Colly: Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+62040%)
Sqrape: Simple Query Scraping with CSS and Go Reflection (MOVED to Gitlab)
Stars: ✭ 144 (+476%)
google-scraper: This class can retrieve search results from Google.
Stars: ✭ 33 (+32%)
Educative.io Downloader: 📖 This tool downloads courses from educative.io for offline use. It uses your login credentials to download the course.
Stars: ✭ 139 (+456%)
Transistor: Transistor, a Python web scraping framework for intelligent use cases.
Stars: ✭ 205 (+720%)
Udemycoursegrabber: Your will to enroll in a Udemy course is here, but the money isn't? Search no more! This Python program searches for your desired course on more than [insert big number here] websites, compares the last-updated dates, and gives you back the download link of the latest one, though you can also choose to see the others.
Stars: ✭ 137 (+448%)
Loconotion: 📄 Python tool to turn Notion.so pages into lightweight, customizable static websites
Stars: ✭ 237 (+848%)
Googlescraper: A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Includes asynchronous networking support.
Stars: ✭ 2,363 (+9352%)
Od Database: Distributed crawler, database and web frontend for public directories indexing
Stars: ✭ 121 (+384%)
tvseries: TV Series is a tool that scrapes episode synopses of popular TV series from websites like Wikipedia / IMDb and shows them in one place with a user-friendly navigation UI.
Stars: ✭ 37 (+48%)
Souqscraper: Simple scripts to level up your scraping skills, and source code for the Level UP playlist on YouTube
Stars: ✭ 118 (+372%)
Idt: Image Dataset Tool (idt) is a CLI tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.
Stars: ✭ 202 (+708%)
Scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+169272%)
Reaper: Social media scraping / data collection tool for the Facebook, Twitter, Reddit, YouTube, Pinterest, and Tumblr APIs
Stars: ✭ 240 (+860%)
Laravel Bank Statements: Laravel package to collect your bank statement history. Currently supports parsing statement history from the BCA, Mandiri, BNI, and MUAMALAT e-banking websites.
Stars: ✭ 105 (+320%)
Jsonframe Cheerio: Simple multi-level scraper with JSON input/output for Cheerio
Stars: ✭ 196 (+684%)
Languagepod101 Scraper: Python scraper for Language Pods such as Japanesepod101.com 👹 🗾 🍣 Compatible with Japanese, Chinese, French, German, Italian, Korean, Portuguese, Russian, Spanish and many more! ✨
Stars: ✭ 104 (+316%)
Champ: A Telegram bot written in Python that serves some basic functions like weather, music charts, cricket scores and much more.
Stars: ✭ 22 (-12%)
Grawler: Grawler is a tool written in PHP that comes with a web interface; it automates the task of using Google dorks, scrapes the results, and stores them in a file.
Stars: ✭ 98 (+292%)
Anime Dl: Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (+660%)
Humanoid: Node.js package to bypass CloudFlare's anti-bot JavaScript challenges
Stars: ✭ 88 (+252%)
Scrapysharp: Reborn of https://bitbucket.org/rflechner/scrapysharp
Stars: ✭ 226 (+804%)
github-languages: Tiny little Ruby on Rails website that crawls through your public GitHub repos to find out what your favourite languages are.
Stars: ✭ 23 (-8%)
Whatsapp-Net: Generate a network graph of connections from your WhatsApp groups data
Stars: ✭ 75 (+200%)
List Of User Agents: List of major web + mobile browser user agent strings. +1 Bonus script to scrape :)
Stars: ✭ 247 (+888%)
Arachnid: Crawl all unique internal links found on a given website, and extract SEO-related information - supports JavaScript-based sites
Stars: ✭ 224 (+796%)
Requests Html: Pythonic HTML Parsing for Humans™
Stars: ✭ 12,268 (+48972%)