pickall.NET agile and extensible web searching API
Stars: ✭ 25 (-52.83%)
extract-cssExtract all CSS from a webpage, packaged as a Now V2 Lambda
Stars: ✭ 23 (-56.6%)
turtleInstagram Photo Downloader
Stars: ✭ 15 (-71.7%)
Android-Apps-Downloader📱 A tool to download android apps from Google Play Store and Xiaomi App Store (the famous Chinese Store).
Stars: ✭ 16 (-69.81%)
SillyniumAutomate the creation of Python Selenium Scripts by drawing coloured boxes on webpage elements
Stars: ✭ 100 (+88.68%)
scrapetubeGet all videos from a youtube channel, get all videos from a playlist, get all videos that match a search
Stars: ✭ 120 (+126.42%)
spiderMultithreaded Web spider crawler written in Rust.
Stars: ✭ 81 (+52.83%)
ScrapoxyScrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
Stars: ✭ 1,322 (+2394.34%)
ogcheckr-apiAn api to check social media username availability on a variety of services
Stars: ✭ 18 (-66.04%)
scrapeerEssential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
Stars: ✭ 81 (+52.83%)
pysoundcloudScraping the Un–scrapable™
Stars: ✭ 63 (+18.87%)
scoopi-scraperScoopi Web Scraper is a heavy duty tool to extract data from HTML pages.
Stars: ✭ 18 (-66.04%)
tvseriesTV Series is a tool that scrapes Episode Synopsis' of popular TV Series' from websites like Wikipedia / IMDb and show in one place with a user-friendly navigation UI.
Stars: ✭ 37 (-30.19%)
fb-page-chat-downloadPython script to download messages from a Facebook page to a CSV file
Stars: ✭ 51 (-3.77%)
SpidrA versatile Ruby web spidering library that can spider a site, multiple domains, certain links or infinitely. Spidr is designed to be fast and easy to use.
Stars: ✭ 656 (+1137.74%)
evineInteractive CLI Web Crawler
Stars: ✭ 140 (+164.15%)
InstaloctrackAn Instagram OSINT tool to collect all the geotagged locations available on an Instagram profile in order to plot them on a map, and dump them in a JSON.
Stars: ✭ 85 (+60.38%)
BaiduSpider项目已经移动至:https://github.com/BaiduSpider/BaiduSpider !! 一个爬取百度搜索结果的爬虫,目前支持百度网页搜索,百度图片搜索,百度知道搜索,百度视频搜索,百度资讯搜索,百度文库搜索,百度经验搜索和百度百科搜索。
Stars: ✭ 29 (-45.28%)
GmdbGMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)
Stars: ✭ 189 (+256.6%)
SurgeonDeclarative DOM extraction expression evaluator. 👨⚕️
Stars: ✭ 653 (+1132.08%)
Hoomanhttp interceptor to hoomanize cloudflare requests
Stars: ✭ 82 (+54.72%)
INMET-API-temperatureCrawler dos dados metereológicos de estações convencionais do INMET (BDMEP)
Stars: ✭ 32 (-39.62%)
nyt-first-saidTweets when words are published for the first time in the NYT
Stars: ✭ 222 (+318.87%)
Instagram CrawlerGet Instagram posts/profile/hashtag data without using Instagram API
Stars: ✭ 643 (+1113.21%)
wishlistRead an Amazon wishlist programmatically with Python
Stars: ✭ 44 (-16.98%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+2201.89%)
AzurLaneWikiScrapersA console application that can scrape the Azur Lane wiki and export the data to Json files
Stars: ✭ 12 (-77.36%)
Kikoeru Expresskikoeru 后端,不再维护,请到https://github.com/umonaca/kikoeru-express 获取更新
Stars: ✭ 79 (+49.06%)
awesome-interfaceAngularJS SPA interface for awesome lists. Awesome lists parsed using python.
Stars: ✭ 25 (-52.83%)
scrapy-LBCAraignée LeBonCoin avec Scrapy et ElasticSearch
Stars: ✭ 14 (-73.58%)
Instascrape🚀 A fast and lightweight utility and Python library for downloading posts, stories, and highlights from Instagram.
Stars: ✭ 76 (+43.4%)
arxiv leaksWhisper of the arxiv: read comments in tex of papers
Stars: ✭ 22 (-58.49%)
TradeTheEventImplementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (+20.75%)
PymarketcapPython3 API wrapper and web scraper for https://coinmarketcap.com
Stars: ✭ 73 (+37.74%)
Unhtml.rsA magic html parser
Stars: ✭ 180 (+239.62%)
Scala ScraperA Scala library for scraping content from HTML pages
Stars: ✭ 631 (+1090.57%)
gHarvesterProof of concept for a security issue (in my opinion) that I found in accounts.google.com
Stars: ✭ 20 (-62.26%)
Instagram CrawlerCrawl instagram photos, posts and videos for download.
Stars: ✭ 178 (+235.85%)
Instagram4j📷 Instagram private API in Java
Stars: ✭ 629 (+1086.79%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (+2115.09%)
tieba-zhuaqu百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (+5.66%)
GoscrapeWeb scraper that can create an offline readable version of a website
Stars: ✭ 69 (+30.19%)
yt-videos-listCreate and **automatically** update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.
Stars: ✭ 64 (+20.75%)
pyscrapersScrapers for vk, facebook, instagram and more
Stars: ✭ 18 (-66.04%)
CheerioFast, flexible, and lean implementation of core jQuery designed specifically for the server.
Stars: ✭ 24,616 (+46345.28%)
Instagram ScraperScrapes an instagram user's photos and videos
Stars: ✭ 5,664 (+10586.79%)
chopperChopper is a tool to extract elements from HTML by preserving ancestors and CSS rules
Stars: ✭ 22 (-58.49%)
ReadablewebproxyRewriting web proxy and archival tool. At this point, it just tries to download all the things.
Stars: ✭ 172 (+224.53%)
JikanUnofficial MyAnimeList PHP+REST API which provides functions other than the official API
Stars: ✭ 531 (+901.89%)
GoogledictionaryapiGoogle does not provide Google Dictionary API so I created one.
Stars: ✭ 528 (+896.23%)
Novel基于 Laravel 5.2 的小说网站
Stars: ✭ 172 (+224.53%)