Website-downloader: 💡 Download the complete source code of any website, including all assets (JavaScript, stylesheets, images), using Node.js
Stars: ✭ 615 (+1657.14%)
tinyPornManager: Made for Pornhub. Fork of tinyMediaManager v3
Stars: ✭ 57 (+62.86%)
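TikTokDownloader PyWebIO: 🚀 "Douyin_TikTok_Download_API" is an out-of-the-box, high-performance asynchronous Douyin | TikTok data scraping tool that supports API calls as well as online batch parsing and downloading.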
Stars: ✭ 919 (+2525.71%)
nyt-first-said: Tweets when words are published for the first time in the NYT
Stars: ✭ 222 (+534.29%)
sotoki: StackExchange websites to ZIM scraper
Stars: ✭ 64 (+82.86%)
nuvem-candidatos: 🇧🇷 Word cloud of the government plans of the 2018 presidential candidates
Stars: ✭ 20 (-42.86%)
ibge: 🌎 Data collection of geographical divisions of Brazil by IBGE (https://servicodados.ibge.gov.br/api/docs)
Stars: ✭ 28 (-20%)
superacao-app: App for the "Anjos do SuperAção" project
Stars: ✭ 17 (-51.43%)
enredo: Modern programming language in Portuguese, based on JS
Stars: ✭ 35 (+0%)
ceiba-dl: Download tool for NTU CEIBA data
Stars: ✭ 80 (+128.57%)
urteile-gesetze-web: Web frontend of the legal information system urteile-gesetze.de
Stars: ✭ 16 (-54.29%)
jd-autobuy: Python crawler for JD.com (京东) with automatic login and online flash purchasing of products
Stars: ✭ 1,262 (+3505.71%)
4scanner: Continuously search imageboard threads for images/webms and download them
Stars: ✭ 103 (+194.29%)
canvas-captcha: A simple captcha module for Node.js based on node-canvas
Stars: ✭ 31 (-11.43%)
copycat: A PHP scraping class
Stars: ✭ 70 (+100%)
proxy-scraper: ⭐️ A proxy scraper made using Protractor | Proxy list updates every three hours 🔥
Stars: ✭ 201 (+474.29%)
Captcha-Cracking: Crack numeric and Chinese captchas with both traditional and deep learning methods, based on Torch and Python.
Stars: ✭ 35 (+0%)
scrapetube: Get all videos from a YouTube channel, from a playlist, or matching a search
Stars: ✭ 120 (+242.86%)
instagram-get-images: Get Instagram images 🌄 (hashtags, accounts, locations) with Puppeteer
Stars: ✭ 69 (+97.14%)
yellowpages-scraper: Yellowpages.com web scraper written in Python and LXML that extracts business details for a given category and location.
Stars: ✭ 56 (+60%)
extension: Web scraping extension
Stars: ✭ 28 (-20%)
campto: Captcha package for Node.js.
Stars: ✭ 21 (-40%)
gutenberg: Scraper for downloading the entire ebook repository of Project Gutenberg
Stars: ✭ 100 (+185.71%)
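scraper: Image crawling and download tool that quickly fetches the images, photos, and illustrations uploaded by designers and users on ZCOOL (站酷, https://www.zcool.com.cn/) and CNU 视觉 (http://www.cnu.cc/).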
Stars: ✭ 64 (+82.86%)
sidrar: An R interface to IBGE's SIDRA API
Stars: ✭ 49 (+40%)
scrapman: Retrieve real (with JavaScript executed) HTML code from a URL; ultra fast, with support for loading multiple pages in parallel
Stars: ✭ 21 (-40%)
cs-wordpress-bouncer: CrowdSec is an open-source cyber security tool. This plugin blocks detected attackers or shows them a captcha to check that they are not bots.
Stars: ✭ 25 (-28.57%)
saveddit: Bulk downloader for Reddit
Stars: ✭ 130 (+271.43%)
Raid-Protect-Discord-Bot: A Discord bot that lets you protect your Discord server with captcha, anti-profanity, anti-nudity image filtering, anti-spam, a minimum account age, logs...
Stars: ✭ 182 (+420%)
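Chinese laws: This project aims to collect the various legal provisions of the People's Republic of China; the project is being restarted, and PRs are welcome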
Stars: ✭ 245 (+600%)
GChan: Scrape boards and threads from 4chan (8kun WIP). Downloads images, videos, and HTML if desired.
Stars: ✭ 31 (-11.43%)
Choosealicense.com: A site to provide non-judgmental guidance on choosing a license for your open source project
Stars: ✭ 2,648 (+7465.71%)
EasyShiro: A full-featured Shiro security integration, simplification, and extension component based on the RBAC model.
Stars: ✭ 47 (+34.29%)
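Crawler illegal cases in china: Collection of illegal cases in China involving web crawlers. This project compiles news, materials, and laws and regulations about lawsuits and violations involving crawler developers in mainland China. It aims to help people working in the crawler industry in mainland China understand the relevant laws and avoid crossing data-compliance red lines. [AD] Chinese knowledge graph portal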
Stars: ✭ 2,448 (+6894.29%)
extract-emails: Extract emails from a given website
Stars: ✭ 58 (+65.71%)
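Parselawdocuments: Performs a series of analyses on collected legal documents, including automatic segmentation according to specifications, case similarity computation, case clustering, and recommendation of legal provisions (experiments are currently based on marriage cases and can be extended to other domains).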
Stars: ✭ 138 (+294.29%)
scraper: Node.js-based scraper using headless Chrome
Stars: ✭ 45 (+28.57%)
Mon Entreprise: The official assistant for entrepreneurs
Stars: ✭ 123 (+251.43%)
RARBG-scraper: With Selenium headless browsing and CAPTCHA solving
Stars: ✭ 38 (+8.57%)
Pkulaw spider: Crawls the Pkulaw (北大法宝) site http://www.pkulaw.cn/Case/
Stars: ✭ 113 (+222.86%)
instagram-hashtag-scraper: Node.js application for scraping recent top posts from Instagram by hashtag without API access.
Stars: ✭ 17 (-51.43%)
Mycail: China Fayan Cup (法研杯) Judicial Artificial Intelligence Challenge
Stars: ✭ 60 (+71.43%)
scrapeer: Essential PHP library that scrapes HTTP(S) and UDP trackers for torrent information.
Stars: ✭ 81 (+131.43%)
Site Policy: Collaborative development on GitHub's site policies, procedures, and guidelines
Stars: ✭ 797 (+2177.14%)
city-codes: Brazilian city names and official codes, IBGE, LexML and others
Stars: ✭ 39 (+11.43%)
Blackstone: ⚫️ A spaCy pipeline and model for NLP on unstructured legal text.
Stars: ✭ 465 (+1228.57%)
rose: Analyse all kinds of data for a TV series
Stars: ✭ 34 (-2.86%)
lux: 👾 Fast and simple video download library and CLI tool written in Go
Stars: ✭ 19,266 (+54945.71%)
Formidable: The PHP pragmatic forms library
Stars: ✭ 116 (+231.43%)
municipios-br: Open-format data on the municipalities and federative units of Brazil.
Stars: ✭ 58 (+65.71%)
mCaptcha: A no-nonsense CAPTCHA system with seamless UX | Backend component
Stars: ✭ 473 (+1251.43%)
covid19-br-info: Coronavirus frontend info about Brazil's states and cities
Stars: ✭ 12 (-65.71%)
freeDictionaryAPI: There was no free Dictionary API on the web when I wanted one for my friend, so I created one.
Stars: ✭ 1,352 (+3762.86%)
robotstxt: robots.txt file parsing and checking for R
Stars: ✭ 65 (+85.71%)