Crawlertutorial爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Stars: ✭ 282 (+145.22%)
Gopa AbandonedGOPA, a spider written in Go.(NOTE: this project moved to https://github.com/infinitbyte/gopa )
Stars: ✭ 98 (-14.78%)
DotnetspiderDotnetSpider, a .NET standard web crawling library. It is lightweight, efficient and fast high-level web crawling & scraping framework
Stars: ✭ 3,233 (+2711.3%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-87.83%)
GoscraperGolang pkg to quickly return a preview of a webpage (title/description/images)
Stars: ✭ 72 (-37.39%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (+138.26%)
LibrecmsFree Open Source Content Management System, based on PHP, Bootstrap and jQuery.
Stars: ✭ 12 (-89.57%)
Seo Helper🔍 SEO Helper is a package that provides tools and helpers for SEO (Search Engine Optimization).
Stars: ✭ 262 (+127.83%)
Sina Stock CrawlerSina stock options crawler with CSV output 新浪上证ETF期权数据爬虫
Stars: ✭ 12 (-89.57%)
Tumblr crawlerThis is a Multi-thread crawler for Tumblr.
Stars: ✭ 258 (+124.35%)
DisecDistributed Image Search Engine Crawler
Stars: ✭ 11 (-90.43%)
Awesome Ecommerce Stack💰 Popular marketing tools and add-ons used by 10,000+ of the top e-commerce stores.
Stars: ✭ 255 (+121.74%)
lightnovel epub🍭 epub generator for (light)novels (轻) 小说 epub 生成器,支持站点:轻之国度、轻小说文库
Stars: ✭ 89 (-22.61%)
SEO-DashboardSEO dashboard from Search console Data using the Google Search API, Mysql database , NodeJS RESTAPI( ExpressJS) and reactJs Dashboard
Stars: ✭ 39 (-66.09%)
ArachnidPowerful web scraping framework for Crystal
Stars: ✭ 68 (-40.87%)
PuppetronPuppeteer (Headless Chrome Node API)-based rendering solution.
Stars: ✭ 429 (+273.04%)
galerA fast tool to fetch URLs from HTML attributes by crawl-in.
Stars: ✭ 138 (+20%)
ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+36720%)
PY-Login模拟登录各类网站,操作 API 完成各种不可描述的事情
Stars: ✭ 26 (-77.39%)
Sqlivmassive SQL injection vulnerability scanner
Stars: ✭ 840 (+630.43%)
ecommercetoolsEcommerceTools is a Python data science toolkit for ecommerce, marketing science, and technical SEO analysis and modelling and was created by Matt Clarke.
Stars: ✭ 41 (-64.35%)
Seo AnalysisA Python script to gain some insights from a domain and list of keywords.
Stars: ✭ 25 (-78.26%)
Seo Panel World's first seo control panel for multiple websites
Stars: ✭ 96 (-16.52%)
tg crawlerJust a crawler based on tg-cli for Telegram. Deprecated by now, please use telegram-export.
Stars: ✭ 71 (-38.26%)
Appcrawler基于appium的app自动遍历工具
Stars: ✭ 925 (+704.35%)
SearchScraperAPIAiohttp web server API, which scrapes Google and returns scrape results as response. Supports proxies, multiple geos and number of results.
Stars: ✭ 31 (-73.04%)
Lxspider爬虫案例合集。包括但不限于《淘宝、京东、天猫、豆瓣、抖音、快手、微博、微信、阿里、头条、pdd、优酷、爱奇艺、携程、12306、58、搜狐、百度指数、维普万方、Zlibraty、Oalib、小说、招标网、采购网、小红书》
Stars: ✭ 60 (-47.83%)
indieweb-searchSource code for the IndieWeb search engine.
Stars: ✭ 16 (-86.09%)
Mzitu👧 美女写真套图爬虫(二)
Stars: ✭ 920 (+700%)
GraphqueryGraphQuery is a query language and execution engine tied to any backend service.
Stars: ✭ 112 (-2.61%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-48.7%)
dijnet-botAz összes számlád még egy helyen :)
Stars: ✭ 17 (-85.22%)
Website-AuditIt's an open-source report template that guides web professionals thought steps to audit any website in terms of the page speed and technical SEO optimisation.
Stars: ✭ 18 (-84.35%)
Schema OrgA fluent builder Schema.org types and ld+json generator
Stars: ✭ 894 (+677.39%)
LightcrawlerCrawl a website and run it through Google lighthouse
Stars: ✭ 1,339 (+1064.35%)
AvbookAV 电影管理系统, avmoo , javbus , javlibrary 爬虫,线上 AV 影片图书馆,AV 磁力链接数据库,Japanese Adult Video Library,Adult Video Magnet Links - Japanese Adult Video Database
Stars: ✭ 8,133 (+6972.17%)
Comicbook本项目不再维护,详情可加群了解 https://t.me/onecomicbook
Stars: ✭ 429 (+273.04%)
Iclr2020 OpenreviewdataScript that crawls meta data from ICLR OpenReview webpage. Tutorials on installing and using Selenium and ChromeDriver on Ubuntu.
Stars: ✭ 426 (+270.43%)
CrawlabDistributed web crawler admin platform for spiders management regardless of languages and frameworks. 分布式爬虫管理平台,支持任何语言和框架
Stars: ✭ 8,392 (+7197.39%)
LimaxNode.js module to generate URL slugs. Another one? This one cares about i18n and transliterates non-Latin scripts to conform to the RFC3986 standard. Mostly API-compatible with similar modules.
Stars: ✭ 423 (+267.83%)
DotcommonWhat do people have in their dotfiles?
Stars: ✭ 418 (+263.48%)
Vulnxvulnx 🕷️ is an intelligent bot auto shell injector that detect vulnerabilities in multiple types of cms { `wordpress , joomla , drupal , prestashop .. `}
Stars: ✭ 1,009 (+777.39%)
Prerender.jsFast webpages for all browsers.
Stars: ✭ 411 (+257.39%)