Serpscrap: SEO Python scraper to extract data from major search engine result pages. Extract data such as URL, title, snippet, rich snippet, and type from search results for given keywords. Detect ads or take automated screenshots. You can also fetch the text content of URLs found in search results or supplied yourself. It's useful for SEO and business-related research tasks.
Stars: ✭ 153 (-93.6%)
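Docs: Source code for the book "Data Collection: From Getting Started to Giving Up" (《数据采集从入门到放弃》). Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK.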
Stars: ✭ 118 (-95.07%)
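Crack Js Spider: Cracks JS anti-crawler encrypted parameters. Cracked so far: China Judgments Online (updated 2020-06-30), Taobao password, Tianan Insurance login, Bilibili login, Fang.com login, WPS login, Weibo login, Youdao Translate, NetEase login, WeChat Official Account login, Kongzhong login, Jinmubiao login, student information management system login, Gongying Finance login, Chongqing Science and Technology Resource Sharing Platform login, NetEase Cloud Music download, one-click video link parsing, and Cailian Press login.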
Stars: ✭ 175 (-92.68%)
Seleniumcrawler: An example using Selenium webdrivers for Python and the Scrapy framework to create a web scraper that crawls an ASP site
Stars: ✭ 117 (-95.11%)
Pixeval: A strong, fast, and flexible Pixiv client based on .NET Core and WPF
Stars: ✭ 1,031 (-56.9%)
Ngmeta: Dynamic meta tags in your AngularJS single-page application
Stars: ✭ 152 (-93.65%)
Hive: Lots of spiders
Stars: ✭ 110 (-95.4%)
Javpy: Enjoy driving on a Javascriptive (originally Pythonic) way to Japanese AV!
Stars: ✭ 147 (-93.85%)
Instagram Profilecrawl: 💻 Quickly crawl the information (e.g. followers, tags) of an Instagram profile. No login required!
Stars: ✭ 110 (-95.4%)
Copybook: Crawls every novel on a novel website, stores them in a database, and uses the crawled data to build your own novel website
Stars: ✭ 117 (-95.11%)
Pylinkvalidator: pylinkvalidator is a standalone, pure-Python link validator and crawler that traverses a website and reports the errors it encounters (e.g., 500 and 404 errors).
Stars: ✭ 109 (-95.44%)
Crawler: Go process used to crawl websites
Stars: ✭ 147 (-93.85%)
Linkcrawler: Cross-platform persistent and distributed web crawler 🔗
Stars: ✭ 109 (-95.44%)
Lumberjack: An automated website accessibility scanner and CLI
Stars: ✭ 109 (-95.44%)
Scraperwiki Python: ScraperWiki Python library for scraping and saving data
Stars: ✭ 146 (-93.9%)
Fawkes: Fawkes is a tool to search for targets vulnerable to SQL injection. It performs the search using the Google search engine.
Stars: ✭ 108 (-95.48%)
Serp: Google Search SERP scraper
Stars: ✭ 40 (-98.33%)
Ptt Alertor: 📢 Ptt article notification bot! Get notified of Ptt articles in real time
Stars: ✭ 150 (-93.73%)
Dingdian: A novel website built with a Python crawler and Flask
Stars: ✭ 115 (-95.19%)
App comments spider: Crawls game reviews from Baidu Tieba, TapTap, the App Store, and official Weibo accounts (based on redis_scrapy); filtering uses a Bloom filter.
Stars: ✭ 38 (-98.41%)
Gmdb: GMDB is an ultra-simple, cross-platform movie library with features such as Search, Take Note, Watch Later, Like, Import, Learn, and Instantly Torrent Magnet Watch
Stars: ✭ 189 (-92.1%)
Dirhunt: Find web directories without brute force
Stars: ✭ 983 (-58.9%)
Scrapy: Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+1670.19%)
Gargantua: The fast website crawler
Stars: ✭ 35 (-98.54%)
Cocrawler: CoCrawler is a versatile web crawler built using modern tools and concurrency.
Stars: ✭ 148 (-93.81%)
Diskover: File system crawler, disk space usage, file search engine, and file system analytics powered by Elasticsearch
Stars: ✭ 977 (-59.16%)
Owllook: owllook, a novel search engine
Stars: ✭ 2,163 (-9.57%)
Cangibrina: A fast and powerful dashboard (admin) finder
Stars: ✭ 200 (-91.64%)
Portia Dashboard: portia-dashboard is a visual web crawler based on scrapinghub/portia
Stars: ✭ 199 (-91.68%)
Node Ytdl Core: YouTube video downloader in JavaScript.
Stars: ✭ 3,004 (+25.59%)
Videospider: Scrapes information about TV series, movies, anime, actors, and more from Douban, bilibili, and other sites
Stars: ✭ 186 (-92.22%)
Readablewebproxy: Rewriting web proxy and archival tool. At this point, it just tries to download all the things.
Stars: ✭ 172 (-92.81%)
Taobaoscrapy: 😩 Tool for Taobao/Tmall | A childhood toy, now outdated
Stars: ✭ 146 (-93.9%)
Webmagic: A scalable web crawler framework for Java.
Stars: ✭ 10,186 (+325.84%)
Geetest: Slider CAPTCHA; hope it helps you ❤️
Stars: ✭ 114 (-95.23%)
Phpscraper: PHP Scraper - a highly opinionated web interface for PHP
Stars: ✭ 148 (-93.81%)
Real Estate Scraper: Web scraper that makes it easier to find real estate in Slovenia.
Stars: ✭ 31 (-98.7%)
Scrala: Unmaintained 🐳 ☕️ 🕷 Scala crawler (spider) framework, inspired by Scrapy, created by @gaocegege
Stars: ✭ 113 (-95.28%)
Lcrawl: An elegant crawler for the Zhengfang educational administration system.
Stars: ✭ 112 (-95.32%)
Anitop: Anitop is a simple, unofficial API for the https://anitrendz.net/ site
Stars: ✭ 30 (-98.75%)
Pachong: Some crawler code
Stars: ✭ 147 (-93.85%)
Huginn: Create agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+1308.61%)
Headlesschrome: A Go package for working with headless Chrome. Run interactive JavaScript commands on web pages with Go and Chrome.
Stars: ✭ 112 (-95.32%)
Node Website Scraper: Download a website to a local directory (including all CSS, images, JS, etc.)
Stars: ✭ 912 (-61.87%)
Novel: A novel website based on Laravel 5.2
Stars: ✭ 172 (-92.81%)