chirpsTwitter bot powering @arichduvet
Stars: ✭ 35 (+105.88%)
factoryDocker microservice & Crawler by scrapy
Stars: ✭ 56 (+229.41%)
go-scrapyWeb crawling and scraping framework for Golang
Stars: ✭ 17 (+0%)
Scraper-Projects🕸 List of mini projects that involve web scraping 🕸
Stars: ✭ 25 (+47.06%)
elves🎊 Design and implement of lightweight crawler framework.
Stars: ✭ 322 (+1794.12%)
web-clipperEasily download the main content of a web page in html, markdown, and/or epub format from command line.
Stars: ✭ 15 (-11.76%)
scrapy-cookiesA middleware of cookies persistence for Scrapy
Stars: ✭ 19 (+11.76%)
humanparserParse a human name string into salutation, first name, middle name, last name, suffix.
Stars: ✭ 78 (+358.82%)
AngleParseHTML parsing and processing tool for PowerShell.
Stars: ✭ 35 (+105.88%)
top-github-scraperScape top GitHub repositories and users based on keywords
Stars: ✭ 40 (+135.29%)
scavengerScrape and take screenshots of dynamic and static webpages
Stars: ✭ 14 (-17.65%)
dustArchive web pages with all relevant assets or save as a single file HTML
Stars: ✭ 19 (+11.76%)
anime-scraper[partially working] Scrape and add anime episode stream URLs to uGet (Linux) or IDM (Windows) ~ Python3
Stars: ✭ 21 (+23.53%)
copycatA PHP Scraping Class
Stars: ✭ 70 (+311.76%)
invana-botA Web Crawler that scrapes using YAML and python code.
Stars: ✭ 30 (+76.47%)
ScrapyProjectScrapy项目(mysql+mongodb豆瓣top250电影)
Stars: ✭ 18 (+5.88%)
proxycrawl-pythonProxyCrawl Python library for scraping and crawling
Stars: ✭ 51 (+200%)
devsearchA web search engine built with Python which uses TF-IDF and PageRank to sort search results.
Stars: ✭ 52 (+205.88%)
AutohomeUsing Scrapy to crawl Autohome, storage into MonogDB, simple analysis and NLP coming soon
Stars: ✭ 23 (+35.29%)
GPlayCrawlerNo description or website provided.
Stars: ✭ 47 (+176.47%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (+41.18%)
ferendaTransform unstructured document collections to structured Linked Data
Stars: ✭ 22 (+29.41%)
ksoupKotlin Wrapper for Jsoup
Stars: ✭ 59 (+247.06%)
puppeteer-botcheck🕵♂ Bot detection tests for Puppeteer. Hide and seek!
Stars: ✭ 42 (+147.06%)
yttrexyoutube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Stars: ✭ 31 (+82.35%)
internet-affordability🌍 Dataset that shows the Internet affordability by country (a shocking reality!)
Stars: ✭ 13 (-23.53%)
scrapy spiderNo description or website provided.
Stars: ✭ 58 (+241.18%)
dmi-instascraperA GUI for Instaloader to scrape users and hashtags with on Instagram
Stars: ✭ 21 (+23.53%)
scraping-ebayScraping Ebay's products using Scrapy Web Crawling Framework
Stars: ✭ 79 (+364.71%)
selectorlibA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+211.76%)
aioScrapy基于asyncio与aiohttp的异步协程爬虫框架 欢迎Star
Stars: ✭ 34 (+100%)
logparserA tool for parsing Scrapy log files periodically and incrementally, extending the HTTP JSON API of Scrapyd.
Stars: ✭ 70 (+311.76%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (+29.41%)
reason-rust-scraper🦀 Scraping & crawling websites using Rust, and ReasonML
Stars: ✭ 21 (+23.53%)
shupA POSIX shell script to parse HTML
Stars: ✭ 28 (+64.71%)
bgmtoolsBangumi小工具
Stars: ✭ 66 (+288.24%)
JD Spider👍 京东爬虫(大量注释,对刚入门爬虫者极度友好)
Stars: ✭ 56 (+229.41%)
docker-selenium-lambdaThe simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+911.76%)