htmltabCommand-line utility to convert HTML tables into CSV files
Stars: ✭ 13 (-75.93%)
autumnA Java parser combinator library written with an unmatched feature set.
Stars: ✭ 112 (+107.41%)
TwitterScraperScrape a User's Twitter data! Bypass the 3,200 tweet API limit for a User!
Stars: ✭ 80 (+48.15%)
WombatLightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.
Stars: ✭ 1,220 (+2159.26%)
rita-dslA Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format
Stars: ✭ 60 (+11.11%)
Kikoeru Expresskikoeru 后端,不再维护,请到https://github.com/umonaca/kikoeru-express 获取更新
Stars: ✭ 79 (+46.3%)
Instascrape🚀 A fast and lightweight utility and Python library for downloading posts, stories, and highlights from Instagram.
Stars: ✭ 76 (+40.74%)
tdop.github.ioReprinting Vaughan Pratt's Paper on Top Down Operator Precedence Parsing
Stars: ✭ 99 (+83.33%)
PymarketcapPython3 API wrapper and web scraper for https://coinmarketcap.com
Stars: ✭ 73 (+35.19%)
TeamReferenceTeam reference for Competitive Programming. Algorithms implementations very used in the ACM-ICPC contests. Latex template to build your own team reference.
Stars: ✭ 29 (-46.3%)
Jd AutobuyPython爬虫,京东自动登录,在线抢购商品
Stars: ✭ 1,174 (+2074.07%)
fanslySimply scrape / download all the media from an fansly account
Stars: ✭ 351 (+550%)
GoscrapeWeb scraper that can create an offline readable version of a website
Stars: ✭ 69 (+27.78%)
nest-crawlerAn easiest crawling and scraping module for NestJS
Stars: ✭ 45 (-16.67%)
socials👨👩👦 Social account detection and extraction in Python, e.g. for crawling/scraping.
Stars: ✭ 37 (-31.48%)
Bad Robo🐙 Get Daily 400-500 Real Followers 👽 [BadRobo] is Best Instagram Bot Available Now with All Features!. Our BOT did not violate any of Instagram's rules, so you don't have to worry about getting ACTION BLOCK!
Stars: ✭ 59 (+9.26%)
peFastest general-purpose parsing library for Python with a familiar API
Stars: ✭ 21 (-61.11%)
TangerineTangerine Bank scraper
Stars: ✭ 54 (+0%)
PyScholarA 'supervised' parser for Google Scholar
Stars: ✭ 74 (+37.04%)
Pitchfork NpmAn Unofficial Pitchfork Music API client for Node.js
Stars: ✭ 50 (-7.41%)
docker-selenium-lambdaThe simplest demo of chrome automation by python and selenium in AWS Lambda
Stars: ✭ 172 (+218.52%)
Social ScraperTổng hợp script crawl dữ liệu từ các mạng xã hội & website tiếng Việt
Stars: ✭ 47 (-12.96%)
PoliteBe nice on the web
Stars: ✭ 253 (+368.52%)
barclayscrapeA small app to programmatically mainpulate Barclays online banking
Stars: ✭ 57 (+5.56%)
SerpGoogle Search SERP Scraper
Stars: ✭ 40 (-25.93%)
etf4u📊 Python tool to scrape real-time information about ETFs from the web and mixing them together by proportionally distributing their assets allocation
Stars: ✭ 29 (-46.3%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-61.11%)
GChanScrape boards and threads from 4chan (8kun WIP). Downloads images, videos and HTML if desired.
Stars: ✭ 31 (-42.59%)
Heroku ebooksA script to generate Markov chains and to post to an _ebooks account on Twitter using Heroku
Stars: ✭ 251 (+364.81%)
AnitopAnitop is an unofficial simple API from https://anitrendz.net/ site
Stars: ✭ 30 (-44.44%)
JagTag📝 JagTag is a simple - yet powerful and customizable - interpretted text parsing language!
Stars: ✭ 40 (-25.93%)
HuginnCreate agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+62296.3%)
PypergrabberFetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.
Stars: ✭ 14 (-74.07%)
turtleInstagram Photo Downloader
Stars: ✭ 15 (-72.22%)
Scanlessonline port scan scraper
Stars: ✭ 875 (+1520.37%)
ScrappingMastering the art of scrapping 🎓
Stars: ✭ 24 (-55.56%)
Voyages Sncf ApiA scrapy spider that scraps times and prices from Voyages Sncf. It uses scrapyrt to provide an API interface.
Stars: ✭ 7 (-87.04%)
ColegaDondeEstaMiTFMUn bot de Twitter que comparte cada hora un TFM hasta que Cristina Cifuentes enseñe el suyo.
Stars: ✭ 14 (-74.07%)
ScrapitScraping scripts for various websites.
Stars: ✭ 25 (-53.7%)
node-red-contrib-nbrowserProvides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Stars: ✭ 31 (-42.59%)
Imagenetscraper👁 Bulk-download all thumbnails from an ImageNet synset, with optional rescaling
Stars: ✭ 24 (-55.56%)
double-agentA test suite of common scraper detection techniques. See how detectable your scraper stack is.
Stars: ✭ 123 (+127.78%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-55.56%)
StatementParserIdea behind the StatementParser is, that it would be nice to be able to process financial data from different kind of statements in automatized way. This is often pretty hard as brokers are giving these data only in form of xls/xlst/pdf or other format which is not directly processable and here comes StatmentParser.
Stars: ✭ 21 (-61.11%)
jsonHunter在线爬虫,online web scraper
Stars: ✭ 86 (+59.26%)
GetsyA simple browser/client-side web scraper.
Stars: ✭ 238 (+340.74%)
Skrape.itA Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.
Stars: ✭ 231 (+327.78%)
scrapy-fieldstatsA Scrapy extension to log items coverage when the spider shuts down
Stars: ✭ 17 (-68.52%)