IdtImage Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.
Stars: ✭ 202 (-91.45%)
awesome-search-engine-optimizationA curated list of backlink, social signal opportunities, and link building strategies and tactics to help improve search engine results and ranking.
Stars: ✭ 82 (-96.53%)
ArachnidCrawl all unique internal links found on a given website, and extract SEO related information - supports javascript based sites
Stars: ✭ 224 (-90.52%)
AntchAntch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (-91.62%)
JivesearchA search engine that doesn't track you.
Stars: ✭ 364 (-84.6%)
Lulu[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (-66.61%)
Search Engine ParserLightweight package to query popular search engines and scrape for result titles, links and descriptions
Stars: ✭ 216 (-90.86%)
OpensearchserverOpen-source Enterprise Grade Search Engine Software
Stars: ✭ 408 (-82.73%)
FessFess is very powerful and easily deployable Enterprise Search Server.
Stars: ✭ 561 (-76.26%)
ScrappleA framework for creating semi-automatic web content extractors
Stars: ✭ 464 (-80.36%)
WebmagicA scalable web crawler framework for Java.
Stars: ✭ 10,186 (+331.06%)
Linkedin Profile Scraper🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-92.76%)
Gopa[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (-88.28%)
indieweb-searchSource code for the IndieWeb search engine.
Stars: ✭ 16 (-99.32%)
Awesome Python Primer自学入门 Python 优质中文资源索引,包含 书籍 / 文档 / 视频,适用于 爬虫 / Web / 数据分析 / 机器学习 方向
Stars: ✭ 57 (-97.59%)
Goose ParserUniversal scrapping tool, which allows you to extract data using multiple environments
Stars: ✭ 211 (-91.07%)
SmartImageReverse image search tool (SauceNao, ImgOps, trace.moe, and more)
Stars: ✭ 346 (-85.36%)
Sasila一个灵活、友好的爬虫框架
Stars: ✭ 286 (-87.9%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+72.53%)
CrawlyCrawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (-81.38%)
GeziyorGeziyor, a fast web crawling & scraping framework for Go. Supports JS rendering.
Stars: ✭ 1,246 (-47.27%)
DotnetcrawlerDotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-95.77%)
D4n155OWASP D4N155 - Intelligent and dynamic wordlist using OSINT
Stars: ✭ 105 (-95.56%)
FilemastaA search application to explore, discover and share online files
Stars: ✭ 571 (-75.84%)
CollyElegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+557.43%)
bots-zooNo description or website provided.
Stars: ✭ 59 (-97.5%)
FerretDeclarative web scraping
Stars: ✭ 4,837 (+104.7%)
papercutPapercut is a scraping/crawling library for Node.js built on top of JSDOM. It provides basic selector features together with features like Page Caching and Geosearch.
Stars: ✭ 15 (-99.37%)
NewcrawlerFree Web Scraping Tool with Java
Stars: ✭ 589 (-75.07%)
ScrapyScrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+1691.92%)
SearchAn Open Source Search Engine
Stars: ✭ 139 (-94.12%)
Lolcate RsLolcate -- A comically fast way of indexing and querying your filesystem. Replaces locate / mlocate / updatedb. Written in Rust.
Stars: ✭ 191 (-91.92%)
Crawler illegal cases in chinaCollection of China illegal cases about web crawler 本项目用来整理所有中国大陆爬虫开发者涉诉与违规相关的新闻、资料与法律法规。致力于帮助在中国大陆工作的爬虫行业从业者了解我国相关法律,避免触碰数据合规红线。 [AD]中文知识图谱门户
Stars: ✭ 2,448 (+3.6%)
Google Group CrawlerGet (almost) original messages from google group archives. Your data is yours.
Stars: ✭ 190 (-91.96%)
ItemsjsFull text, faceted, (almost) dependency free search engine in javascript
Stars: ✭ 179 (-92.42%)
NosmokeA cross platform UI crawler which scans view trees then generate and execute UI test cases.
Stars: ✭ 178 (-92.47%)
Anime DlAnime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (-91.96%)
Instagram CrawlerCrawl instagram photos, posts and videos for download.
Stars: ✭ 178 (-92.47%)
N2h4네이버 뉴스 수집을 위한 도구
Stars: ✭ 177 (-92.51%)
Jsonframe Cheeriosimple multi-level scraper json input/output for Cheerio
Stars: ✭ 196 (-91.71%)
GeccoEasy to use lightweight web crawler(易用的轻量化网络爬虫)
Stars: ✭ 2,310 (-2.24%)
Goribot[Crawler/Scraper for Golang]🕷A lightweight distributed friendly Golang crawler framework.一个轻量的分布式友好的 Golang 爬虫框架。
Stars: ✭ 190 (-91.96%)
Spoon🥄 A package for building specific Proxy Pool for different Sites.
Stars: ✭ 173 (-92.68%)
GmdbGMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)
Stars: ✭ 189 (-92%)