Downzemall: DownZemAll! is a download manager for Windows, macOS and Linux
Stars: ✭ 157 (-93.36%)
Lolcate Rs: Lolcate -- A comically fast way of indexing and querying your filesystem. Replaces locate / mlocate / updatedb. Written in Rust.
Stars: ✭ 191 (-91.92%)
Instagram Scraper: Scrapes media, likes, followers, tags and all metadata. Inspired by instagram-php-scraper, bot
Stars: ✭ 2,209 (-6.52%)
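Crawler illegal cases in china: A collection of illegal cases in China involving web crawlers. This project gathers news, materials, and laws and regulations related to lawsuits and violations involving web crawler developers in mainland China. It aims to help crawler practitioners working in mainland China understand the relevant laws and avoid crossing data-compliance red lines. [AD] Chinese knowledge graph portal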
Stars: ✭ 2,448 (+3.6%)
Xquery: Extract data or evaluate values from HTML/XML documents using XPath
Stars: ✭ 155 (-93.44%)
Nosmoke: A cross-platform UI crawler which scans view trees, then generates and executes UI test cases.
Stars: ✭ 178 (-92.47%)
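Python3 Spider: Python crawlers in practice - simulated login to major websites, including but not limited to slider CAPTCHA verification, Pinduoduo, Meituan, Baidu, bilibili, Dianping, and Taobao. If you like it, please star ❤️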
Stars: ✭ 2,129 (-9.9%)
Anime Dl: Anime-dl is a command-line program to download anime from CrunchyRoll and Funimation.
Stars: ✭ 190 (-91.96%)
Jlitespider: A lite distributed Java spider framework :-)
Stars: ✭ 151 (-93.61%)
N2h4: A tool for collecting Naver news
Stars: ✭ 177 (-92.51%)
Dxy Covid 19 Crawler: COVID-19/2019-nCoV Realtime Infection Crawler and API
Stars: ✭ 1,865 (-21.07%)
Fantasy Basketball: Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with a genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-93.82%)
Shadow Useragent: Pick the most common user-agents on the Internet 👻
Stars: ✭ 147 (-93.78%)
Phpscraper: PHP Scraper - a highly opinionated web-interface for PHP
Stars: ✭ 148 (-93.74%)
Gecco: Easy-to-use lightweight web crawler
Stars: ✭ 2,310 (-2.24%)
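Httpcode.core: Simple, easy to use, and efficient. An opinionated open-source .NET HTTP request framework! It can be used to build crawlers, make API requests, and more.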
Stars: ✭ 146 (-93.82%)
Javpy: Enjoy driving on a Javascriptive (originally Pythonic) way to Japanese AV!
Stars: ✭ 147 (-93.78%)
Gmdb: GMDB is the ultra-simple, cross-platform Movie Library with Features (Search, Take Note, Watch Later, Like, Import, Learn, Instantly Torrent Magnet Watch)
Stars: ✭ 189 (-92%)
Sqrape: Simple Query Scraping with CSS and Go Reflection (MOVED to GitLab)
Stars: ✭ 144 (-93.91%)
Youtube Projects: This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (-93.91%)
Image To Image Search: A reverse image search engine powered by Elasticsearch and TensorFlow
Stars: ✭ 200 (-91.54%)
Vectorai: Vector AI, a platform for building vector-based applications. Encode, query and analyse data using vectors.
Stars: ✭ 195 (-91.75%)
Learn Anything: Organize the world's knowledge, explore connections and curate learning paths
Stars: ✭ 13,532 (+472.66%)
Google Play Scraper: Google Play scraper for Python inspired by <facundoolano/google-play-scraper>
Stars: ✭ 143 (-93.95%)
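Caiss: A cross-platform, multi-language, high-performance retrieval engine for similar vectors, similar words, and similar sentences. Powerful and easy to use. Stars and forks are welcome. Build together! Power another!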
Stars: ✭ 142 (-93.99%)
Rusticsearch: Lightweight Elasticsearch-compatible search server.
Stars: ✭ 171 (-92.76%)
Robots Txt: Determine if a page may be crawled from robots.txt, robots meta tags and robot headers
Stars: ✭ 142 (-93.99%)
Embed: Get info from any web service or page
Stars: ✭ 1,808 (-23.49%)
Meilisearch: Powerful, fast, and easy-to-use search engine
Stars: ✭ 20,236 (+756.37%)
Proxy pool: Proxy IP pool for Python crawlers
Stars: ✭ 13,964 (+490.94%)
Oddish: Crawl all CS:GO skins from a website.
Stars: ✭ 139 (-94.12%)
Awesome Deep Learning Papers For Search Recommendation Advertising: Awesome Deep Learning papers for industrial Search, Recommendation and Advertising. They focus on Embedding, Matching, Ranking (CTR prediction, CVR prediction), Post Ranking, Transfer, Reinforcement Learning, Self-supervised Learning and so on.
Stars: ✭ 136 (-94.24%)
Gain: Web crawling framework based on asyncio.
Stars: ✭ 2,002 (-15.28%)
Ambar: 🔍 Document Search Engine
Stars: ✭ 1,829 (-22.6%)
Amazonbigspider: 😱 Fully automatic distributed Amazon spider | Distributed product collection and selection across four international Amazon sites | Account: admin, password: adminadmin
Stars: ✭ 140 (-94.08%)
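Fooproxy: A robust, efficient, score-based, targeted IP proxy pool plus API service. You can plug in your own collectors to crawl proxy IPs, and it builds a separate database of valid proxy IPs for each of your crawler's target websites. Supports MongoDB 4.0 and uses Python 3.7. (Scored IP proxy pool; customised proxy data crawlers can be added anytime)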
Stars: ✭ 195 (-91.75%)
Marmot: 💐 Marmot | Web Crawler/HTTP protocol Download Package 🐭
Stars: ✭ 186 (-92.13%)
Fun crawler: Crawl some pictures for fun
Stars: ✭ 169 (-92.85%)
Instagram Bot: An Instagram bot developed using the Selenium framework
Stars: ✭ 138 (-94.16%)
Educative.io Downloader: 📖 This tool downloads courses from educative.io for offline use. It uses your login credentials to download the course.
Stars: ✭ 139 (-94.12%)
Comiccrawler: An image crawler written in Python.
Stars: ✭ 185 (-92.17%)
Douyin crawler: Douyin crawler, TikTok crawler, Douyin data collection API, and Douyin video watermark removal. 100% success rate, no server required, no proxy IP required.
Stars: ✭ 169 (-92.85%)
Go spider: [Crawler framework (golang)] An awesome Go concurrent crawler (spider) framework. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.
Stars: ✭ 1,745 (-26.15%)
Poseidon: A search engine which can hold 100 trillion lines of log data.
Stars: ✭ 1,793 (-24.12%)
Bitextor: Bitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (-92.89%)
Zhihu Spider: A multithreaded Python crawler that fetches Zhihu users' profile page information.
Stars: ✭ 137 (-94.2%)