All Projects → N2h4 → Similar Projects or Alternatives

942 Open source projects that are alternatives of or similar to N2h4

Newspaper
News, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+6422.6%)
Mutual labels:  news, crawler, crawling
Colly
Elegant Scraper and Crawler Framework for Golang
Stars: ✭ 15,535 (+8676.84%)
Mutual labels:  crawler, crawling
Spidy
The simple, easy to use command line web crawler.
Stars: ✭ 257 (+45.2%)
Mutual labels:  crawler, crawling
Just News
a userscript project that parses korean news site and then making more readable view
Stars: ✭ 173 (-2.26%)
Mutual labels:  korean, news
flink-crawler
Continuous scalable web crawler built on top of Flink and crawler-commons
Stars: ✭ 48 (-72.88%)
Mutual labels:  crawler, crawling
Woid
Simple news aggregator displaying top stories in real time
Stars: ✭ 204 (+15.25%)
Mutual labels:  news, crawler
Dotnetcrawler
DotnetCrawler is a straightforward, lightweight web crawling/scrapying library for Entity Framework Core output based on dotnet core. This library designed like other strong crawler libraries like WebMagic and Scrapy but for enabling extandable your custom requirements. Medium link : https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
Stars: ✭ 100 (-43.5%)
Mutual labels:  crawler, crawling
Hotnewsanalysis
利用文本挖掘技术进行新闻热点关注问题分析
Stars: ✭ 93 (-47.46%)
Mutual labels:  news, crawler
Scrapyrt
HTTP API for Scrapy spiders
Stars: ✭ 637 (+259.89%)
Mutual labels:  crawler, crawling
Webster
a reliable high-level web crawling & scraping framework for Node.js.
Stars: ✭ 364 (+105.65%)
Mutual labels:  crawler, crawling
News Please
news-please - an integrated web crawler and information extractor for news that just works.
Stars: ✭ 969 (+447.46%)
Mutual labels:  news, crawler
bots-zoo
No description or website provided.
Stars: ✭ 59 (-66.67%)
Mutual labels:  crawler, crawling
Headless Chrome Crawler
Distributed crawler powered by Headless Chrome
Stars: ✭ 5,129 (+2797.74%)
Mutual labels:  crawler, crawling
Crawler
Go process used to crawl websites
Stars: ✭ 147 (-16.95%)
Mutual labels:  crawler, crawling
img-cli
An interactive Command-Line Interface Build in NodeJS for downloading a single or multiple images to disk from URL
Stars: ✭ 15 (-91.53%)
Mutual labels:  crawler, crawling
Ferret
Declarative web scraping
Stars: ✭ 4,837 (+2632.77%)
Mutual labels:  crawler, crawling
Lulu
[Unmaintained] A simple and clean video/music/image downloader 👾
Stars: ✭ 789 (+345.76%)
Mutual labels:  crawler, crawling
Antch
Antch, a fast, powerful and extensible web crawling & scraping framework for Go
Stars: ✭ 198 (+11.86%)
Mutual labels:  crawler, crawling
Taiwan News Crawlers
Scrapy-based Crawlers for news of Taiwan
Stars: ✭ 83 (-53.11%)
Mutual labels:  news, crawler
Gopa
[WIP] GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Stars: ✭ 277 (+56.5%)
Mutual labels:  crawler, crawling
Sasila
一个灵活、友好的爬虫框架
Stars: ✭ 286 (+61.58%)
Mutual labels:  crawler, crawling
Easy Scraping Tutorial
Simple but useful Python web scraping tutorial code.
Stars: ✭ 583 (+229.38%)
Mutual labels:  crawler, crawling
Crawly
Crawly, a high-level web crawling & scraping framework for Elixir.
Stars: ✭ 440 (+148.59%)
Mutual labels:  crawler, crawling
Scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Stars: ✭ 42,343 (+23822.6%)
Mutual labels:  crawler, crawling
Skycaiji
蓝天采集器是一款免费的数据采集发布爬虫软件,采用php+mysql开发,可部署在云服务器,几乎能采集所有类型的网页,无缝对接各类CMS建站程序,免登录实时发布数据,全自动无需人工干预!是网页大数据采集软件中完全跨平台的云端爬虫系统
Stars: ✭ 1,514 (+755.37%)
Mutual labels:  crawler, crawling
Squidwarc
Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head
Stars: ✭ 125 (-29.38%)
Mutual labels:  crawler, crawling
Ttbot
今日头条机器人,支持用户登陆、关注、取消关注、获取关注粉丝、发文、发悟空问答、点赞、评论、采集各种类型新闻讯息等,使用今日头条网页版API实现
Stars: ✭ 338 (+90.96%)
Mutual labels:  news, crawler
Golang News
Golang 기술 소식 뉴스레터
Stars: ✭ 233 (+31.64%)
Mutual labels:  korean, news
Arachnid
Powerful web scraping framework for Crystal
Stars: ✭ 68 (-61.58%)
Mutual labels:  crawler, crawling
Instagram Bot
An Instagram bot developed using the Selenium Framework
Stars: ✭ 138 (-22.03%)
Mutual labels:  crawler, crawling
Linkedin Profile Scraper
🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Stars: ✭ 171 (-3.39%)
Mutual labels:  crawler, crawling
Downzemall
DownZemAll! is a download manager for Windows, MacOS and Linux
Stars: ✭ 157 (-11.3%)
Mutual labels:  crawler
Bitextor
Bitextor generates translation memories from multilingual websites.
Stars: ✭ 168 (-5.08%)
Mutual labels:  crawler
Abot
Cross Platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
Stars: ✭ 1,961 (+1007.91%)
Mutual labels:  crawler
Instagram Scraper
scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot
Stars: ✭ 2,209 (+1148.02%)
Mutual labels:  crawler
Scrapedin Linkedin Crawler
Crawler for LinkedIn full profiles 2019
Stars: ✭ 170 (-3.95%)
Mutual labels:  crawler
Nytdiff
Code for the twitter bot nyt_diff
Stars: ✭ 166 (-6.21%)
Mutual labels:  news
Newswatch React Native
📺 A news app using YouTube playlists, built with React Native
Stars: ✭ 155 (-12.43%)
Mutual labels:  news
Crawler
An easy to use, powerful crawler implemented in PHP. Can execute Javascript.
Stars: ✭ 2,055 (+1061.02%)
Mutual labels:  crawler
Scrapingoutsourcing
ScrapingOutsourcing专注分享爬虫代码 尽量每周更新一个
Stars: ✭ 164 (-7.34%)
Mutual labels:  crawler
Django Newsfeed
A news curator and newsletter subscription package for Django
Stars: ✭ 155 (-12.43%)
Mutual labels:  news
Weibo wordcloud
根据关键词抓取微博数据,再生成词云
Stars: ✭ 154 (-12.99%)
Mutual labels:  crawler
Sensitivefilescan
Stars: ✭ 174 (-1.69%)
Mutual labels:  crawler
Js Flock
Collection of neat modular utilities for bumping up development in NODE and Browser
Stars: ✭ 172 (-2.82%)
Mutual labels:  sort
Php Formatter
PHP Formatter is a PHP developer friendly set of tools
Stars: ✭ 163 (-7.91%)
Mutual labels:  sort
Ordinare
Ordinare sorts gems in your Gemfile alphabetically
Stars: ✭ 153 (-13.56%)
Mutual labels:  sort
Sonatanewsbundle
Symfony SonataNewsBundle
Stars: ✭ 153 (-13.56%)
Mutual labels:  news
Algorithm
The repository algorithms implemented on the Go
Stars: ✭ 163 (-7.91%)
Mutual labels:  sort
Python3 Spider
Python爬虫实战 - 模拟登陆各大网站 包含但不限于:滑块验证、拼多多、美团、百度、bilibili、大众点评、淘宝,如果喜欢请start ❤️
Stars: ✭ 2,129 (+1102.82%)
Mutual labels:  crawler
Ngmeta
Dynamic meta tags in your AngularJS single page application
Stars: ✭ 152 (-14.12%)
Mutual labels:  crawler
Crawler For Github Trending
🕷️ A node crawler for github trending.
Stars: ✭ 172 (-2.82%)
Mutual labels:  crawler
Chatspace
핑퐁에서 만든 채팅체랑 잘 맞는 띄어쓰기 모델!
Stars: ✭ 163 (-7.91%)
Mutual labels:  korean
Android Video Listing Mvp
Android video listing with swipe view tabs based on mvp design pattern with complete functionalities like search and sort
Stars: ✭ 151 (-14.69%)
Mutual labels:  sort
Hangulize
Hangulize transcribes non-Korean words into Hangul
Stars: ✭ 152 (-14.12%)
Mutual labels:  korean
Gocrawl
Polite, slim and concurrent web crawler.
Stars: ✭ 1,962 (+1008.47%)
Mutual labels:  crawler
Jlitespider
A lite distributed Java spider framework :-)
Stars: ✭ 151 (-14.69%)
Mutual labels:  crawler
Leetcode Spider
用 node.js 爬你自己的 leetcode 解题源码
Stars: ✭ 176 (-0.56%)
Mutual labels:  crawler
Laravel Api Handler
Package providing helper functions for a Laravel REST-API
Stars: ✭ 150 (-15.25%)
Mutual labels:  sort
Proxy pool
Python爬虫代理IP池(proxy pool)
Stars: ✭ 13,964 (+7789.27%)
Mutual labels:  crawler
Tossi
Chooses correct Korean particle morphs for arbitrary words.
Stars: ✭ 160 (-9.6%)
Mutual labels:  korean
1-60 of 942 similar projects