1004 open source projects that are alternatives to or similar to Newspaper

Scrapoxy

Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!

Stars: ✭ 1,322 (-88.55%)

Mutual labels: crawler, scraper

MyCrawler

My crawler collection

Stars: ✭ 55 (-99.52%)

Mutual labels: crawler, scraper

Google Play Scraper

Node.js scraper to get data from Google Play

Stars: ✭ 1,606 (-86.09%)

Mutual labels: crawler, scraper

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (-97.63%)

Mutual labels: crawler, scraper

Onegram

This repository is no longer maintained.

Stars: ✭ 137 (-98.81%)

Mutual labels: crawler, scraper

Skycaiji

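Skycaiji (蓝天采集器) is free crawler software for collecting and publishing data, developed with PHP + MySQL and deployable on cloud servers. It can collect almost any type of web page, integrates seamlessly with all kinds of CMS site-building programs, publishes data in real time without login, and runs fully automatically with no manual intervention. It is a fully cross-platform, cloud-based crawler system for web big-data collection.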

Stars: ✭ 1,514 (-86.89%)

Mutual labels: crawler, crawling

Freshonions Torscraper

Fresh Onions is an open source TOR spider / hidden service onion crawler hosted at zlal32teyptf4tvi.onion

Stars: ✭ 348 (-96.99%)

Mutual labels: crawler, scraper

Dataflowkit

Extract structured data from websites. Website scraping.

Stars: ✭ 456 (-96.05%)

Mutual labels: scraper, crawling

Scrapit

Scraping scripts for various websites.

Stars: ✭ 25 (-99.78%)

Mutual labels: crawler, scraper

Pypergrabber

Fetches PubMed article IDs (PMIDs) from email inbox, then crawls PubMed, Google Scholar and Sci-Hub for respective PDF files.

Stars: ✭ 14 (-99.88%)

Mutual labels: crawler, scraper

Crawler

A high performance web crawler in Elixir.

Stars: ✭ 781 (-93.24%)

Mutual labels: crawler, scraper

civic-scraper

Tools for downloading agendas, minutes and other documents produced by local government

Stars: ✭ 21 (-99.82%)

Mutual labels: scraper, news

Social Scraper

A collection of scripts for crawling data from social networks and Vietnamese-language websites

Stars: ✭ 47 (-99.59%)

Mutual labels: crawler, scraper

Crawler

Go process used to crawl websites

Stars: ✭ 147 (-98.73%)

Mutual labels: crawler, crawling

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (-98.75%)

Mutual labels: crawler, scraper

Avbook

AV movie management system; crawler for avmoo, javbus, and javlibrary; online AV video library; AV magnet link database. Japanese Adult Video Library, Adult Video Magnet Links - Japanese Adult Video Database

Stars: ✭ 8,133 (-29.55%)

Mutual labels: crawler, scraper

Arachnid

Powerful web scraping framework for Crystal

Stars: ✭ 68 (-99.41%)

Mutual labels: crawler, crawling

Google Play Scraper

Google Play scraper for Python inspired by <facundoolano/google-play-scraper>

Stars: ✭ 143 (-98.76%)

Mutual labels: crawler, scraper

Antch

Antch, a fast, powerful and extensible web crawling & scraping framework for Go

Stars: ✭ 198 (-98.28%)

Mutual labels: crawler, crawling

Jvppeteer

Headless Chrome for Java (Java crawler)

Stars: ✭ 193 (-98.33%)

Mutual labels: crawler, scraper

Instagram Bot

An Instagram bot developed using the Selenium Framework

Stars: ✭ 138 (-98.8%)

Mutual labels: crawler, crawling

Skrape.it

A Kotlin-based testing/scraping/parsing library providing the ability to analyze and extract data from HTML (server & client-side rendered). It places particular emphasis on ease of use and a high level of readability by providing an intuitive DSL. It aims to be a testing lib, but can also be used to scrape websites in a convenient fashion.

Stars: ✭ 231 (-98%)

Mutual labels: crawler, scraper

Annie

👾 Fast and simple video download library and CLI tool written in Go

Stars: ✭ 16,369 (+41.78%)

Mutual labels: crawler, scraper

Ruiji.net

crawler framework, distributed crawler extractor

Stars: ✭ 220 (-98.09%)

Mutual labels: crawler, scraper

Goscraper

Golang pkg to quickly return a preview of a webpage (title/description/images)

Stars: ✭ 72 (-99.38%)

Mutual labels: crawler, scraper

MalScraper

Scrape everything you can from MyAnimeList.net

Stars: ✭ 132 (-98.86%)

Mutual labels: scraper, news

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-99.55%)

Mutual labels: scraper, crawling

Scrapedin

LinkedIn Scraper (currently working 2020)

Stars: ✭ 453 (-96.08%)

Mutual labels: crawler, scraper

News Please

news-please - an integrated web crawler and information extractor for news that just works.

Stars: ✭ 969 (-91.61%)

Mutual labels: news, crawler

Jd Autobuy

Python crawler for JD.com: automatic login and online flash purchasing of products

Stars: ✭ 1,174 (-89.83%)

Mutual labels: crawler, scraper

Wombat

Lightweight Ruby web crawler/scraper with an elegant DSL which extracts structured data from pages.

Stars: ✭ 1,220 (-89.43%)

Mutual labels: crawler, scraper

Wxapp toutiaonews

📰 WeChat Mini Program: Toutiao News

Stars: ✭ 119 (-98.97%)

Mutual labels: news

Fawkes

Fawkes is a tool to search for targets vulnerable to SQL injection. It performs the search using the Google search engine.

Stars: ✭ 108 (-99.06%)

Mutual labels: crawler

Free proxy website

A collection of websites for obtaining free SOCKS/HTTPS/HTTP proxies

Stars: ✭ 119 (-98.97%)

Mutual labels: crawler

Webmagic

A scalable web crawler framework for Java.

Stars: ✭ 10,186 (-11.77%)

Mutual labels: crawler

Reactriot2017 Dotamania

🌐 Web scraping made easy with the visual 🗺 mind map editor to JSON

Stars: ✭ 107 (-99.07%)

Mutual labels: scraper

Sentinel Crawler

Xenomorph Crawler, a concise, declarative, and observable distributed crawler (Node / Go / Java / Rust) for web, RDB, and OS; it can also act as a monitor (with Prometheus) or ETL for infrastructure 💫 Multi-language executor, distributed crawler

Stars: ✭ 118 (-98.98%)

Mutual labels: crawler

Crawler Detect

🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent

Stars: ✭ 1,549 (-86.58%)

Mutual labels: crawler

Haxe.io

The home of the Haxe Roundups (work in progress)

Stars: ✭ 106 (-99.08%)

Mutual labels: news

Mm131

Image crawler for the MM131 website 🚨

Stars: ✭ 129 (-98.88%)

Mutual labels: crawler

Fontobfuscator

Font obfuscation service

Stars: ✭ 125 (-98.92%)

Mutual labels: crawler

Docs

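Source code for "Data Collection: From Getting Started to Giving Up" (《数据采集从入门到放弃》). Contents: introduction to crawlers, the job market, and interview questions for crawler engineers; introduction to the HTTP protocol; using Requests; introduction to the XPath parser; MongoDB and MySQL; multithreaded crawlers; introduction to Scrapy; introduction to Scrapy-redis; deploying with Docker; managing a Docker cluster with Nomad; querying Docker logs with EFK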

Stars: ✭ 118 (-98.98%)

Mutual labels: crawler

Crawler

Crawler, HTTP proxy, simulated login!

Stars: ✭ 106 (-99.08%)

Mutual labels: crawler

Moodle Downloader 2

A Moodle downloader that downloads course content fast from Moodle (e.g. lecture PDFs)

Stars: ✭ 118 (-98.98%)

Mutual labels: crawler

D4n155

OWASP D4N155 - Intelligent and dynamic wordlist using OSINT