716 open source projects that are alternatives to or similar to newspaperjs

newsemble

API for fetching data from news websites.

Stars: ✭ 42 (-28.81%)

Mutual labels: scraper, news, webscraping

Newspaper

News, full-text, and article metadata extraction in Python 3.

Stars: ✭ 11,545 (+19467.8%)

Mutual labels: scraper, news, news-aggregator

Utlyz-CLI

Lets you access your FB account from the command line and returns various things, such as the number of unread notifications, messages, or friend requests you have.

Stars: ✭ 30 (-49.15%)

Mutual labels: news, webscraping

ioweb

Web Scraping Framework

Stars: ✭ 31 (-47.46%)

Mutual labels: webscraping, webcrawling

BookingScraper

🌎 🏨 Scrape Booking.com 🏨 🌎

Stars: ✭ 68 (+15.25%)

Mutual labels: scraper, webscraping

TrollHunter

Twitter Troll & Fake News Hunter - Crawls news websites and twitter to identify fake news

Stars: ✭ 38 (-35.59%)

Mutual labels: scraper, news

extractnet

A Dragnet that also extracts author, headline, date, and keywords from context

Stars: ✭ 52 (-11.86%)

Mutual labels: news, webscraping

Huginn

Create agents that monitor and act on your behalf. Your agents are standing by!

Stars: ✭ 33,694 (+57008.47%)

Mutual labels: scraper, webscraping

Polite

Be nice on the web

Stars: ✭ 253 (+328.81%)

Mutual labels: scraper, webscraping

Xidel

Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.

Stars: ✭ 335 (+467.8%)

Mutual labels: scraper, webscraping

Youtube Projects

This repository contains all the code I use in my YouTube tutorials.

Stars: ✭ 144 (+144.07%)

Mutual labels: scraper, webscraping

TradeTheEvent

Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading," in Findings of ACL 2021

Stars: ✭ 64 (+8.47%)

Mutual labels: scraper, news

PressCenters.com

News aggregator for the press releases of the Bulgarian government sites written in ASP.NET Core

Stars: ✭ 91 (+54.24%)

Mutual labels: news, news-aggregator

bing-ip2hosts

bingip2hosts is a Bing.com web scraper that discovers websites by IP address

Stars: ✭ 99 (+67.8%)

Mutual labels: scraper, webscraping

ARGUS

ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9

Stars: ✭ 68 (+15.25%)

Mutual labels: webscraping, webcrawling

metacritic api

PHP Metacritic API - Mirrored by my GitLab

Stars: ✭ 31 (-47.46%)

Mutual labels: scraper, webscraping

Autoscraper

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Stars: ✭ 4,077 (+6810.17%)

Mutual labels: scraper, webscraping

MalScraper

Scrape everything you can from MyAnimeList.net

Stars: ✭ 132 (+123.73%)

Mutual labels: scraper, news

gotor

This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.

Stars: ✭ 97 (+64.41%)

Mutual labels: webscraping, webcrawling

Instagram-Scraper-2021

Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).

Stars: ✭ 57 (-3.39%)

Mutual labels: scraper, webscraping

Rcrawler

An R web crawler and scraper

Stars: ✭ 274 (+364.41%)

Mutual labels: scraper, webscraping

Django Dynamic Scraper

Creating Scrapy scrapers via the Django admin interface

Stars: ✭ 1,024 (+1635.59%)

Mutual labels: scraper, webscraping

Mailinglistscraper

A python web scraper for public email lists.

Stars: ✭ 19 (-67.8%)

Mutual labels: scraper, webscraping

google-news-scraper

Google News Scraper for languages like Japanese, Chinese... [VPN Support]

Stars: ✭ 88 (+49.15%)

Mutual labels: news, news-aggregator

robotstxt

robots.txt file parsing and checking for R

Stars: ✭ 65 (+10.17%)

Mutual labels: scraper, webscraping

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Stars: ✭ 711 (+1105.08%)

Mutual labels: news, news-aggregator

civic-scraper

Tools for downloading agendas, minutes and other documents produced by local government

Stars: ✭ 21 (-64.41%)

Mutual labels: scraper, news

Mimo-Crawler

A web crawler that uses Firefox and JS injection to interact with webpages and crawl their content, written in Node.js.

Stars: ✭ 22 (-62.71%)

Mutual labels: scraper, webscraping

site-audit-seo

Web service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.

Stars: ✭ 91 (+54.24%)

Mutual labels: scraper

document-dl

Command line program to download documents from web portals.

Stars: ✭ 14 (-76.27%)

Mutual labels: scraper

google-this

🔎 A simple yet powerful module to retrieve organic search results and much more from Google.

Stars: ✭ 88 (+49.15%)

Mutual labels: scraper

PDAP-Scrapers

Code relating to scraping public police data.

Stars: ✭ 72 (+22.03%)

Mutual labels: scraper

chesf

CHeSF is the Chrome Headless Scraping Framework, very early alpha code for scraping JavaScript-intensive web pages

Stars: ✭ 18 (-69.49%)

Mutual labels: webscraping

scraper

A simple web scraper built around the JavaFX WebEngine

Stars: ✭ 13 (-77.97%)

Mutual labels: scraper

Inshorts-News-API

Unofficial API of Inshorts written in Flask

Stars: ✭ 87 (+47.46%)

Mutual labels: news

feedIO

A Feed Aggregator that Knows What You Want to Read.

Stars: ✭ 26 (-55.93%)

Mutual labels: news

MangaReaderScraper

Search and download mangas from the command line

Stars: ✭ 23 (-61.02%)

Mutual labels: scraper

wget-lua

Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.

Stars: ✭ 52 (-11.86%)

Mutual labels: scraper

web-scraping-engine

A simple web scraping engine supporting concurrent and anonymous scraping

Stars: ✭ 27 (-54.24%)

Mutual labels: scraper

tieba-zhuaqu

Distributed crawler for Baidu Tieba, used for Tieba data mining. Performs data analysis along the forum and user dimensions.

Stars: ✭ 56 (-5.08%)

Mutual labels: scraper

yt-videos-list

Create and automatically update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.

Stars: ✭ 64 (+8.47%)

Mutual labels: scraper

census-error-analyzer

Analyze the margin of error in U.S. census data

Stars: ✭ 15 (-74.58%)

Mutual labels: news

ScrapeM

A monadic web scraping library

Stars: ✭ 17 (-71.19%)

Mutual labels: scraper

archiveis

A simple Python wrapper for the archive.is capturing service