716 Open source projects that are alternatives to or similar to newspaperjs

newsemble
API for fetching data from news websites.
Stars: ✭ 42 (-28.81%)
Mutual labels:  scraper, news, webscraping
Newspaper
News, full-text, and article metadata extraction in Python 3.
Stars: ✭ 11,545 (+19467.8%)
Mutual labels:  scraper, news, news-aggregator
Utlyz-CLI
Lets you access your FB account from the command line and returns information such as the number of unread notifications, messages, or friend requests you have.
Stars: ✭ 30 (-49.15%)
Mutual labels:  news, webscraping
ioweb
Web Scraping Framework
Stars: ✭ 31 (-47.46%)
Mutual labels:  webscraping, webcrawling
BookingScraper
🌎 🏨 Scrape Booking.com 🏨 🌎
Stars: ✭ 68 (+15.25%)
Mutual labels:  scraper, webscraping
TrollHunter
Twitter Troll & Fake News Hunter - Crawls news websites and Twitter to identify fake news
Stars: ✭ 38 (-35.59%)
Mutual labels:  scraper, news
extractnet
A Dragnet that also extracts author, headline, date, and keywords from context
Stars: ✭ 52 (-11.86%)
Mutual labels:  news, webscraping
Huginn
Create agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+57008.47%)
Mutual labels:  scraper, webscraping
Polite
Be nice on the web
Stars: ✭ 253 (+328.81%)
Mutual labels:  scraper, webscraping
Xidel
Command line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+467.8%)
Mutual labels:  scraper, webscraping
Youtube Projects
This repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+144.07%)
Mutual labels:  scraper, webscraping
TradeTheEvent
Implementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (+8.47%)
Mutual labels:  scraper, news
PressCenters.com
News aggregator, written in ASP.NET Core, for press releases from Bulgarian government sites
Stars: ✭ 91 (+54.24%)
Mutual labels:  news, news-aggregator
bing-ip2hosts
bingip2hosts is a Bing.com web scraper that discovers websites by IP address
Stars: ✭ 99 (+67.8%)
Mutual labels:  scraper, webscraping
ARGUS
ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (+15.25%)
Mutual labels:  webscraping, webcrawling
metacritic api
PHP Metacritic API - Mirrored by my GitLab
Stars: ✭ 31 (-47.46%)
Mutual labels:  scraper, webscraping
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+6810.17%)
Mutual labels:  scraper, webscraping
MalScraper
Scrape everything you can from MyAnimeList.net
Stars: ✭ 132 (+123.73%)
Mutual labels:  scraper, news
gotor
This program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.
Stars: ✭ 97 (+64.41%)
Mutual labels:  webscraping, webcrawling
Instagram-Scraper-2021
Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).
Stars: ✭ 57 (-3.39%)
Mutual labels:  scraper, webscraping
Rcrawler
An R web crawler and scraper
Stars: ✭ 274 (+364.41%)
Mutual labels:  scraper, webscraping
Django Dynamic Scraper
Creating Scrapy scrapers via the Django admin interface
Stars: ✭ 1,024 (+1635.59%)
Mutual labels:  scraper, webscraping
Mailinglistscraper
A Python web scraper for public email lists.
Stars: ✭ 19 (-67.8%)
Mutual labels:  scraper, webscraping
google-news-scraper
Google News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (+49.15%)
Mutual labels:  news, news-aggregator
robotstxt
robots.txt file parsing and checking for R
Stars: ✭ 65 (+10.17%)
Mutual labels:  scraper, webscraping
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1105.08%)
Mutual labels:  news, news-aggregator
civic-scraper
Tools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-64.41%)
Mutual labels:  scraper, news
Mimo-Crawler
A web crawler that uses Firefox and JS injection to interact with webpages and crawl their content, written in Node.js.
Stars: ✭ 22 (-62.71%)
Mutual labels:  scraper, webscraping
site-audit-seo
Web service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (+54.24%)
Mutual labels:  scraper
document-dl
Command line program to download documents from web portals.
Stars: ✭ 14 (-76.27%)
Mutual labels:  scraper
google-this
🔎 A simple yet powerful module to retrieve organic search results and much more from Google.
Stars: ✭ 88 (+49.15%)
Mutual labels:  scraper
PDAP-Scrapers
Code relating to scraping public police data.
Stars: ✭ 72 (+22.03%)
Mutual labels:  scraper
chesf
CHeSF is the Chrome Headless Scraping Framework, very early alpha code for scraping JavaScript-intensive web pages
Stars: ✭ 18 (-69.49%)
Mutual labels:  webscraping
scraper
A simple web scraper built around the JavaFX WebEngine
Stars: ✭ 13 (-77.97%)
Mutual labels:  scraper
Inshorts-News-API
Unofficial API of Inshorts written in Flask
Stars: ✭ 87 (+47.46%)
Mutual labels:  news
feedIO
A Feed Aggregator that Knows What You Want to Read.
Stars: ✭ 26 (-55.93%)
Mutual labels:  news
MangaReaderScraper
Search and download mangas from the command line
Stars: ✭ 23 (-61.02%)
Mutual labels:  scraper
wget-lua
Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-11.86%)
Mutual labels:  scraper
web-scraping-engine
A simple web scraping engine supporting concurrent and anonymous scraping
Stars: ✭ 27 (-54.24%)
Mutual labels:  scraper
tieba-zhuaqu
Distributed crawler for Baidu Tieba, used for Tieba data mining. Analyzes data along both the forum and user dimensions.
Stars: ✭ 56 (-5.08%)
Mutual labels:  scraper
yt-videos-list
Create and **automatically** update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.
Stars: ✭ 64 (+8.47%)
Mutual labels:  scraper
census-error-analyzer
Analyze the margin of error in U.S. census data
Stars: ✭ 15 (-74.58%)
Mutual labels:  news
ScrapeM
A monadic web scraping library
Stars: ✭ 17 (-71.19%)
Mutual labels:  scraper
archiveis
A simple Python wrapper for the archive.is capturing service
Stars: ✭ 152 (+157.63%)
Mutual labels:  news
VK-Scraper
Scrapes VK user's photos
Stars: ✭ 42 (-28.81%)
Mutual labels:  scraper
scraped-tvtime-api
A free TVTime API based on scraping TVTime website. No API key required
Stars: ✭ 23 (-61.02%)
Mutual labels:  scraper
blog.brasil.io
The Brasil.IO blog
Stars: ✭ 24 (-59.32%)
Mutual labels:  webscraping
scraper
A web scraper starter project
Stars: ✭ 18 (-69.49%)
Mutual labels:  scraper
daily-paper
For viewing a daily issue of the Guardian and Observer newspapers. `main` branch should be stable, current work is in `dev` branch.
Stars: ✭ 23 (-61.02%)
Mutual labels:  news
amazon-transcribe-news-media-analysis
Transcribe news audio in realtime
Stars: ✭ 21 (-64.41%)
Mutual labels:  news
NewsApp
An app that fetches latest news, headlines
Stars: ✭ 28 (-52.54%)
Mutual labels:  news
Online-News-Portal-with-Django
Daily News For You is an online news portal developed with Django and SQLite
Stars: ✭ 45 (-23.73%)
Mutual labels:  news
pinance
Python module(s) to get stock data, options data and news.
Stars: ✭ 70 (+18.64%)
Mutual labels:  news
overflow-news
📚 Don't waste time searching for good dev blog posts. Get the latest news here.
Stars: ✭ 32 (-45.76%)
Mutual labels:  news
Android-Web-Scraper
Android Web Scraper is a simple library for Android web automation. You can perform web tasks in the background to fetch website data programmatically.
Stars: ✭ 38 (-35.59%)
Mutual labels:  webscraping
bullshit-detector
🔍 Protect your loved ones from untrustworthy 🇸🇰 and 🇨🇿 content
Stars: ✭ 24 (-59.32%)
Mutual labels:  news
youtube-unofficial
Access parts of your account unavailable through normal YouTube API access.
Stars: ✭ 33 (-44.07%)
Mutual labels:  scraper
diosts
A Go scraper that validates security.txt files and outputs them in the disclose.io JSON format.
Stars: ✭ 18 (-69.49%)
Mutual labels:  scraper
web-scraping-101
An Introduction to Web Scraping
Stars: ✭ 13 (-77.97%)
Mutual labels:  webscraping
gnewsclient
An easy-to-use Python client for Google News feeds.
Stars: ✭ 42 (-28.81%)
Mutual labels:  news