newsembleAPI for fetching data from news websites.
Stars: ✭ 42 (-28.81%)
NewspaperNews, full-text, and article metadata extraction in Python 3. Advanced docs:
Stars: ✭ 11,545 (+19467.8%)
Utlyz-CLILet's you to access your FB account from the command line and returns various things number of unread notifications, messages or friend requests you have.
Stars: ✭ 30 (-49.15%)
iowebWeb Scraping Framework
Stars: ✭ 31 (-47.46%)
TrollHunterTwitter Troll & Fake News Hunter - Crawls news websites and twitter to identify fake news
Stars: ✭ 38 (-35.59%)
extractnetA Dragnet that also extract author, headline, date, keywords from context
Stars: ✭ 52 (-11.86%)
HuginnCreate agents that monitor and act on your behalf. Your agents are standing by!
Stars: ✭ 33,694 (+57008.47%)
PoliteBe nice on the web
Stars: ✭ 253 (+328.81%)
XidelCommand line tool to download and extract data from HTML/XML pages or JSON-APIs, using CSS, XPath 3.0, XQuery 3.0, JSONiq or pattern matching. It can also create new or transformed XML/HTML/JSON documents.
Stars: ✭ 335 (+467.8%)
Youtube ProjectsThis repository contains all the code I use in my YouTube tutorials.
Stars: ✭ 144 (+144.07%)
TradeTheEventImplementation of "Trade the Event: Corporate Events Detection for News-Based Event-Driven Trading." In Findings of ACL2021
Stars: ✭ 64 (+8.47%)
PressCenters.comNews aggregator for the press releases of the Bulgarian government sites written in ASP.NET Core
Stars: ✭ 91 (+54.24%)
bing-ip2hostsbingip2hosts is a Bing.com web scraper that discovers websites by IP address
Stars: ✭ 99 (+67.8%)
ARGUSARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of different websites. On the websites, ARGUS is able to perform tasks like scraping texts or collecting hyperlinks between websites. See: https://link.springer.com/article/10.1007/s11192-020-03726-9
Stars: ✭ 68 (+15.25%)
metacritic apiPHP Metacritic API - Mirrored by my GitLab
Stars: ✭ 31 (-47.46%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+6810.17%)
MalScraperScrape everything you can from MyAnimeList.net
Stars: ✭ 132 (+123.73%)
gotorThis program provides efficient web scraping services for Tor and non-Tor sites. The program has both a CLI and REST API.
Stars: ✭ 97 (+64.41%)
Instagram-Scraper-2021Scrape Instagram content and stories anonymously, using a new technique based on the har file (No Token + No public API).
Stars: ✭ 57 (-3.39%)
RcrawlerAn R web crawler and scraper
Stars: ✭ 274 (+364.41%)
google-news-scraperGoogle News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (+49.15%)
robotstxtrobots.txt file parsing and checking for R
Stars: ✭ 65 (+10.17%)
trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1105.08%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-64.41%)
Mimo-CrawlerA web crawler that uses Firefox and js injection to interact with webpages and crawl their content, written in nodejs.
Stars: ✭ 22 (-62.71%)
site-audit-seoWeb service and CLI tool for SEO site audit: crawl site, lighthouse all pages, view public reports in browser. Also output to console, json, csv, xlsx, Google Drive.
Stars: ✭ 91 (+54.24%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-76.27%)
google-this🔎 A simple yet powerful module to retrieve organic search results and much more from Google.
Stars: ✭ 88 (+49.15%)
PDAP-ScrapersCode relating to scraping public police data.
Stars: ✭ 72 (+22.03%)
chesfCHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages
Stars: ✭ 18 (-69.49%)
scraperA simple web scraper built around the JavaFX WebEngine
Stars: ✭ 13 (-77.97%)
feedIOA Feed Aggregator that Knows What You Want to Read.
Stars: ✭ 26 (-55.93%)
wget-luaWget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.
Stars: ✭ 52 (-11.86%)
web-scraping-engineA simple web scraping engine supporting concurrent and anonymous scraping
Stars: ✭ 27 (-54.24%)
tieba-zhuaqu百度贴吧分布式爬虫,用于贴吧数据挖掘。从贴吧维度和用户维度进行数据分析
Stars: ✭ 56 (-5.08%)
yt-videos-listCreate and **automatically** update a list of all videos on a YouTube channel (in txt/csv/md form) via YouTube bot with end-to-end web scraping - no API tokens required. Multi-threaded support for YouTube videos list updates.
Stars: ✭ 64 (+8.47%)
ScrapeMA monadic web scraping library
Stars: ✭ 17 (-71.19%)
archiveisA simple Python wrapper for the archive.is capturing service
Stars: ✭ 152 (+157.63%)
VK-ScraperScrapes VK user's photos
Stars: ✭ 42 (-28.81%)
scraped-tvtime-apiA free TVTime API based on scraping TVTime website. No API key required
Stars: ✭ 23 (-61.02%)
scraperA web scraper starter project
Stars: ✭ 18 (-69.49%)
daily-paperFor viewing a daily issue of the Guardian and Observer newspapers. `main` branch should be stable, current work is in `dev` branch.
Stars: ✭ 23 (-61.02%)
NewsAppAn app that fetches latest news, headlines
Stars: ✭ 28 (-52.54%)
pinancePython module(s) to get stock data, options data and news.
Stars: ✭ 70 (+18.64%)
overflow-news📚 Don't waste time searching for good dev blog posts. Get the latest news here.
Stars: ✭ 32 (-45.76%)
Android-Web-ScraperAndroid Web Scraper is a simple library for android web automation. You can perform web task in background to fetch website data programmatically.
Stars: ✭ 38 (-35.59%)
bullshit-detector🔍 Chráňte vašich blízkych pred nedôveryhodným 🇸🇰 a 🇨🇿 obsahom
Stars: ✭ 24 (-59.32%)
youtube-unofficialAccess parts of your account unavailable through normal YouTube API access.
Stars: ✭ 33 (-44.07%)
diostsA Go scraper that validates security.txt files and outputs them in the disclose.io JSON format.
Stars: ✭ 18 (-69.49%)
gnewsclientAn easy-to-use python client for Google News feeds.
Stars: ✭ 42 (-28.81%)