trafilaturaPython & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1267.31%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-7.69%)
Giveme5WExtraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-69.23%)
AutoscraperA Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+7740.38%)
restaurant-finder-featureReviewsBuild a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-59.62%)
R Web Scraping Cheat SheetGuide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
Stars: ✭ 207 (+298.08%)
InstagoDownload/access photos, videos, stories, story highlights, postlives, following and followers of Instagram
Stars: ✭ 59 (+13.46%)
Utlyz-CLILet's you to access your FB account from the command line and returns various things number of unread notifications, messages or friend requests you have.
Stars: ✭ 30 (-42.31%)
newspaperjsNews extraction and scraping. Article Parsing
Stars: ✭ 59 (+13.46%)
Learning Social Media Analytics With RThis repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (+96.15%)
newsembleAPI for fetching data from news websites.
Stars: ✭ 42 (-19.23%)
iowebWeb Scraping Framework
Stars: ✭ 31 (-40.38%)
cl-torrentsSearching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)
Stars: ✭ 83 (+59.62%)
grailerweb scraping tool for grailed.com
Stars: ✭ 30 (-42.31%)
rymscraperPython API to extract data from rateyourmusic.com.
Stars: ✭ 63 (+21.15%)
DiurnaBasic/Classic Hacker News app, used as a Cocoa & Swift learning platform
Stars: ✭ 100 (+92.31%)
PacPawPawn package manager for SA-MP
Stars: ✭ 14 (-73.08%)
CourseDownloaderGUI app for downloading whole online courses with folder structure from one url
Stars: ✭ 20 (-61.54%)
GamerClubWeb🎮 A gaming news frontend, base on vuetify
Stars: ✭ 17 (-67.31%)
eve👻 everyday explore, Github / HackNews / V2EX / Medium / Product Hunt.
Stars: ✭ 13 (-75%)
google-news-scraperGoogle News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (+69.23%)
TopWerewolf狼人杀头条App安卓项目开源,贴吧社区。爬虫抓取了包括今日头条、优酷、sohu、百度等网站中包含狼人杀及相关的新闻
Stars: ✭ 30 (-42.31%)
youtube-audioextract videos from youtube in audio format using webscraping techniques 🎶
Stars: ✭ 68 (+30.77%)
kobe-every-shot-everA Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+26.92%)
Email-Crawler-Lead-GeneratorThis email crawler will visit all pages of a provided website and parse and save emails found to a csv file.
Stars: ✭ 47 (-9.62%)
react-native-news-appGet breaking news headlines with short description filtered by your interests and country preferences
Stars: ✭ 75 (+44.23%)
covid19.swift🌐 Small iOS app to show some COVID-19 health, data, news and tweets
Stars: ✭ 25 (-51.92%)
django-calaccess-raw-dataA Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+17.31%)
odinsonOdinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
Stars: ✭ 59 (+13.46%)
SearchBlue Brain text mining toolbox for semantic search and structured information extraction
Stars: ✭ 26 (-50%)
Data-Wrangling-with-PythonSimplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+73.08%)
dynamic-marqueeA small library for creating marquees.
Stars: ✭ 64 (+23.08%)
File-MakerGenerate data files for Wii Channels that have the latest news, forecast data, etc.
Stars: ✭ 65 (+25%)
selectorlibA library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+1.92%)
civic-scraperTools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-59.62%)
JoSH[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (+5.77%)
BoxFeedNews App 📱 built to demonstrate the use of SwiftUI 3 features, Async/Await, CoreData and MVVM architecture pattern.
Stars: ✭ 112 (+115.38%)
requestsRR interface to Python requests module
Stars: ✭ 12 (-76.92%)
Linkedin-ClientWeb scraper for grabing data from Linkedin profiles or company pages (personal project)
Stars: ✭ 42 (-19.23%)
JARVISJarvis is a simple Chatbot with a GUI capable of chatting and retrieving information and daily news from the internet for it's user using python.
Stars: ✭ 49 (-5.77%)
non-api-fb-scraperScrape public FaceBook posts from any group or user into a .csv file without needing to register for any API access
Stars: ✭ 40 (-23.08%)
LaravelNewsAppAndroid App for the Laravel news website (Unofficial)
Stars: ✭ 18 (-65.38%)
HungryHippo🦛 scrapes websites and generates rss feeds
Stars: ✭ 33 (-36.54%)
malay-datasetText corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+263.46%)
MalScraperScrape everything you can from MyAnimeList.net
Stars: ✭ 132 (+153.85%)
CatalystA VS code Extension to accelerate the process of solving problems on Codeforces.
Stars: ✭ 69 (+32.69%)
browser-automation-apiBrowser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape content or do many other things like capture a screenshot, generate pdf, extract content or execute custom Puppeteer, Playwright functions.
Stars: ✭ 24 (-53.85%)
PressCenters.comNews aggregator for the press releases of the Bulgarian government sites written in ASP.NET Core
Stars: ✭ 91 (+75%)
serializerA linearizing social tech news reader
Stars: ✭ 89 (+71.15%)
news🕸 【MDH • 前端情报】
Stars: ✭ 277 (+432.69%)
ebayMarketAnalyzerScrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and extract to excel
Stars: ✭ 116 (+123.08%)
gosquitogosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-51.92%)