615 open source projects that are alternatives to or similar to extractnet

trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments
Stars: ✭ 711 (+1267.31%)
text-mining-corona-articles
Text Mining for Indonesian Online News Articles About Corona
Stars: ✭ 15 (-71.15%)
Uc Davis Cs Exams Analysis
📈 Regression and Classification with UC Davis student quiz data and exam data
Stars: ✭ 33 (-36.54%)
Mutual labels:  text-mining, web-scraping
Text-Analysis
Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-7.69%)
Mutual labels:  text-mining, web-scraping
Giveme5W
Extraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-69.23%)
Mutual labels:  news, news-articles
Autoscraper
A Smart, Automatic, Fast and Lightweight Web Scraper for Python
Stars: ✭ 4,077 (+7740.38%)
Mutual labels:  web-scraping, webscraping
restaurant-finder-featureReviews
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
Stars: ✭ 21 (-59.62%)
Mutual labels:  text-mining, web-scraping
R Web Scraping Cheat Sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and Rselenium.
Stars: ✭ 207 (+298.08%)
Mutual labels:  web-scraping, webscraping
Instago
Download/access photos, videos, stories, story highlights, postlives, following and followers of Instagram
Stars: ✭ 59 (+13.46%)
Mutual labels:  web-scraping, webscraping
Utlyz-CLI
Lets you access your FB account from the command line and returns various things, such as the number of unread notifications, messages, or friend requests you have.
Stars: ✭ 30 (-42.31%)
Mutual labels:  news, webscraping
newspaperjs
News extraction and scraping. Article Parsing
Stars: ✭ 59 (+13.46%)
Mutual labels:  news, webscraping
Learning Social Media Analytics With R
This repository contains code and bonus content which will be added from time to time for the book "Learning Social Media Analytics with R" by Packt
Stars: ✭ 102 (+96.15%)
Mutual labels:  text-mining, news
newsemble
API for fetching data from news websites.
Stars: ✭ 42 (-19.23%)
Mutual labels:  news, webscraping
BookingScraper
🌎 🏨 Scrape Booking.com 🏨 🌎
Stars: ✭ 68 (+30.77%)
Mutual labels:  web-scraping, webscraping
ioweb
Web Scraping Framework
Stars: ✭ 31 (-40.38%)
Mutual labels:  web-scraping, webscraping
cl-torrents
Searching torrents on popular trackers - CLI, readline, GUI, web client. Tutorial and binaries (issue tracker on https://gitlab.com/vindarel/cl-torrents/)
Stars: ✭ 83 (+59.62%)
Mutual labels:  web-scraping
grailer
web scraping tool for grailed.com
Stars: ✭ 30 (-42.31%)
Mutual labels:  web-scraping
rymscraper
Python API to extract data from rateyourmusic.com.
Stars: ✭ 63 (+21.15%)
Mutual labels:  web-scraping
Diurna
Basic/Classic Hacker News app, used as a Cocoa & Swift learning platform
Stars: ✭ 100 (+92.31%)
Mutual labels:  news
PacPaw
Pawn package manager for SA-MP
Stars: ✭ 14 (-73.08%)
Mutual labels:  webscraping
CourseDownloader
GUI app for downloading whole online courses with folder structure from one url
Stars: ✭ 20 (-61.54%)
Mutual labels:  webscraping
jser.github.io
Blog repository for JSer.info
Stars: ✭ 90 (+73.08%)
Mutual labels:  news
GamerClubWeb
🎮 A gaming news frontend, based on Vuetify
Stars: ✭ 17 (-67.31%)
Mutual labels:  news
Twitter-Sentiment-Analyzer
Twitter Sentiment Analyzer
Stars: ✭ 13 (-75%)
Mutual labels:  text-mining
eve
👻 Explore every day: GitHub / Hacker News / V2EX / Medium / Product Hunt.
Stars: ✭ 13 (-75%)
Mutual labels:  news
google-news-scraper
Google News Scraper for languages like Japanese, Chinese... [VPN Support]
Stars: ✭ 88 (+69.23%)
Mutual labels:  news
TopWerewolf
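Open-source Android project for the Werewolf Headlines app, with a Tieba-style community. A crawler collects Werewolf-related news from sites including Toutiao, Youku, Sohu, and Baidu.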
Stars: ✭ 30 (-42.31%)
Mutual labels:  news
youtube-audio
Extract videos from YouTube in audio format using web scraping techniques 🎶
Stars: ✭ 68 (+30.77%)
Mutual labels:  webscraping
kobe-every-shot-ever
A Los Angeles Times analysis of every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+26.92%)
Mutual labels:  news
Email-Crawler-Lead-Generator
This email crawler visits all pages of a provided website, then parses and saves the emails it finds to a CSV file.
Stars: ✭ 47 (-9.62%)
Mutual labels:  webscraping
react-native-news-app
Get breaking news headlines with short description filtered by your interests and country preferences
Stars: ✭ 75 (+44.23%)
Mutual labels:  news
covid19.swift
🌐 Small iOS app to show some COVID-19 health, data, news and tweets
Stars: ✭ 25 (-51.92%)
Mutual labels:  news
django-calaccess-raw-data
A Django app to download, extract and load campaign finance and lobbying activity data from the California Secretary of State's CAL-ACCESS database
Stars: ✭ 61 (+17.31%)
Mutual labels:  news
odinson
Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.
Stars: ✭ 59 (+13.46%)
Mutual labels:  text-mining
Search
Blue Brain text mining toolbox for semantic search and structured information extraction
Stars: ✭ 26 (-50%)
Mutual labels:  text-mining
Data-Wrangling-with-Python
Simplify your ETL processes with these hands-on data sanitation tips, tricks, and best practices
Stars: ✭ 90 (+73.08%)
Mutual labels:  web-scraping
dynamic-marquee
A small library for creating marquees.
Stars: ✭ 64 (+23.08%)
Mutual labels:  news
File-Maker
Generate data files for Wii Channels that have the latest news, forecast data, etc.
Stars: ✭ 65 (+25%)
Mutual labels:  news
selectorlib
A library to read a YML file with Xpath or CSS Selectors and extract data from HTML pages using them
Stars: ✭ 53 (+1.92%)
Mutual labels:  web-scraping
civic-scraper
Tools for downloading agendas, minutes and other documents produced by local government
Stars: ✭ 21 (-59.62%)
Mutual labels:  news
JoSH
[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding
Stars: ✭ 55 (+5.77%)
Mutual labels:  text-mining
BoxFeed
News App 📱 built to demonstrate the use of SwiftUI 3 features, Async/Await, CoreData and MVVM architecture pattern.
Stars: ✭ 112 (+115.38%)
Mutual labels:  news
requestsR
R interface to Python requests module
Stars: ✭ 12 (-76.92%)
Mutual labels:  webscraping
Linkedin-Client
Web scraper for grabbing data from LinkedIn profiles or company pages (personal project)
Stars: ✭ 42 (-19.23%)
Mutual labels:  web-scraping
JARVIS
Jarvis is a simple Python chatbot with a GUI, capable of chatting and of retrieving information and daily news from the internet for its user.
Stars: ✭ 49 (-5.77%)
Mutual labels:  news
non-api-fb-scraper
Scrape public Facebook posts from any group or user into a .csv file without needing to register for any API access
Stars: ✭ 40 (-23.08%)
Mutual labels:  webscraping
LaravelNewsApp
Android App for the Laravel news website (Unofficial)
Stars: ✭ 18 (-65.38%)
Mutual labels:  news
HungryHippo
🦛 scrapes websites and generates rss feeds
Stars: ✭ 33 (-36.54%)
Mutual labels:  news
malay-dataset
Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+263.46%)
Mutual labels:  text-mining
MalScraper
Scrape everything you can from MyAnimeList.net
Stars: ✭ 132 (+153.85%)
Mutual labels:  news
Catalyst
A VS Code extension to accelerate the process of solving problems on Codeforces.
Stars: ✭ 69 (+32.69%)
Mutual labels:  webscraping
browser-automation-api
Browser automation API for repetitive web-based tasks, with a friendly user interface. You can use it to scrape or extract content, capture a screenshot, generate a PDF, or execute custom Puppeteer or Playwright functions.
Stars: ✭ 24 (-53.85%)
Mutual labels:  webscraping
cnnindonesia-news-api
Unofficial CNN Indonesia news API
Stars: ✭ 42 (-19.23%)
Mutual labels:  news
Youtube-Scraping-Selenium
Automatically creates a YouTube channel dashboard
Stars: ✭ 21 (-59.62%)
Mutual labels:  webscraping
PressCenters.com
News aggregator for the press releases of the Bulgarian government sites written in ASP.NET Core
Stars: ✭ 91 (+75%)
Mutual labels:  news
serializer
A linearizing social tech news reader
Stars: ✭ 89 (+71.15%)
Mutual labels:  news
news
🕸 [MDH • Front-End News]
Stars: ✭ 277 (+432.69%)
Mutual labels:  news
text-preprocess-python
Text preprocessing tools in python.
Stars: ✭ 22 (-57.69%)
Mutual labels:  text-cleaning
ebayMarketAnalyzer
Scrape all eBay sold listings to determine average/median pricing, plot listings over time with trend lines, and export to Excel
Stars: ✭ 116 (+123.08%)
Mutual labels:  webscraping
gosquito
gosquito ("go" + "mosquito") is a pluggable tool for data gathering, data processing and data transmitting to various destinations.
Stars: ✭ 25 (-51.92%)
Mutual labels:  news