BM25Transformer(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (-83.66%)
Rsshub🍰 Everything is RSSible
Stars: ✭ 18,111 (+5818.63%)
document-dlCommand line program to download documents from web portals.
Stars: ✭ 14 (-95.42%)
luceneApache Lucene open-source search software
Stars: ✭ 1,009 (+229.74%)
crawleyCrawley the Telegram Beholder
Stars: ✭ 24 (-92.16%)
yttrexyoutube & tiktok analysis + youchoose recommendation custmizer. backend, extensions, and tooling
Stars: ✭ 31 (-89.87%)
R4dsR for data science: a book
Stars: ✭ 3,231 (+955.88%)
RSSnotifierNode RSS reader telegram bot. Provides notification on queries-matching elements and supports multiple users.
Stars: ✭ 15 (-95.1%)
srctools for fast reading of docs
Stars: ✭ 40 (-86.93%)
CartolaExtração de dados da API do CartolaFC, análise exploratória dos dados e modelos preditivos em R e Python - 2014-20. [EN] Data munging, analysis and modeling of CartolaFC - the most popular fantasy football game in Brazil and maybe in the world. Data cover years 2014-19.
Stars: ✭ 304 (-0.65%)
ml4irMachine Learning for Information Retrieval
Stars: ✭ 75 (-75.49%)
humanparserParse a human name string into salutation, first name, middle name, last name, suffix.
Stars: ✭ 78 (-74.51%)
crawling-frameworkEasily crawl news portals or blog sites using Storm Crawler.
Stars: ✭ 22 (-92.81%)
BitmagicBitMagic Library
Stars: ✭ 263 (-14.05%)
GNN-Recommender-SystemsAn index of recommendation algorithms that are based on Graph Neural Networks.
Stars: ✭ 505 (+65.03%)
scrapy-zyte-smartproxyZyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapy
Stars: ✭ 317 (+3.59%)
eventsourcing-goEvent Sourcing + CQRS using Golang Tutorial
Stars: ✭ 75 (-75.49%)
awesome-semantic-searchA curated list of awesome resources related to Semantic Search🔎 and Semantic Similarity tasks.
Stars: ✭ 161 (-47.39%)
covid19br-pubProjeto de monitoramento de publicações oficiais relacionadas a COVID-19 no Brasil.
Stars: ✭ 12 (-96.08%)
Awesome Mlops😎 A curated list of awesome MLOps tools
Stars: ✭ 258 (-15.69%)
TubefeederA Youtube, Lbry and Peertube client made for the Pinephone
Stars: ✭ 88 (-71.24%)
pompScreen scraping and web crawling framework
Stars: ✭ 61 (-80.07%)
jsitemapgeneratorJava sitemap generator. This library generates a web sitemap, can ping Google, generate RSS feed, robots.txt and more with friendly, easy to use Java 8 functional style of programming
Stars: ✭ 38 (-87.58%)
Data visualizationA collection of my data visualizations, mostly in Python.
Stars: ✭ 294 (-3.92%)
node-red-contrib-nbrowserProvides a virtual web browser (a.k.a. "headless browser") appearing as a node.
Stars: ✭ 31 (-89.87%)
cherche📑 Neural Search
Stars: ✭ 196 (-35.95%)
GoroA High-level Machine Learning Library for Go
Stars: ✭ 265 (-13.4%)
oversmashOverwatch API library for player details and career stats
Stars: ✭ 42 (-86.27%)
webrealness.online
Stars: ✭ 15 (-95.1%)
tidyRSSAn R package for extracting 'tidy' data frames from RSS, Atom, JSON and geoRSS feeds
Stars: ✭ 62 (-79.74%)
Workbase ServerSlack alternative, email integrated, build with Meteor
Stars: ✭ 284 (-7.19%)
bubo-rssAn irrationally minimalist, static RSS feed reader you can instantly deploy on Netlify, Glitch or your own server.
Stars: ✭ 41 (-86.6%)
DatasetsA repository of pretty cool datasets that I collected for network science and machine learning research.
Stars: ✭ 302 (-1.31%)
NewspipeA web news aggregator.
Stars: ✭ 300 (-1.96%)
HnrssCustom, realtime RSS feeds for Hacker News
Stars: ✭ 277 (-9.48%)
banditoreBanditore retrieves new releases from your starred GitHub repositories and generate an Atom feed with them.
Stars: ✭ 118 (-61.44%)
instagrammerGet personal RSS feed access to your Instagrams
Stars: ✭ 15 (-95.1%)
Apify JsApify SDK — The scalable web scraping and crawling library for JavaScript/Node.js. Enables development of data extraction and web automation jobs (not only) with headless Chrome and Puppeteer.
Stars: ✭ 3,154 (+930.72%)
stream-feed-flutterStream Feed official Flutter SDK. Build your own feed experience using Dart and Flutter.
Stars: ✭ 67 (-78.1%)
dmi-instascraperA GUI for Instaloader to scrape users and hashtags with on Instagram
Stars: ✭ 21 (-93.14%)
Scikit Learn VideosJupyter notebooks from the scikit-learn video series
Stars: ✭ 3,254 (+963.4%)
asyncio-hnPython (asyncio) wrapper for hackernews api
Stars: ✭ 27 (-91.18%)
rss2emailConvert RSS feeds to emails
Stars: ✭ 72 (-76.47%)
shorter.recipesA website dedicated to making recipes from any website easy to read.
Stars: ✭ 27 (-91.18%)
EpiboardWeb Extension — A new tab page extension with material design and useful features 🆕 🎉
Stars: ✭ 262 (-14.38%)
SAPC-APCAAPCA (Accessible Perceptual Contrast Algorithm) is a new method for predicting contrast for use in emerging web standards (WCAG 3) for determining readability contrast. APCA is derived form the SAPC (S-LUV Advanced Predictive Color) which is an accessibility-oriented color appearance model designed for self-illuminated displays.
Stars: ✭ 266 (-13.07%)
rss-button-for-safariSafari web extension for news feed discovery of RSS, Atom, JSON Feed & RDF+RSS.
Stars: ✭ 16 (-94.77%)
devwebfeedFirehose of team++ resources
Stars: ✭ 128 (-58.17%)
Overview中文编程的历史、现状和展望。issue 中进行相关问题的讨论.
Stars: ✭ 282 (-7.84%)
chesfCHeSF is the Chrome Headless Scraping Framework, a very very alpha code to scrape javascript intensive web pages
Stars: ✭ 18 (-94.12%)
verssionRSS feeds of stable release versions, as found in Wikipedia.
Stars: ✭ 15 (-95.1%)
Hacker News Digest📰 A responsive interface of Hacker News with summaries and thumbnails.
Stars: ✭ 278 (-9.15%)
rssRSS订阅插件 for Hoshinobot
Stars: ✭ 28 (-90.85%)