Few-Shot-Intent-DetectionFew-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.
Stars: ✭ 63 (+10.53%)
DareblopyData Reading Blocks for Python
Stars: ✭ 82 (+43.86%)
Gopup数据接口:百度、谷歌、头条、微博指数,宏观数据,利率数据,货币汇率,千里马、独角兽公司,新闻联播文字稿,影视票房数据,高校名单,疫情数据…
Stars: ✭ 1,229 (+2056.14%)
ml-datasets🌊 Machine learning dataset loaders for testing and example scripts
Stars: ✭ 40 (-29.82%)
RoapiCreate full-fledged APIs for static datasets without writing a single line of code.
Stars: ✭ 253 (+343.86%)
ColourColour Science for Python
Stars: ✭ 1,131 (+1884.21%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+24233.33%)
NemoNeMo: a toolkit for conversational AI
Stars: ✭ 3,685 (+6364.91%)
News push projectReal Time News Scraping and Recommendation System - React | Tensorflow | NLP | News Scrapers
Stars: ✭ 44 (-22.81%)
OpenPromptAn Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+3003.51%)
DeepehrChronic Disease Prediction Using Medical Notes
Stars: ✭ 220 (+285.96%)
newsletter-archiveMarkdown archive & RSS/Atom feeds for Data Is Plural.
Stars: ✭ 65 (+14.04%)
Character Based CnnImplementation of character based convolutional neural network
Stars: ✭ 205 (+259.65%)
parlitoolsA collection of useful tools for UK politics
Stars: ✭ 22 (-61.4%)
Chatbot一个可以自己进行训练的中文聊天机器人, 根据自己的语料训练出自己想要的聊天机器人,可以用于智能客服、在线问答、智能聊天等场景。目前包含seq2seq、seqGAN版本、tf2.0版本、pytorch版本。
Stars: ✭ 2,441 (+4182.46%)
Voice datasets🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
Stars: ✭ 494 (+766.67%)
HntitlenatorTest your HN title against a neural network
Stars: ✭ 184 (+222.81%)
Dataset-Sentimen-Analisis-Bahasa-IndonesiaRepositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
Stars: ✭ 38 (-33.33%)
Nlp profilerA simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+217.54%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-33.33%)
PydatasetInstant access to many datasets in Python.
Stars: ✭ 880 (+1443.86%)
Zzz Retired opensttRETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:
Stars: ✭ 146 (+156.14%)
NetEmb-DatasetsA collection of real-world networks/graphs for Network Embedding
Stars: ✭ 18 (-68.42%)
schrutepyThe Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-61.4%)
Seq2seq tutorialCode For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (+131.58%)
datasetsThe primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-66.67%)
Nlp Pretrained ModelA collection of Natural language processing pre-trained models.
Stars: ✭ 122 (+114.04%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+9724.56%)
G Reader2018年机器阅读理解技术竞赛模型,国内外1000多支队伍中BLEU-4评分排名第6, ROUGE-L评分排名第14。(未ensemble,未嵌入训练好的词向量,无dropout)
Stars: ✭ 117 (+105.26%)
spectrochempySpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-40.35%)
recurrent-defocus-deblurring-synth-dual-pixelReference github repository for the paper "Learning to Reduce Defocus Blur by Realistically Modeling Dual-Pixel Data". We propose a procedure to generate realistic DP data synthetically. Our synthesis approach mimics the optical image formation found on DP sensors and can be applied to virtual scenes rendered with standard computer software. Lev…
Stars: ✭ 30 (-47.37%)
LemminflectA python module for English lemmatization and inflection.
Stars: ✭ 105 (+84.21%)
multi-task-defocus-deblurring-dual-pixel-nimatReference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-49.12%)
Repo 2016R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (+80.7%)
Tika PythonTika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+1649.12%)
Question GenerationGiven a sentence automatically generate reading comprehension style factual questions from that sentence, such that the sentence contains answers to those questions.
Stars: ✭ 100 (+75.44%)
embeddingsEmbeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Stars: ✭ 27 (-52.63%)
Wiki SplitOne million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.
Stars: ✭ 95 (+66.67%)
disent🧶 Modular VAE disentanglement framework for python built with PyTorch Lightning ▸ Including metrics and datasets ▸ With strongly supervised, weakly supervised and unsupervised methods ▸ Easily configured and run with Hydra config ▸ Inspired by disentanglement_lib
Stars: ✭ 41 (-28.07%)
DatascienceIt consists of examples, assignments discussed in data science course taken at algorithmica.
Stars: ✭ 92 (+61.4%)
dw-jdbcJDBC driver for data.world
Stars: ✭ 17 (-70.18%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (+59.65%)
OpenmlOpen Machine Learning
Stars: ✭ 489 (+757.89%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+110.53%)
Gec PseudodataRepository of "An Empirical Study of Incorporating Pseudo Data into Grammatical Error Correction" (EMNLP-IJCNLP 2019)
Stars: ✭ 49 (-14.04%)
HealthcheckHealth Check ✔ is a Machine Learning Web Application made using Flask that can predict mainly three diseases i.e. Diabetes, Heart Disease, and Cancer.
Stars: ✭ 35 (-38.6%)
TapasEnd-to-end neural table-text understanding models.
Stars: ✭ 583 (+922.81%)
AkshareAKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+7503.51%)
SentimentAnalysisSentiment Analysis: Deep Bi-LSTM+attention model
Stars: ✭ 32 (-43.86%)