684 open source projects that are alternatives to or similar to ake-datasets

deep-keyphrase
seq2seq-based keyphrase generation model sets, including copyrnn, copycnn, and copytransformer
Stars: ✭ 51 (-59.2%)
kex
Kex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (-63.2%)
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-52%)
Machine Learning Resources
A curated list of awesome machine learning frameworks, libraries, courses, books and many more.
Stars: ✭ 226 (+80.8%)
Mutual labels:  datasets, nlp-machine-learning
Openml R
R package to interface with OpenML
Stars: ✭ 81 (-35.2%)
Mutual labels:  benchmarking, datasets
Awesome Nlp Polish
A curated list of resources dedicated to Natural Language Processing (NLP) in Polish. Models, tools, datasets.
Stars: ✭ 153 (+22.4%)
Mutual labels:  datasets, nlp-machine-learning
Dan Jurafsky Chris Manning Nlp
My solution to the Natural Language Processing course taught by Dan Jurafsky and Chris Manning in Winter 2012.
Stars: ✭ 124 (-0.8%)
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+1002.4%)
Mutual labels:  datasets, nlp-machine-learning
Wongnai Corpus
Collection of Wongnai's datasets
Stars: ✭ 57 (-54.4%)
Mutual labels:  datasets, nlp-machine-learning
query-wellformedness
25,100 queries from the Paralex corpus (Fader et al., 2013) annotated with human ratings of whether they are well-formed natural language questions.
Stars: ✭ 80 (-36%)
SENet-for-Weakly-Supervised-Relation-Extraction
No description or website provided.
Stars: ✭ 39 (-68.8%)
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (-4%)
Mutual labels:  datasets
bookworm
📚 social networks from novels
Stars: ✭ 72 (-42.4%)
Mutual labels:  information-retrieval
PiBenchmarks
Raspberry Pi benchmarking scripts featuring a storage benchmark with score
Stars: ✭ 69 (-44.8%)
Mutual labels:  benchmarking
vlainic.github.io
My GitHub blog: things you might be interested in, and probably not...
Stars: ✭ 26 (-79.2%)
Mutual labels:  nlp-machine-learning
FieldedSDM
Fielded Sequential Dependence Model (code and runs)
Stars: ✭ 32 (-74.4%)
Mutual labels:  information-retrieval
NLP PEMDC
NLP Pretrained Embeddings, Models and Datasets Collections (NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (-53.6%)
Mutual labels:  datasets
Question-Answering-based-on-SQuAD
Question Answering System using BiDAF Model on SQuAD v2.0
Stars: ✭ 20 (-84%)
Mutual labels:  nlp-machine-learning
tutorials
A tutorial series by Preferred.AI
Stars: ✭ 136 (+8.8%)
Mutual labels:  information-retrieval
TextFeatureSelection
Python library for feature selection for text features. It provides a filter method, a genetic algorithm, and TextFeatureSelectionEnsemble for improving text classification models.
Stars: ✭ 42 (-66.4%)
Mutual labels:  nlp-machine-learning
src
tools for fast reading of docs
Stars: ✭ 40 (-68%)
Mutual labels:  information-retrieval
ml4ir
Machine Learning for Information Retrieval
Stars: ✭ 75 (-40%)
Mutual labels:  information-retrieval
graphsim
R package: Simulate Expression data from igraph network using mvtnorm (CRAN; JOSS)
Stars: ✭ 16 (-87.2%)
Mutual labels:  benchmarking
multi-task-defocus-deblurring-dual-pixel-nimat
Reference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-76.8%)
Mutual labels:  datasets
RadiologyReportEmbedding
Intelligent Word Embeddings of Free-Text Radiology Reports
Stars: ✭ 22 (-82.4%)
Mutual labels:  nlp-machine-learning
mrs testbed
Multi-robot Exploration Testbed
Stars: ✭ 26 (-79.2%)
Mutual labels:  benchmarking
cs6101
The Web IR / NLP Group (WING)'s public reading group at the National University of Singapore.
Stars: ✭ 17 (-86.4%)
Mutual labels:  information-retrieval
datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stars: ✭ 274 (+119.2%)
Mutual labels:  datasets
mindsdb-examples
Examples for usage of Mindsdb https://www.mindsdb.com/
Stars: ✭ 25 (-80%)
Mutual labels:  datasets
spectrochempy
SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-72.8%)
Mutual labels:  datasets
pytorch-translm
An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-82.4%)
Mutual labels:  nlp-machine-learning
memex-gate
General Architecture for Text Engineering
Stars: ✭ 47 (-62.4%)
Mutual labels:  information-retrieval
DRhard
SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.
Stars: ✭ 93 (-25.6%)
Mutual labels:  information-retrieval
CVAE Dial
CVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity"
Stars: ✭ 16 (-87.2%)
Mutual labels:  nlp-machine-learning
BM25Transformer
(Python) transform a document-term matrix to an Okapi/BM25 representation
Stars: ✭ 50 (-60%)
Mutual labels:  information-retrieval
ezab
A suite of tools for benchmarking (load testing) web servers and databases
Stars: ✭ 16 (-87.2%)
Mutual labels:  benchmarking
Spatio-Temporal-papers
This project is a collection of recent research in areas such as new infrastructure and urban computing, including white papers, academic papers, AI labs, and datasets.
Stars: ✭ 180 (+44%)
Mutual labels:  datasets
json2python-models
Generate Python model classes (pydantic, attrs, dataclasses) based on JSON datasets with typing module support
Stars: ✭ 119 (-4.8%)
Mutual labels:  datasets
language-benchmarks
A simple benchmark system for compiled and interpreted languages.
Stars: ✭ 21 (-83.2%)
Mutual labels:  benchmarking
query completion
Personalized Query Completion
Stars: ✭ 24 (-80.8%)
Mutual labels:  information-retrieval
GNN-Recommender-Systems
An index of recommendation algorithms that are based on Graph Neural Networks.
Stars: ✭ 505 (+304%)
Mutual labels:  information-retrieval
datasets
The primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-84.8%)
Mutual labels:  datasets
forest-benchmarking
A library for quantum characterization, verification, validation (QCVV), and benchmarking using pyQuil.
Stars: ✭ 41 (-67.2%)
Mutual labels:  benchmarking
vnla
Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
Stars: ✭ 60 (-52%)
Mutual labels:  nlp-machine-learning
knime-textprocessing
KNIME - Text Processing Extension (Labs)
Stars: ✭ 17 (-86.4%)
Mutual labels:  nlp-machine-learning
elastic transformers
Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Stars: ✭ 153 (+22.4%)
Mutual labels:  nlp-machine-learning
Quora question pairs NLP Kaggle
Quora Kaggle Competition: Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training
Stars: ✭ 17 (-86.4%)
Mutual labels:  nlp-machine-learning
allsummarizer
Multilingual automatic text summarizer using statistical approach and extraction
Stars: ✭ 28 (-77.6%)
Mutual labels:  information-retrieval
rake new2
A Python library that enables smooth keyword extraction from any text using the RAKE (Rapid Automatic Keyword Extraction) algorithm.
Stars: ✭ 23 (-81.6%)
Mutual labels:  keyword-extraction
kaggledatasets
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (-64.8%)
Mutual labels:  datasets
EDTA
Extensive de-novo TE Annotator
Stars: ✭ 210 (+68%)
Mutual labels:  benchmarking
CS224NHomeworks
CS224N 2019 Homeworks
Stars: ✭ 18 (-85.6%)
Mutual labels:  nlp-machine-learning
mlconjug3
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-62.4%)
Mutual labels:  nlp-machine-learning
kg one2set
Code for our ACL 2021 paper "One2Set: Generating Diverse Keyphrases as a Set"
Stars: ✭ 58 (-53.6%)
Mutual labels:  keyphrase-generation
embeddings
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks. The initial version of the library focuses on the Polish language.
Stars: ✭ 27 (-78.4%)
Mutual labels:  nlp-machine-learning
IP-Tracker
Track any IP address with IP-Tracker. IP-Tracker is developed for Linux and Termux. You can retrieve information about any IP address using IP-Tracker.
Stars: ✭ 53 (-57.6%)
Mutual labels:  information-retrieval
ml-nlp-services
Machine learning, deep learning, natural language processing
Stars: ✭ 23 (-81.6%)
Mutual labels:  information-retrieval
cifair
A duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-89.6%)
Mutual labels:  datasets
nlp classification workshop
NLP Classification Workshop
Stars: ✭ 22 (-82.4%)
Mutual labels:  nlp-machine-learning
php-orm-benchmark
The benchmark to compare performance of PHP ORM solutions.
Stars: ✭ 82 (-34.4%)
Mutual labels:  benchmarking