All Projects → ml-datasets → Similar Projects or Alternatives

320 Open source projects that are alternatives of or similar to ml-datasets

cifair
A duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-67.5%)
Projects
🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+892.5%)
Mutual labels:  spacy, datasets
bugrepo
A collection of publicly available bug reports
Stars: ✭ 93 (+132.5%)
Mutual labels:  datasets
SMMT
Social Media Mining Toolkit (SMMT) main repository
Stars: ✭ 116 (+190%)
Mutual labels:  spacy
spacy-iwnlp
German lemmatization with IWNLP as extension for spaCy
Stars: ✭ 22 (-45%)
Mutual labels:  spacy
big-data-exploration
[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+7.5%)
Mutual labels:  datasets
spacy-server
🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+45%)
Mutual labels:  spacy
Google-Playstore-Dataset
Google PlayStore App dataset. (2.3 million App Data) and 24 attributes
Stars: ✭ 27 (-32.5%)
Mutual labels:  datasets
time-series-classification
Classifying time series using feature extraction
Stars: ✭ 75 (+87.5%)
Mutual labels:  datasets
spacy conll
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
Stars: ✭ 60 (+50%)
Mutual labels:  spacy
anonymisation
Anonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (+112.5%)
Mutual labels:  spacy
topic modelling financial news
Topic modelling on financial news with Natural Language Processing
Stars: ✭ 51 (+27.5%)
Mutual labels:  spacy
nlp-cheat-sheet-python
NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+72.5%)
Mutual labels:  spacy
json2python-models
Generate Python model classes (pydantic, attrs, dataclasses) based on JSON datasets with typing module support
Stars: ✭ 119 (+197.5%)
Mutual labels:  datasets
rs datasets
Tool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-45%)
Mutual labels:  datasets
spacy-french-models
French models for spacy
Stars: ✭ 22 (-45%)
Mutual labels:  spacy
DiscEval
Discourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-55%)
Mutual labels:  datasets
dw-jdbc
JDBC driver for data.world
Stars: ✭ 17 (-57.5%)
Mutual labels:  datasets
ginza-transformers
Use custom tokenizers in spacy-transformers
Stars: ✭ 15 (-62.5%)
Mutual labels:  spacy
parlitools
A collection of useful tools for UK politics
Stars: ✭ 22 (-45%)
Mutual labels:  datasets
biomechanics dataset
Information of public available data sets for biomechanics.
Stars: ✭ 31 (-22.5%)
Mutual labels:  datasets
bumblebee
🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+200%)
Mutual labels:  datasets
CHR
SIXray : A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+95%)
Mutual labels:  datasets
datasets
The primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-52.5%)
Mutual labels:  datasets
dagpi
Dagpi is a powerful and fast api that does image manipulation as well as serves datasets. It is fast and written in rust and python. Perfect for discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-37.5%)
Mutual labels:  datasets
mindsdb-examples
Examples for usage of Mindsdb https://www.mindsdb.com/
Stars: ✭ 25 (-37.5%)
Mutual labels:  datasets
metadat
Meta-analytic datasets for R
Stars: ✭ 21 (-47.5%)
Mutual labels:  datasets
kaggledatasets
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (+10%)
Mutual labels:  datasets
airy
💬 Open source conversational platform to power conversations with an open source Live Chat, Messengers like Facebook Messenger, WhatsApp and more - 💎 UI from Inbox to dashboards - 🤖 Integrations to Conversational AI / NLP tools and standard enterprise software - ⚡ APIs, WebSocket, Webhook - 🔧 Create any conversational experience
Stars: ✭ 299 (+647.5%)
Mutual labels:  spacy
spacy hunspell
✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (+135%)
Mutual labels:  spacy
open2ch-dialogue-corpus
おーぷん2ちゃんねるをクロールして作成した対話コーパス
Stars: ✭ 65 (+62.5%)
Mutual labels:  datasets
multi-task-defocus-deblurring-dual-pixel-nimat
Reference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-27.5%)
Mutual labels:  datasets
SkillNER
A (smart) rule based NLP module to extract job skills from text
Stars: ✭ 69 (+72.5%)
Mutual labels:  spacy
DaCy
DaCy: The State of the Art Danish NLP pipeline using SpaCy
Stars: ✭ 66 (+65%)
Mutual labels:  spacy
torchgeo
TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2712.5%)
Mutual labels:  datasets
alter-nlu
Natural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (+12.5%)
Mutual labels:  spacy
Thirukkural-Tamil-Dataset
திருக்குறள் by திருவள்ளுவர்.
Stars: ✭ 44 (+10%)
Mutual labels:  datasets
bert-tensorflow-pytorch-spacy-conversion
Instructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.
Stars: ✭ 26 (-35%)
Mutual labels:  spacy
spacy readability
spaCy pipeline component for adding text readability meta data to Doc objects.
Stars: ✭ 54 (+35%)
Mutual labels:  spacy
deplacy
CUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis
Stars: ✭ 97 (+142.5%)
Mutual labels:  spacy
scRNAseq cell cluster labeling
Scripts to run and benchmark scRNA-seq cell cluster labeling methods
Stars: ✭ 41 (+2.5%)
Mutual labels:  datasets
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
Mutual labels:  datasets
nlp workshop odsc europe20
Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Stars: ✭ 127 (+217.5%)
Mutual labels:  spacy
datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stars: ✭ 274 (+585%)
Mutual labels:  datasets
Quora QuestionPairs DL
Kaggle Competition: Using deep learning to solve quora's question pairs problem
Stars: ✭ 54 (+35%)
Mutual labels:  spacy
ake-datasets
Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+212.5%)
Mutual labels:  datasets
mlx
Machine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+230%)
Mutual labels:  datasets
NLP PEMDC
NLP Predtrained Embeddings, Models and Datasets Collections(NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (+45%)
Mutual labels:  datasets
NLP Quickbook
NLP in Python with Deep Learning
Stars: ✭ 516 (+1190%)
Mutual labels:  spacy
Dataset-Sentimen-Analisis-Bahasa-Indonesia
Repositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
Stars: ✭ 38 (-5%)
Mutual labels:  datasets
rita-dsl
A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format
Stars: ✭ 60 (+50%)
Mutual labels:  spacy
ling
Natural Language Processing Toolkit in Golang
Stars: ✭ 57 (+42.5%)
Mutual labels:  spacy
dh-core
Functional data science
Stars: ✭ 123 (+207.5%)
Mutual labels:  datasets
agile
🌌 Global State and Logic Library for JavaScript/Typescript applications
Stars: ✭ 90 (+125%)
Mutual labels:  spacy
humanflow2
Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-7.5%)
Mutual labels:  datasets
anonymization-api
How to build and deploy an anonymization API with FastAPI
Stars: ✭ 51 (+27.5%)
Mutual labels:  spacy
dataset
dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections
Stars: ✭ 21 (-47.5%)
Mutual labels:  datasets
pynsett
A programmable relation extraction tool
Stars: ✭ 25 (-37.5%)
Mutual labels:  spacy
newt
Natural World Tasks
Stars: ✭ 24 (-40%)
Mutual labels:  datasets
spectrochempy
SpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-15%)
Mutual labels:  datasets
1-60 of 320 similar projects