cifairA duplicate-free variant of the CIFAR test set.
Stars: ✭ 13 (-67.5%)
Projects🪐 End-to-end NLP workflows from prototype to production
Stars: ✭ 397 (+892.5%)
bugrepoA collection of publicly available bug reports
Stars: ✭ 93 (+132.5%)
SMMTSocial Media Mining Toolkit (SMMT) main repository
Stars: ✭ 116 (+190%)
spacy-iwnlpGerman lemmatization with IWNLP as extension for spaCy
Stars: ✭ 22 (-45%)
big-data-exploration[Archive] Intern project - Big Data Exploration using MongoDB - This Repository is NOT a supported MongoDB product
Stars: ✭ 43 (+7.5%)
spacy-server🦜 Containerized HTTP API for industrial-strength NLP via spaCy and sense2vec
Stars: ✭ 58 (+45%)
spacy conllPipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
Stars: ✭ 60 (+50%)
anonymisationAnonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (+112.5%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (+72.5%)
json2python-modelsGenerate Python model classes (pydantic, attrs, dataclasses) based on JSON datasets with typing module support
Stars: ✭ 119 (+197.5%)
rs datasetsTool for autodownloading recommendation systems datasets
Stars: ✭ 22 (-45%)
DiscEvalDiscourse Based Evaluation of Language Understanding
Stars: ✭ 18 (-55%)
dw-jdbcJDBC driver for data.world
Stars: ✭ 17 (-57.5%)
parlitoolsA collection of useful tools for UK politics
Stars: ✭ 22 (-45%)
biomechanics datasetInformation of public available data sets for biomechanics.
Stars: ✭ 31 (-22.5%)
bumblebee🚕 A spreadsheet-like data preparation web app that works over Optimus (Pandas, Dask, cuDF, Dask-cuDF, Spark and Vaex)
Stars: ✭ 120 (+200%)
CHRSIXray : A Large-scale Security Inspection X-ray Benchmark in CVPR 2019
Stars: ✭ 78 (+95%)
datasetsThe primary repository for all of the CORGIS Datasets
Stars: ✭ 19 (-52.5%)
dagpiDagpi is a powerful and fast api that does image manipulation as well as serves datasets. It is fast and written in rust and python. Perfect for discord bots, social media apps, camera apps and more.
Stars: ✭ 25 (-37.5%)
mindsdb-examplesExamples for usage of Mindsdb https://www.mindsdb.com/
Stars: ✭ 25 (-37.5%)
metadatMeta-analytic datasets for R
Stars: ✭ 21 (-47.5%)
kaggledatasetsCollection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (+10%)
airy💬 Open source conversational platform to power conversations with an open source Live Chat, Messengers like Facebook Messenger, WhatsApp and more - 💎 UI from Inbox to dashboards - 🤖 Integrations to Conversational AI / NLP tools and standard enterprise software - ⚡ APIs, WebSocket, Webhook - 🔧 Create any conversational experience
Stars: ✭ 299 (+647.5%)
spacy hunspell✏️ Hunspell extension for spaCy 2.0.
Stars: ✭ 94 (+135%)
multi-task-defocus-deblurring-dual-pixel-nimatReference github repository for the paper "Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning". We propose a single-image deblurring network that incorporates the two sub-aperture views into a multitask framework. Specifically, we show that jointly learning to predict the two DP views from a single …
Stars: ✭ 29 (-27.5%)
SkillNERA (smart) rule based NLP module to extract job skills from text
Stars: ✭ 69 (+72.5%)
DaCyDaCy: The State of the Art Danish NLP pipeline using SpaCy
Stars: ✭ 66 (+65%)
torchgeoTorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
Stars: ✭ 1,125 (+2712.5%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (+12.5%)
bert-tensorflow-pytorch-spacy-conversionInstructions for how to convert a BERT Tensorflow model to work with HuggingFace's pytorch-transformers, and spaCy. This walk-through uses DeepPavlov's RuBERT as example.
Stars: ✭ 26 (-35%)
spacy readabilityspaCy pipeline component for adding text readability meta data to Doc objects.
Stars: ✭ 54 (+35%)
deplacyCUI-based Tree Visualizer for Universal Dependencies and Immediate Catena Analysis
Stars: ✭ 97 (+142.5%)
datasets🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Stars: ✭ 13,870 (+34575%)
nlp workshop odsc europe20Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Stars: ✭ 127 (+217.5%)
datumaroDataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
Stars: ✭ 274 (+585%)
Quora QuestionPairs DLKaggle Competition: Using deep learning to solve quora's question pairs problem
Stars: ✭ 54 (+35%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+212.5%)
mlxMachine Learning eXchange (MLX). Data and AI Assets Catalog and Execution Engine
Stars: ✭ 132 (+230%)
NLP PEMDCNLP Predtrained Embeddings, Models and Datasets Collections(NLP_PEMDC). The collection will keep updating.
Stars: ✭ 58 (+45%)
NLP QuickbookNLP in Python with Deep Learning
Stars: ✭ 516 (+1190%)
Dataset-Sentimen-Analisis-Bahasa-IndonesiaRepositori ini merupakan kumpulan dataset terkait analisis sentimen Berbahasa Indonesia. Apabila Anda menggunakan dataset-dataset yang ada pada repositori ini untuk penelitian, maka cantumkanlah/kutiplah jurnal artikel terkait dataset tersebut. Dataset yang tersedia telah diimplementasikan dalam beberapa penelitian dan hasilnya telah dipublikasi…
Stars: ✭ 38 (-5%)
rita-dslA Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format
Stars: ✭ 60 (+50%)
lingNatural Language Processing Toolkit in Golang
Stars: ✭ 57 (+42.5%)
dh-coreFunctional data science
Stars: ✭ 123 (+207.5%)
agile🌌 Global State and Logic Library for JavaScript/Typescript applications
Stars: ✭ 90 (+125%)
humanflow2Official repository of Learning Multi-Human Optical Flow (IJCV 2019)
Stars: ✭ 37 (-7.5%)
anonymization-apiHow to build and deploy an anonymization API with FastAPI
Stars: ✭ 51 (+27.5%)
datasetdataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections
Stars: ✭ 21 (-47.5%)
pynsettA programmable relation extraction tool
Stars: ✭ 25 (-37.5%)
newtNatural World Tasks
Stars: ✭ 24 (-40%)
spectrochempySpectroChemPy is a framework for processing, analyzing and modeling spectroscopic data for chemistry with Python
Stars: ✭ 34 (-15%)