Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-81.87%)
pytorch-translmAn implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-96.78%)
PadatiousA neural network intent parser
Stars: ✭ 124 (-81.87%)
lidtkLanguage Identification Toolkit
Stars: ✭ 17 (-97.51%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (-50.15%)
lidboxEnd-to-end spoken language identification out of the box.
Stars: ✭ 39 (-94.3%)
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (-96.05%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (-94.15%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (-81.73%)
aprenda-pythonAprendizado, dicas e projetos sobre Python
Stars: ✭ 22 (-96.78%)
parallel-corpora-toolsTools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (-94.88%)
elastic transformersMaking BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Stars: ✭ 153 (-77.63%)
goclassyAn asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.
Stars: ✭ 81 (-88.16%)
mlconjug3A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-93.13%)
IndRNN pytorchIndependently Recurrent Neural Networks (IndRNN) implemented in pytorch.
Stars: ✭ 112 (-83.63%)
textstatRuby gem to calculate statistics from text to determine readability, complexity and grade level of a particular corpus.
Stars: ✭ 25 (-96.35%)
hama-py🦛 파이썬 한글 처리 라이브러리. Python Korean Morphological Analyzer
Stars: ✭ 16 (-97.66%)
embeddingsEmbeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Stars: ✭ 27 (-96.05%)
Conditional-SeqGAN-TensorflowConditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow
Stars: ✭ 47 (-93.13%)
vi-rsVietnamese Input Method library
Stars: ✭ 69 (-89.91%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-94.44%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (-93.42%)
datastories-semeval2017-task6Deep-learning model presented in "DataStories at SemEval-2017 Task 6: Siamese LSTM with Attention for Humorous Text Comparison".
Stars: ✭ 20 (-97.08%)
vlainic.github.ioMy GitHub blog: things you might be interested, and probably not...
Stars: ✭ 26 (-96.2%)
CVAE DialCVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity"
Stars: ✭ 16 (-97.66%)
vnlaCode accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
Stars: ✭ 60 (-91.23%)
cinjeA Pythonic and ultra fast template engine DSL.
Stars: ✭ 26 (-96.2%)
nlcliNatural language interface for the command line.
Stars: ✭ 21 (-96.93%)
TextrudeCode generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (-88.45%)
learn perl onelinersExample based guide for text processing with perl from the command line
Stars: ✭ 63 (-90.79%)
Quora question pairs NLP KaggleQuora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training
Stars: ✭ 17 (-97.51%)
referit3dCode accompanying our ECCV-2020 paper on 3D Neural Listeners.
Stars: ✭ 59 (-91.37%)
brand-sentiment-analysisScripts utilizing Heartex platform to build brand sentiment analysis from the news
Stars: ✭ 21 (-96.93%)
s3-utilsUtilities and tools based around Amazon S3 to provide convenience APIs in a CLI
Stars: ✭ 45 (-93.42%)
SuffixTreeOptimized implementation of suffix tree in python using Ukkonen's algorithm.
Stars: ✭ 38 (-94.44%)
python-mecabA repository to bind mecab for Python 3.5+. Not using swig nor pybind. (Not Maintained Now)
Stars: ✭ 27 (-96.05%)
phrase-at-scaleDetect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in languages other than English
Stars: ✭ 115 (-83.19%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (-93.27%)
TextFeatureSelectionPython library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (-93.86%)
Naive-Bayes-Evening-WorkshopCompanion code for Introduction to Python for Data Science: Coding the Naive Bayes Algorithm evening workshop
Stars: ✭ 23 (-96.64%)
Hutoma-Conversational-AI-PlatformHu:toma AI is an open source stack designed to help you create compelling conversational interfaces with little effort and above industry accuracy
Stars: ✭ 35 (-94.88%)
Multi-Type-TD-TSRExtracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Stars: ✭ 174 (-74.56%)
finglishA Finglish to Persian converter.
Stars: ✭ 60 (-91.23%)
fuzzychineseA small package to fuzzy match chinese words
Stars: ✭ 50 (-92.69%)
Emotion-recognition-from-tweetsA comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.
Stars: ✭ 17 (-97.51%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-94.44%)
frogFrog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (-89.77%)
group-transformerOfficial code for Group-Transformer (Scale down Transformer by Grouping Features for a Lightweight Character-level Language Model, COLING-2020).
Stars: ✭ 21 (-96.93%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (-86.7%)