All Projects → Pycantonese → Similar Projects or Alternatives

755 Open source projects that are alternatives of or similar to Pycantonese

Weixin public corpus
微信公众号语料库
Stars: ✭ 465 (+216.33%)
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+189.8%)
Youtokentome
Unsupervised text tokenizer focused on computational efficiency
Stars: ✭ 728 (+395.24%)
Sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
Stars: ✭ 5,540 (+3668.71%)
Vncorenlp
A Vietnamese natural language processing toolkit (NAACL 2018)
Stars: ✭ 354 (+140.82%)
Pythainlp
Thai Natural Language Processing in Python.
Stars: ✭ 582 (+295.92%)
Nltk data
NLTK Data
Stars: ✭ 675 (+359.18%)
Toiro
A comparison tool of Japanese tokenizers
Stars: ✭ 95 (-35.37%)
Tokenizer
Fast and customizable text tokenization library with BPE and SentencePiece support
Stars: ✭ 132 (-10.2%)
Paper Survey
📚Survey of previous research and related works on machine learning (especially Deep Learning) in Japanese
Stars: ✭ 140 (-4.76%)
Persian Stopwords
Persian (Farsi) Stop Words List
Stars: ✭ 131 (-10.88%)
Cocoaai
🤖 The Cocoa Artificial Intelligence Lab
Stars: ✭ 134 (-8.84%)
Tod Bert
Pre-Trained Models for ToD-BERT
Stars: ✭ 143 (-2.72%)
Scattertext Pydata
Notebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (-10.2%)
Googlelanguager
R client for the Google Translation API, Google Cloud Natural Language API and Google Cloud Speech API
Stars: ✭ 145 (-1.36%)
Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1170.75%)
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (-11.56%)
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-11.56%)
Medquad
Medical Question Answering Dataset of 47,457 QA pairs created from 12 NIH websites
Stars: ✭ 129 (-12.24%)
Fxdesktopsearch
A JavaFX based desktop search application.
Stars: ✭ 147 (+0%)
Ai Job Info
互联网大厂面试经验
Stars: ✭ 145 (-1.36%)
Learn To Select Data
Code for Learning to select data for transfer learning with Bayesian Optimization
Stars: ✭ 140 (-4.76%)
Deep Lyrics
Lyrics Generator aka Character-level Language Modeling with Multi-layer LSTM Recurrent Neural Network
Stars: ✭ 127 (-13.61%)
Neuraldialog Larl
PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 127 (-13.61%)
Deeplearning.ai
Stars: ✭ 139 (-5.44%)
100 Days Of Nlp
Stars: ✭ 125 (-14.97%)
Mams For Absa
A Multi-Aspect Multi-Sentiment Dataset for aspect-based sentiment analysis.
Stars: ✭ 135 (-8.16%)
Neusum
Code for the ACL 2018 paper "Neural Document Summarization by Jointly Learning to Score and Select Sentences"
Stars: ✭ 143 (-2.72%)
Zamia Ai
Free and open source A.I. system based on Python, TensorFlow and Prolog.
Stars: ✭ 133 (-9.52%)
Nl2sql
阿里天池首届中文NL2SQL挑战赛top6
Stars: ✭ 146 (-0.68%)
Awesome Ai Services
An overview of the AI-as-a-service landscape
Stars: ✭ 133 (-9.52%)
Stanza Old
Stanford NLP group's shared Python tools.
Stars: ✭ 142 (-3.4%)
Uda
Unsupervised Data Augmentation (UDA)
Stars: ✭ 1,877 (+1176.87%)
Words counted
A Ruby natural language processor.
Stars: ✭ 146 (-0.68%)
Tensorflow 1.4 Billion Password Analysis
Deep Learning model to analyze a large corpus of clear text passwords.
Stars: ✭ 1,720 (+1070.07%)
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+1778.23%)
Textacy
NLP, before and after spaCy
Stars: ✭ 1,849 (+1157.82%)
Scientific Paper Summarisation
Machine learning models to automatically summarise scientific papers
Stars: ✭ 145 (-1.36%)
Chars2vec
Character-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-11.56%)
Lexpredict Contraxsuite
LexPredict ContraxSuite
Stars: ✭ 140 (-4.76%)
Rasa Chatbot Templates
RASA chatbot use case boilerplate
Stars: ✭ 127 (-13.61%)
Tree Transformer
Implementation of the paper Tree Transformer
Stars: ✭ 148 (+0.68%)
Corpuscrawler
Crawler for linguistic corpora
Stars: ✭ 127 (-13.61%)
Mutual labels:  linguistics
Ipa Dict
Monolingual wordlists with pronunciation information in IPA
Stars: ✭ 139 (-5.44%)
Mutual labels:  linguistics
Neuro
🔮 Neuro.js is machine learning library for building AI assistants and chat-bots (WIP).
Stars: ✭ 126 (-14.29%)
Awesome Nlp Resources
This repository contains landmark research papers in Natural Language Processing that came out in this century.
Stars: ✭ 145 (-1.36%)
Prosody
Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text
Stars: ✭ 139 (-5.44%)
Keita
My personal toolkit for PyTorch development.
Stars: ✭ 124 (-15.65%)
Aws Machine Learning University Accelerated Nlp
Machine Learning University: Accelerated Natural Language Processing Class
Stars: ✭ 1,695 (+1053.06%)
Spacy Dev Resources
💫 Scripts, tools and resources for developing spaCy
Stars: ✭ 123 (-16.33%)
Char Cnn Text Classification Pytorch
Character-level Convolutional Neural Networks for text classification in PyTorch
Stars: ✭ 147 (+0%)
Absapapers
Worth-reading papers and related awesome resources on aspect-based sentiment analysis (ABSA). 值得一读的方面级情感分析论文与相关资源集合
Stars: ✭ 142 (-3.4%)
Kaggle Crowdflower
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
Stars: ✭ 1,708 (+1061.9%)
Fnc 1 Baseline
A baseline implementation for FNC-1
Stars: ✭ 123 (-16.33%)
Awesome Hungarian Nlp
A curated list of NLP resources for Hungarian
Stars: ✭ 121 (-17.69%)
Ncrfpp
NCRF++, a Neural Sequence Labeling Toolkit. Easy use to any sequence labeling tasks (e.g. NER, POS, Segmentation). It includes character LSTM/CNN, word LSTM/CNN and softmax/CRF components.
Stars: ✭ 1,767 (+1102.04%)
Clicr
Machine reading comprehension on clinical case reports
Stars: ✭ 123 (-16.33%)
Spacy Js
🎀 JavaScript API for spaCy with Python REST API
Stars: ✭ 123 (-16.33%)
Multihead Siamese Nets
Implementation of Siamese Neural Networks built upon multihead attention mechanism for text semantic similarity task.
Stars: ✭ 144 (-2.04%)
Sluice Networks
Code for Sluice networks: Learning what to share between loosely related tasks
Stars: ✭ 135 (-8.16%)
1-60 of 755 similar projects