
1057 open source projects that are alternatives to or similar to Tokenizer

Anago
Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition, Part-of-Speech Tagging and so on.
Stars: ✭ 1,392 (+954.55%)
Nlp Papers
Papers and Book to look at when starting NLP 📚
Stars: ✭ 111 (-15.91%)
Nlp Pretrained Model
A collection of Natural language processing pre-trained models.
Stars: ✭ 122 (-7.58%)
Awesome Emotion Recognition In Conversations
A comprehensive reading list for Emotion Recognition in Conversations
Stars: ✭ 111 (-15.91%)
Prenlp
Preprocessing Library for Natural Language Processing
Stars: ✭ 130 (-1.52%)
Xlnet extension tf
XLNet Extension in TensorFlow
Stars: ✭ 109 (-17.42%)
Cs230 Code Examples
Code examples in PyTorch and TensorFlow for CS230
Stars: ✭ 1,701 (+1188.64%)
Pymetamap
Python wrapper for MetaMap
Stars: ✭ 119 (-9.85%)
Metaknowledge
A Python library for doing bibliometric and network analysis in science and health policy research
Stars: ✭ 102 (-22.73%)
Neuraldialog Larl
PyTorch implementation of latent space reinforcement learning for E2E dialog published at NAACL 2019. It is released by Tiancheng Zhao (Tony) from Dialog Research Center, LTI, CMU
Stars: ✭ 127 (-3.79%)
Papernotes
My personal notes and surveys on DL, CV and NLP papers.
Stars: ✭ 108 (-18.18%)
Ratel
RAT-el is an open source penetration test tool that allows you to take control of a Windows machine. It works on the client-server model: the server sends commands, and the client executes them and sends the results back to the server. The client is completely undetectable by anti-virus software.
Stars: ✭ 121 (-8.33%)
Mutual labels:  unicode
Transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+42128.79%)
Persian Stopwords
Persian (Farsi) Stop Words List
Stars: ✭ 131 (-0.76%)
Nltk
NLTK Source
Stars: ✭ 10,309 (+7709.85%)
Nlpcc Wordseg Weibo
NLPCC 2016 Weibo word segmentation evaluation project
Stars: ✭ 120 (-9.09%)
Chatbot
Russian-language chatbot
Stars: ✭ 106 (-19.7%)
Cluedatasetsearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Stars: ✭ 2,112 (+1500%)
Mutual labels:  machine-translation
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+967.42%)
Discobert
Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)
Stars: ✭ 120 (-9.09%)
Textaugmentation Gpt2
Fine-tuned pre-trained GPT-2 for custom topic-specific text generation. Such a system can be used for text augmentation.
Stars: ✭ 104 (-21.21%)
Textacy
NLP, before and after spaCy
Stars: ✭ 1,849 (+1300.76%)
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+956.06%)
Tokenizer
Source code tokenizer
Stars: ✭ 119 (-9.85%)
Mutual labels:  tokenizer
Pytorchnlpbook
Code and data accompanying Natural Language Processing with PyTorch published by O'Reilly Media https://nlproc.info
Stars: ✭ 1,390 (+953.03%)
100 Days Of Nlp
Stars: ✭ 125 (-5.3%)
Repo 2016
R, Python and Mathematica Codes in Machine Learning, Deep Learning, Artificial Intelligence, NLP and Geolocation
Stars: ✭ 103 (-21.97%)
Pytextrank
Python implementation of TextRank for phrase extraction and summarization of text documents
Stars: ✭ 1,675 (+1168.94%)
Pynlp
A pythonic wrapper for Stanford CoreNLP.
Stars: ✭ 103 (-21.97%)
Scattertext Pydata
Notebooks for the Seattle PyData 2017 talk on Scattertext
Stars: ✭ 132 (+0%)
Works For Me
Collection of developer toolkits
Stars: ✭ 131 (-0.76%)
Mutual labels:  tokenizer
Konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
Stars: ✭ 130 (-1.52%)
Unibits
Visualize different Unicode encodings in the terminal
Stars: ✭ 125 (-5.3%)
Mutual labels:  unicode
Texting
[ACL 2020] TensorFlow implementation for "Every Document Owns Its Structure: Inductive Text Classification via Graph Neural Networks"
Stars: ✭ 103 (-21.97%)
Hybrid Fonts
Monospaced fonts patched with Chinese characters and extra glyphs from Nerd Fonts
Stars: ✭ 102 (-22.73%)
Mutual labels:  unicode
Codesearchnet
Datasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (+943.94%)
Full Icu Npm
>>> This may become obsolete, read and comment >>>
Stars: ✭ 117 (-11.36%)
Mutual labels:  icu
Eslint Plugin I18n Json
Fully extendable eslint plugin for JSON i18n translation files.
Stars: ✭ 101 (-23.48%)
Mutual labels:  icu
Megamark
😻 Markdown with easy tokenization, a fast highlighter, and a lean HTML sanitizer
Stars: ✭ 100 (-24.24%)
Mutual labels:  tokenizer
Keita
My personal toolkit for PyTorch development.
Stars: ✭ 124 (-6.06%)
Dynamic Coattention Network Plus
Dynamic Coattention Network Plus (DCN+) TensorFlow implementation. Question answering using Deep NLP.
Stars: ✭ 117 (-11.36%)
Atis.keras
Spoken Language Understanding (SLU) / Slot Filling in Keras
Stars: ✭ 100 (-24.24%)
D2l En
Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 300 universities from 55 countries including Stanford, MIT, Harvard, and Cambridge.
Stars: ✭ 11,837 (+8867.42%)
Box Cli Maker
Make Highly Customized Boxes for your CLI
Stars: ✭ 115 (-12.88%)
Mutual labels:  unicode
Clustype
Automatic Entity Recognition and Typing for Domain-Specific Corpora (KDD'15)
Stars: ✭ 99 (-25%)
Awesome Machine Learning
📖 List of some awesome university courses for Machine Learning! Feel free to contribute!
Stars: ✭ 99 (-25%)
Chars2vec
Character-based word embeddings model based on RNN for handling real world texts
Stars: ✭ 130 (-1.52%)
Chevrotain
Parser Building Toolkit for JavaScript
Stars: ✭ 1,795 (+1259.85%)
Mutual labels:  tokenizer
Flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
Stars: ✭ 11,065 (+8282.58%)
Plotille
Plot in the terminal using braille dots.
Stars: ✭ 99 (-25%)
Mutual labels:  unicode
Papers
Some computer vision papers I have read: image-to-text generation, weakly supervised segmentation, and so on
Stars: ✭ 99 (-25%)
Stanford Tensorflow Tutorials
This repository contains code examples for Stanford's course: TensorFlow for Deep Learning Research.
Stars: ✭ 10,098 (+7550%)
En Fr Mlt Tensorflow
English-French Machine Language Translation in TensorFlow
Stars: ✭ 99 (-25%)
Mutual labels:  machine-translation
Chinese nlu by using rasa nlu
Use RASA NLU to build a Chinese Natural Language Understanding System (NLU)
Stars: ✭ 99 (-25%)
Aws Machine Learning University Accelerated Nlp
Machine Learning University: Accelerated Natural Language Processing Class
Stars: ✭ 1,695 (+1184.09%)
Cheatsheet
Pretty cheat sheets, or "reference cards", obtainable from Org files.
Stars: ✭ 116 (-12.12%)
Mutual labels:  unicode
Unicode Display width
Monospace Unicode character width in Ruby
Stars: ✭ 98 (-25.76%)
Mutual labels:  unicode
Neuronblocks
NLP DNN Toolkit - Building Your NLP DNN Models Like Playing Lego
Stars: ✭ 1,356 (+927.27%)
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+1048.48%)
Open Semantic Entity Search Api
Open Source REST API for named entity extraction, named entity linking, named entity disambiguation, recommendation & reconciliation of entities like persons, organizations and places for (semi)automatic semantic tagging & analysis of documents by linked data knowledge graph like SKOS thesaurus, RDF ontology, database(s) or list(s) of names
Stars: ✭ 98 (-25.76%)