1057 open source projects that are alternatives to or similar to Tokenizer

Scanrefer
[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language
Stars: ✭ 84 (-36.36%)
Freeml
A List of Data Science/Machine Learning Resources (Mostly Free)
Stars: ✭ 974 (+637.88%)
Ratel
RAT-el is an open source penetration test tool that allows you to take control of a Windows machine. It works on the client-server model: the server sends commands, and the client executes them and sends the results back to the server. The client is completely undetectable by anti-virus software.
Stars: ✭ 121 (-8.33%)
Mutual labels:  unicode
Conversational Ai
Conversational AI Reading Materials
Stars: ✭ 34 (-74.24%)
U2c
Unicode To Chinese (U2C): a Burp Suite extender plugin that converts Unicode-encoded text to Chinese
Stars: ✭ 83 (-37.12%)
Mutual labels:  unicode
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-75%)
Transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
Stars: ✭ 55,742 (+42128.79%)
Pqg Pytorch
Paraphrase Generation model using pair-wise discriminator loss
Stars: ✭ 33 (-75%)
Sentence Splitter
Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-37.88%)
Mutual labels:  tokenizer
Wikisql
A large annotated semantic parsing corpus for developing natural language interfaces.
Stars: ✭ 965 (+631.06%)
Persian Stopwords
Persian (Farsi) Stop Words List
Stars: ✭ 131 (-0.76%)
Easy Deep Learning With Allennlp
🔮Deep Learning for text made easy with AllenNLP
Stars: ✭ 32 (-75.76%)
Unicode
Unicode normalization library. (Mirror of Yoshida-san's code base to maintain the RubyGem.)
Stars: ✭ 81 (-38.64%)
Mutual labels:  unicode
Unidump
hexdump(1) for Unicode data
Stars: ✭ 31 (-76.52%)
Mutual labels:  unicode
Nltk
NLTK Source
Stars: ✭ 10,309 (+7709.85%)
Punny captions
An implementation of the NAACL 2018 paper "Punny Captions: Witty Wordplay in Image Descriptions".
Stars: ✭ 31 (-76.52%)
Lehar
Visualize data using relative ordering
Stars: ✭ 81 (-38.64%)
Mutual labels:  unicode
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-77.27%)
Mutual labels:  tokenizer
Nlpcc Wordseg Weibo
NLPCC 2016 Weibo word segmentation evaluation project
Stars: ✭ 120 (-9.09%)
Tensorflow In Practice Specialization
DeepLearning.AI TensorFlow Developer Professional Certificate Specialization
Stars: ✭ 29 (-78.03%)
Mimic Code
MIMIC Code Repository: Code shared by the research community for the MIMIC-III database
Stars: ✭ 1,225 (+828.03%)
Mutual labels:  icu
Harfbuzz Icu Freetype
Harfbuzz with a CMake build configuration using Freetype2, UCDN and ICU
Stars: ✭ 28 (-78.79%)
Mutual labels:  icu
Chatbot
Russian-language chatbot
Stars: ✭ 106 (-19.7%)
Webfactoryicutranslationbundle
Enables ICU message formatting for translations in Symfony applications.
Stars: ✭ 27 (-79.55%)
Mutual labels:  icu
Date Time Format Timezone
Surgically polyfills timezone support in Intl.DateTimeFormat API
Stars: ✭ 94 (-28.79%)
Mutual labels:  icu
Research papers
A record of papers I have read and the notes I have taken on them, plus some reading lists of notable papers and academic blog posts.
Stars: ✭ 55 (-58.33%)
Php Confusable Homoglyphs
A PHP port of https://github.com/vhf/confusable_homoglyphs
Stars: ✭ 27 (-79.55%)
Mutual labels:  unicode
Cluedatasetsearch
Search all Chinese NLP datasets, with commonly used English NLP datasets included
Stars: ✭ 2,112 (+1500%)
Mutual labels:  machine-translation
Rex
REx: Relation Extraction. Modernized re-write of the code in the master's thesis: "Relation Extraction using Distant Supervision, SVMs, and Probabalistic First-Order Logic"
Stars: ✭ 21 (-84.09%)
Transformers without tears
Transformers without Tears: Improving the Normalization of Self-Attention
Stars: ✭ 80 (-39.39%)
Mutual labels:  machine-translation
Witwicky
Witwicky: An implementation of Transformer in PyTorch.
Stars: ✭ 21 (-84.09%)
Mutual labels:  machine-translation
Ios ml
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
Stars: ✭ 1,409 (+967.42%)
Named Entity Recognition
Named entity recognition with a recurrent neural network (RNN) in TensorFlow
Stars: ✭ 20 (-84.85%)
Ucdn
Unicode Database and Normalization
Stars: ✭ 78 (-40.91%)
Mutual labels:  unicode
Events
Repository for *SEM Paper on Event Coreference Resolution in ECB+
Stars: ✭ 20 (-84.85%)
Discobert
Code for paper "Discourse-Aware Neural Extractive Text Summarization" (ACL20)
Stars: ✭ 120 (-9.09%)
Unicode 9.0.0
JavaScript-compatible Unicode data. Arrays of code points, arrays of symbols, and regular expressions for Unicode v9.0.0’s categories, scripts, blocks, bidi, and other properties.
Stars: ✭ 15 (-88.64%)
Mutual labels:  unicode
Text Dependency Parser
🏄 Dependency parsing, NLP, natural language processing
Stars: ✭ 78 (-40.91%)
Utf8.h
📚 single header utf8 string functions for C and C++
Stars: ✭ 875 (+562.88%)
Mutual labels:  unicode
Textaugmentation Gpt2
Fine-tuned pre-trained GPT2 for custom topic-specific text generation. Such a system can be used for text augmentation.
Stars: ✭ 104 (-21.21%)
Crx Jtrans
jTransliter - a Roman-to-Unicode transliterator as a Google Chrome extension
Stars: ✭ 13 (-90.15%)
Mutual labels:  unicode
Multimodal Toolkit
Multimodal model for text and tabular data, with HuggingFace transformers as the building block for text data
Stars: ✭ 78 (-40.91%)
Node Api.ai
[DEPRECATED] Ultimate Node.JS SDK for api.ai
Stars: ✭ 12 (-90.91%)
Textacy
NLP, before and after spaCy
Stars: ✭ 1,849 (+1300.76%)
Language
Shared repository for open-sourced projects from the Google AI Language team.
Stars: ✭ 860 (+551.52%)
Abigsurvey
A collection of 500+ survey papers on Natural Language Processing (NLP) and Machine Learning (ML)
Stars: ✭ 1,203 (+811.36%)
Pke
Python Keyphrase Extraction module
Stars: ✭ 855 (+547.73%)
Magnitude
A fast, efficient universal vector embedding utility package.
Stars: ✭ 1,394 (+956.06%)
Spago
Self-contained Machine Learning and Natural Language Processing library in Go
Stars: ✭ 854 (+546.97%)
Monkeylearn Ruby
Official Ruby client for the MonkeyLearn API. Build and consume machine learning models for language processing from your Ruby apps.
Stars: ✭ 76 (-42.42%)
Syntree2vec
An algorithm to augment syntactic hierarchy into word embeddings
Stars: ✭ 9 (-93.18%)
Tokenizer
Source code tokenizer
Stars: ✭ 119 (-9.85%)
Mutual labels:  tokenizer
Demos
Some JavaScript works published as demos, mostly ML or DS
Stars: ✭ 55 (-58.33%)
Tensorflow Nlp
NLP and Text Generation Experiments in TensorFlow 2.x / 1.x
Stars: ✭ 1,487 (+1026.52%)
Pytreebank
😡😇 Stanford Sentiment Treebank loader in Python
Stars: ✭ 93 (-29.55%)
Coarij
Corpus of Annual Reports in Japan
Stars: ✭ 55 (-58.33%)
Vietnamese Electra
Electra model pre-trained on a Vietnamese corpus
Stars: ✭ 55 (-58.33%)
Multitask sentiment analysis
Multitask Deep Learning for Sentiment Analysis using Character-Level Language Model, Bi-LSTMs for POS Tag, Chunking and Unsupervised Dependency Parsing. Inspired by this great article https://arxiv.org/abs/1611.01587
Stars: ✭ 93 (-29.55%)
Python Myanmar
Python library for Myanmar text processing
Stars: ✭ 53 (-59.85%)
Mutual labels:  unicode
Emotion Detector
Python code to detect emotions from text
Stars: ✭ 54 (-59.09%)