All Projects → Tokenizers → Similar Projects or Alternatives

881 Open source projects that are alternatives of or similar to Tokenizers

Kate
Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"
Stars: ✭ 135 (-16.15%)
Mutual labels:  text-mining
Clipr
R functions for reading and writing from the system clipboard
Stars: ✭ 112 (-30.43%)
Mutual labels:  rstats
Sigmajs
Σ sigma.js for R
Stars: ✭ 58 (-63.98%)
Mutual labels:  rstats
Docxtractr
✂️ Extract Tables from Microsoft Word Documents with R
Stars: ✭ 139 (-13.66%)
Mutual labels:  rstats
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+390.68%)
Mutual labels:  text-mining
Drake Examples
Example workflows for the drake R package
Stars: ✭ 57 (-64.6%)
Mutual labels:  rstats
Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (+381.37%)
Mutual labels:  rstats
Syntok
Text tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-23.6%)
Mutual labels:  tokenizer
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+344.1%)
Mutual labels:  text-mining
Hippo
PHP standards checker.
Stars: ✭ 82 (-49.07%)
Mutual labels:  tokenizer
D3r
d3.js helpers for R
Stars: ✭ 133 (-17.39%)
Mutual labels:  rstats
Pipeit
PipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-64.6%)
Mutual labels:  text-mining
Ggforce
Accelerating ggplot2
Stars: ✭ 640 (+297.52%)
Mutual labels:  rstats
Sentence Splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-49.07%)
Mutual labels:  tokenizer
Rtimes
R wrapper for NYTimes API for government data - ABANDONED
Stars: ✭ 55 (-65.84%)
Mutual labels:  rstats
Engsoccerdata
English and European soccer results 1871-2020
Stars: ✭ 615 (+281.99%)
Mutual labels:  rstats
Darksky
☁️ R interface to the Dark Sky API [APPLE IS SHUTTING DOWN THE API 2021-12-31]
Stars: ✭ 81 (-49.69%)
Mutual labels:  rstats
Efficientr
Efficient R programming: a book
Stars: ✭ 616 (+282.61%)
Mutual labels:  rstats
Genius
Easily access song lyrics from Genius in a tibble.
Stars: ✭ 111 (-31.06%)
Mutual labels:  text-mining
Ngram
Fast n-Gram Tokenization
Stars: ✭ 55 (-65.84%)
Mutual labels:  text-mining
Soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+280.75%)
Mutual labels:  tokenizer
Markovchain
Easy Handling Discrete Time Markov Chains
Stars: ✭ 80 (-50.31%)
Mutual labels:  r-package
Tinytex
A lightweight, cross-platform, portable, and easy-to-maintain LaTeX distribution based on TeX Live
Stars: ✭ 584 (+262.73%)
Mutual labels:  r-package
Japanesetokenizers
aim to use JapaneseTokenizer as easy as possible
Stars: ✭ 120 (-25.47%)
Mutual labels:  tokenizer
Awesome Blogdown
An awesome curated list of blogs built using blogdown
Stars: ✭ 80 (-50.31%)
Mutual labels:  rstats
Decryptr
An extensible API for breaking captchas
Stars: ✭ 154 (-4.35%)
Mutual labels:  rstats
Colormap
R package to generate colors from a list of 44 pre-defined palettes
Stars: ✭ 55 (-65.84%)
Mutual labels:  rstats
Pinp
Pinp Is Not PNAS -- Two-Column PDF Template
Stars: ✭ 134 (-16.77%)
Mutual labels:  r-package
Countdown
⏲ countdown timer for R Markdown slides and HTML docs
Stars: ✭ 110 (-31.68%)
Mutual labels:  rstats
Mapscanner
R package to print maps, draw on them, and scan them back in
Stars: ✭ 55 (-65.84%)
Mutual labels:  r-package
Bigartm
Fast topic modeling platform
Stars: ✭ 563 (+249.69%)
Mutual labels:  text-mining
Projmgr
R-based project management tools
Stars: ✭ 78 (-51.55%)
Mutual labels:  r-package
Tidybayes
Bayesian analysis + tidy data + geoms (R package)
Stars: ✭ 557 (+245.96%)
Mutual labels:  r-package
Scattertext
Beautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+969.57%)
Mutual labels:  text-mining
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+244.1%)
Mutual labels:  tokenizer
Projpred
Projection predictive variable selection
Stars: ✭ 76 (-52.8%)
Mutual labels:  r-package
Paletteer
🎨🎨🎨 Collection of most color palettes in a single R package
Stars: ✭ 535 (+232.3%)
Mutual labels:  rstats
Lex
Replaced by foonathan/lexy
Stars: ✭ 137 (-14.91%)
Mutual labels:  tokenizer
Golem
A Framework for Building Robust Shiny Apps
Stars: ✭ 530 (+229.19%)
Mutual labels:  r-package
Darkstudio
darkstudio. A dark grey alternative to RStudio's default dark theme.
Stars: ✭ 75 (-53.42%)
Mutual labels:  rstats
Nlp Notebooks
A collection of notebooks for Natural Language Processing from NLP Town
Stars: ✭ 513 (+218.63%)
Mutual labels:  text-mining
Vegalite
R ggplot2 "bindings" for Vega-Lite
Stars: ✭ 157 (-2.48%)
Mutual labels:  rstats
Shinyapps links
A collection of Shiny applications (links shared on Twitter)
Stars: ✭ 109 (-32.3%)
Mutual labels:  rstats
Thot
Thot toolkit for statistical machine translation
Stars: ✭ 53 (-67.08%)
Mutual labels:  tokenizer
Hexsticker
✨ Hexagon sticker in R
Stars: ✭ 464 (+188.2%)
Mutual labels:  rstats
Emayili
An R package for sending email messages.
Stars: ✭ 72 (-55.28%)
Mutual labels:  rstats
Orangetext
🍊📄 : An #rstats project to keep track of The 🍊 One's speeches
Stars: ✭ 53 (-67.08%)
Mutual labels:  rstats
Datasciencer
a curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+972.67%)
Mutual labels:  text-mining
Anicon
Animated icons for R markdown and Shiny apps
Stars: ✭ 109 (-32.3%)
Mutual labels:  rstats
Ggeconodist
📉 Create Diminutive Distribution Charts
Stars: ✭ 53 (-67.08%)
Mutual labels:  rstats
Rolldown
R Markdown output formats for storytelling
Stars: ✭ 137 (-14.91%)
Mutual labels:  r-package
Kadot
Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-32.92%)
Mutual labels:  tokenizer
Euclid
Exact Computation Geometry Framework Based on 'CGAL'
Stars: ✭ 52 (-67.7%)
Mutual labels:  rstats
Dtupdate
The dtupdate package has functions that try to make it easier to keep up with the non-CRAN universe
Stars: ✭ 51 (-68.32%)
Mutual labels:  rstats
Xioc
Extract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (-8.07%)
Mutual labels:  text-mining
Rstudiothemes
A curated list of RStudio themes found on Github
Stars: ✭ 134 (-16.77%)
Mutual labels:  rstats
Pkgnet
R package for analyzing other R packages via graph representations of their dependencies
Stars: ✭ 107 (-33.54%)
Mutual labels:  r-package
Spark Nkp
Natural Korean Processor for Apache Spark
Stars: ✭ 50 (-68.94%)
Mutual labels:  text-mining
Rdoc
colourised R docs in the terminal
Stars: ✭ 49 (-69.57%)
Mutual labels:  rstats
Rlp
An Example of Using Literate Programming for R Package Development
Stars: ✭ 47 (-70.81%)
Mutual labels:  r-package
301-360 of 881 similar projects