All Projects → Tokenizers → Similar Projects or Alternatives

881 Open source projects that are alternatives of or similar to Tokenizers

civicmine
Text mining cancer biomarkers for the CIVIC database
Stars: ✭ 19 (-88.2%)
Mutual labels:  text-mining
Awesome r githubers
This is a list of R developers and advocates on Github. This is to help new comers create a following list.
Stars: ✭ 85 (-47.2%)
Mutual labels:  rstats
geojson
GeoJSON classes for R
Stars: ✭ 32 (-80.12%)
Mutual labels:  r-package
Autophrase
AutoPhrase: Automated Phrase Mining from Massive Text Corpora
Stars: ✭ 835 (+418.63%)
Mutual labels:  text-mining
Megamark
😻 Markdown with easy tokenization, a fast highlighter, and a lean HTML sanitizer
Stars: ✭ 100 (-37.89%)
Mutual labels:  tokenizer
Sharpmath
A small .NET math library.
Stars: ✭ 36 (-77.64%)
Mutual labels:  tokenizer
reportfactory
Lightweight infrastructure to handle multiple rmarkdown reports
Stars: ✭ 68 (-57.76%)
Mutual labels:  r-package
LandR
Landscape Ecosystem Modelling in R
Stars: ✭ 14 (-91.3%)
Mutual labels:  r-package
Brms
brms R package for Bayesian generalized multivariate non-linear multilevel models using Stan
Stars: ✭ 825 (+412.42%)
Mutual labels:  r-package
Gsoc2018 3gm
💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (-77.64%)
Mutual labels:  text-mining
regista
An R package for soccer modelling
Stars: ✭ 71 (-55.9%)
Mutual labels:  rstats
flipper
Make it easy to flip through R packages from CRAN, Bioconductor, and GitHub
Stars: ✭ 13 (-91.93%)
Mutual labels:  r-package
modeltime.gluonts
GluonTS Deep Learning with Modeltime
Stars: ✭ 31 (-80.75%)
Mutual labels:  r-package
Docxtractr
✂️ Extract Tables from Microsoft Word Documents with R
Stars: ✭ 139 (-13.66%)
Mutual labels:  rstats
Adjutant
Runs a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (-63.35%)
Mutual labels:  text-mining
Nlp In Practice
Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.
Stars: ✭ 790 (+390.68%)
Mutual labels:  text-mining
inline
Inline C, C++ or Fortran functions in R
Stars: ✭ 33 (-79.5%)
Mutual labels:  r-package
Fasterize
High performance raster conversion for modern spatial data 🚀🌏▦
Stars: ✭ 146 (-9.32%)
Mutual labels:  rstats
Works For Me
Collection of developer toolkits
Stars: ✭ 131 (-18.63%)
Mutual labels:  tokenizer
Text predictor
Char-level RNN LSTM text generator📄.
Stars: ✭ 99 (-38.51%)
Mutual labels:  text-mining
Mactheknife
🦈 Various ‘macOS’-oriented Tools and Utilities in R
Stars: ✭ 36 (-77.64%)
Mutual labels:  rstats
Hebrew-Tokenizer
A very simple python tokenizer for Hebrew text.
Stars: ✭ 16 (-90.06%)
Mutual labels:  tokenizer
phsmethods
An R package to standardise methods used in Public Health Scotland (https://public-health-scotland.github.io/phsmethods/)
Stars: ✭ 43 (-73.29%)
Mutual labels:  r-package
Sparklyr
R interface for Apache Spark
Stars: ✭ 775 (+381.37%)
Mutual labels:  rstats
awspack
Amazon Web Services Bundle Package
Stars: ✭ 14 (-91.3%)
Mutual labels:  r-package
Syntok
Text tokenization and sentence segmentation (segtok v2)
Stars: ✭ 123 (-23.6%)
Mutual labels:  tokenizer
vscode-blockman
VSCode extension to highlight nested code blocks
Stars: ✭ 233 (+44.72%)
Mutual labels:  tokenizer
Text2vec
Fast vectorization, topic modeling, distances and GloVe word embeddings in R.
Stars: ✭ 715 (+344.1%)
Mutual labels:  text-mining
realtime
No description or website provided.
Stars: ✭ 15 (-90.68%)
Mutual labels:  r-package
Hippo
PHP standards checker.
Stars: ✭ 82 (-49.07%)
Mutual labels:  tokenizer
thrones2vec
Using Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").
Stars: ✭ 27 (-83.23%)
Mutual labels:  text-mining
Ggthemr
Themes for ggplot2.
Stars: ✭ 697 (+332.92%)
Mutual labels:  rstats
rodev
⛔ ARCHIVED ⛔ Helper for rOpenSci Package Developpers
Stars: ✭ 24 (-85.09%)
Mutual labels:  r-package
Ggforce
Accelerating ggplot2
Stars: ✭ 640 (+297.52%)
Mutual labels:  rstats
converse
Conversational text Analysis using various NLP techniques
Stars: ✭ 147 (-8.7%)
Mutual labels:  text-mining
Sentence Splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-49.07%)
Mutual labels:  tokenizer
BAS
BAS R package https://merliseclyde.github.io/BAS/
Stars: ✭ 36 (-77.64%)
Mutual labels:  r-package
reproducible
A set of tools for R that enhance reproducibility beyond package management
Stars: ✭ 33 (-79.5%)
Mutual labels:  r-package
linguisticsdown
Easy Linguistics Document Writing with R Markdown
Stars: ✭ 24 (-85.09%)
Mutual labels:  r-package
Engsoccerdata
English and European soccer results 1871-2020
Stars: ✭ 615 (+281.99%)
Mutual labels:  rstats
tsmp
R Functions implementing UCR Matrix Profile Algorithm
Stars: ✭ 63 (-60.87%)
Mutual labels:  r-package
Darksky
☁️ R interface to the Dark Sky API [APPLE IS SHUTTING DOWN THE API 2021-12-31]
Stars: ✭ 81 (-49.69%)
Mutual labels:  rstats
neji
Flexible and powerful platform for biomedical information extraction from text
Stars: ✭ 37 (-77.02%)
Mutual labels:  text-mining
Efficientr
Efficient R programming: a book
Stars: ✭ 616 (+282.61%)
Mutual labels:  rstats
rosette-elasticsearch-plugin
Document Enrichment plugin for Elasticsearch
Stars: ✭ 25 (-84.47%)
Mutual labels:  text-mining
Epinow2
Estimate Realtime Case Counts and Time-varying Epidemiological Parameters
Stars: ✭ 36 (-77.64%)
Mutual labels:  rstats
cattonum
Encode Categorical Features
Stars: ✭ 31 (-80.75%)
Mutual labels:  rstats
tf-idf-python
Term frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (-39.13%)
Mutual labels:  text-mining
Soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+280.75%)
Mutual labels:  tokenizer
gm
R Package for Music Score and Audio Generation
Stars: ✭ 116 (-27.95%)
Mutual labels:  r-package
Markovchain
Easy Handling Discrete Time Markov Chains
Stars: ✭ 80 (-50.31%)
Mutual labels:  r-package
Shinycustomloader
Add a custom loader for R shiny
Stars: ✭ 97 (-39.75%)
Mutual labels:  rstats
ctv
CRAN Task View Initiative
Stars: ✭ 17 (-89.44%)
Mutual labels:  rstats
cang-jie
Chinese tokenizer for tantivy, based on jieba-rs
Stars: ✭ 48 (-70.19%)
Mutual labels:  tokenizer
rchess
♛ Chess package for R
Stars: ✭ 68 (-57.76%)
Mutual labels:  rstats
Worldtilegrid
🔲🗺 World Tile Grid Geom for ggplot2 [WIP]
Stars: ✭ 35 (-78.26%)
Mutual labels:  rstats
powerlmm
powerlmm R package for power calculations for two- and three-level longitudinal multilevel/linear mixed models.
Stars: ✭ 86 (-46.58%)
Mutual labels:  r-package
Cicerone
🏛️ Give tours of your Shiny apps
Stars: ✭ 131 (-18.63%)
Mutual labels:  rstats
Javascript For R
JavaScript for R CRC Book
Stars: ✭ 98 (-39.13%)
Mutual labels:  rstats
Spades
R package for developing and running Spatial Discrete Event Simulation models
Stars: ✭ 34 (-78.88%)
Mutual labels:  r-package
601-660 of 881 similar projects