All Projects → Tokenizer → Similar Projects or Alternatives

298 Open source projects that are alternatives of or similar to Tokenizer

Somajo
A tokenizer and sentence splitter for German and English web and social media texts.
Stars: ✭ 85 (-28.57%)
Mutual labels:  tokenizer
Componette Site
➿ Addons, plugins, components and extensions (@componette ❤️ @nette)
Stars: ✭ 56 (-52.94%)
Mutual labels:  nette-framework
Guitar
A Cross-Platform String and Regular Expression Library written in Swift.
Stars: ✭ 641 (+438.66%)
Mutual labels:  regular-expression
Greynir
The greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-60.5%)
Mutual labels:  tokenizer
Cyberchef Recipes
A list of cyber-chef recipes and curated links
Stars: ✭ 619 (+420.17%)
Mutual labels:  regular-expression
Debugviewpp
DebugView++, collects, views, filters your application logs, and highlights information that is important to you!
Stars: ✭ 592 (+397.48%)
Mutual labels:  regular-expression
Regexr
For composing regular expressions without the need for double-escaping inside strings.
Stars: ✭ 53 (-55.46%)
Mutual labels:  regular-expression
Djurl
Simple yet helpful library for writing Django urls by an easy, short and intuitive way.
Stars: ✭ 85 (-28.57%)
Mutual labels:  tokenizer
Go Restructure
Match regular expressions into struct fields
Stars: ✭ 570 (+378.99%)
Mutual labels:  regular-expression
Eval Sql.net
SQL Eval Function | Dynamically Evaluate Expression in SQL Server using C# Syntax
Stars: ✭ 84 (-29.41%)
Mutual labels:  regular-expression
Talismane
NLP framework: sentence detector, tokeniser, pos-tagger and dependency parser
Stars: ✭ 38 (-68.07%)
Mutual labels:  tokenizer
Kagome
Self-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+365.55%)
Mutual labels:  tokenizer
Soynlp
한국어 자연어처리를 위한 파이썬 라이브러리입니다. 단어 추출/ 토크나이저 / 품사판별/ 전처리의 기능을 제공합니다.
Stars: ✭ 613 (+415.13%)
Mutual labels:  tokenizer
Py Nltools
A collection of basic python modules for spoken natural language processing
Stars: ✭ 46 (-61.34%)
Mutual labels:  tokenizer
Automa.jl
A julia code generator for regular expressions
Stars: ✭ 111 (-6.72%)
Mutual labels:  regular-expression
Sharpmath
A small .NET math library.
Stars: ✭ 36 (-69.75%)
Mutual labels:  tokenizer
Onigmo
Onigmo is a regular expressions library forked from Oniguruma.
Stars: ✭ 536 (+350.42%)
Mutual labels:  regular-expression
Tokenizer
A small library for converting tokenized PHP source code into XML (and potentially other formats)
Stars: ✭ 4,770 (+3908.4%)
Mutual labels:  tokenizer
Hippo
PHP standards checker.
Stars: ✭ 82 (-31.09%)
Mutual labels:  tokenizer
Rexrex
🦖 Composable JavaScript regular expressions
Stars: ✭ 34 (-71.43%)
Mutual labels:  regular-expression
Regulex
🚧 Regular Expression Excited!
Stars: ✭ 4,877 (+3998.32%)
Mutual labels:  regular-expression
Chinamobilephonenumberregex
Regular expressions that match the mobile phone number in mainland China. / 一组匹配中国大陆手机号码的正则表达式。
Stars: ✭ 4,440 (+3631.09%)
Mutual labels:  regular-expression
Nlp Js Tools French
POS Tagger, lemmatizer and stemmer for french language in javascript
Stars: ✭ 32 (-73.11%)
Mutual labels:  tokenizer
Open Korean Text
Open Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+268.07%)
Mutual labels:  tokenizer
Smoothnlp
专注于可解释的NLP技术 An NLP Toolset With A Focus on Explainable Inference
Stars: ✭ 435 (+265.55%)
Mutual labels:  tokenizer
Sentence Splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.
Stars: ✭ 82 (-31.09%)
Mutual labels:  tokenizer
Tempreites
One-file semantic DSL-free templates direto da roça for the browser and server.
Stars: ✭ 31 (-73.95%)
Mutual labels:  regular-expression
Ekphrasis
Ekphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+263.87%)
Mutual labels:  tokenizer
Moo
Optimised tokenizer/lexer generator! 🐄 Uses /y for performance. Moo.
Stars: ✭ 434 (+264.71%)
Mutual labels:  tokenizer
Omnicat Bayes
Naive Bayes text classification implementation as an OmniCat classifier strategy. (#ruby #naivebayes)
Stars: ✭ 30 (-74.79%)
Mutual labels:  tokenizer
Regexplain
🔍 An RStudio addin slash regex utility belt
Stars: ✭ 413 (+247.06%)
Mutual labels:  regular-expression
Regex
A Regular Expression game for Android
Stars: ✭ 80 (-32.77%)
Mutual labels:  regular-expression
Php Parser
🌿 NodeJS PHP Parser - extract AST or tokens (PHP5 and PHP7)
Stars: ✭ 400 (+236.13%)
Mutual labels:  tokenizer
Hae
HaE - BurpSuite Highlighter and Extractor
Stars: ✭ 397 (+233.61%)
Mutual labels:  regular-expression
Stringr
A fresh approach to string manipulation in R
Stars: ✭ 397 (+233.61%)
Mutual labels:  regular-expression
Place2live
Analysis of the characteristics of different countries
Stars: ✭ 30 (-74.79%)
Mutual labels:  regular-expression
Picomatch
Blazing fast and accurate glob matcher written JavaScript, with no dependencies and full support for standard and extended Bash glob features, including braces, extglobs, POSIX brackets, and regular expressions.
Stars: ✭ 393 (+230.25%)
Mutual labels:  regular-expression
Kadot
Kadot, the unsupervised natural language processing library.
Stars: ✭ 108 (-9.24%)
Mutual labels:  tokenizer
Regexp2
A full-featured regex engine in pure Go based on the .NET engine
Stars: ✭ 389 (+226.89%)
Mutual labels:  regular-expression
Regen
Tool to generate random strings from Go/RE2 regular expressions (Migrated to https://git.sr.ht/~nilium/regen)
Stars: ✭ 79 (-33.61%)
Mutual labels:  regular-expression
Flutter easy rich text
The EasyRichText widget provides an easy way to use RichText.
Stars: ✭ 30 (-74.79%)
Mutual labels:  regular-expression
Jflex
The fast scanner generator for Java™ with full Unicode support
Stars: ✭ 380 (+219.33%)
Mutual labels:  tokenizer
Regexanalyzer
Regular Expression Analyzer and Composer for Node.js / XPCOM / Browser Javascript, PHP, Python
Stars: ✭ 29 (-75.63%)
Mutual labels:  regular-expression
Subconverter
Utility to convert between various subscription format
Stars: ✭ 4,912 (+4027.73%)
Mutual labels:  regular-expression
Minta
✳️  Electron app for generating regular expressions
Stars: ✭ 353 (+196.64%)
Mutual labels:  regular-expression
Nanomatch
Fast, minimal glob matcher for node.js. Similar to micromatch, minimatch and multimatch, but without support for extended globs (extglobs), posix brackets or braces, and with complete Bash 4.3 wildcard support: ("*", "**", and "?").
Stars: ✭ 79 (-33.61%)
Mutual labels:  regular-expression
Lfuzzer
Fuzzing Parsers with Tokens
Stars: ✭ 28 (-76.47%)
Mutual labels:  tokenizer
Commit Watcher
Find interesting and potentially hazardous commits in git projects
Stars: ✭ 345 (+189.92%)
Mutual labels:  regular-expression
Regex
A sane interface for php's built in preg_* functions
Stars: ✭ 909 (+663.87%)
Mutual labels:  regular-expression
Generex
A Java library for generating String from a regular expression.
Stars: ✭ 316 (+165.55%)
Mutual labels:  regular-expression
Friso
High performance Chinese tokenizer with both GBK and UTF-8 charset support based on MMSEG algorithm developed by ANSI C. Completely based on modular implementation and can be easily embedded in other programs, like: MySQL, PostgreSQL, PHP, etc.
Stars: ✭ 313 (+163.03%)
Mutual labels:  tokenizer
To Regex Range
Pass two numbers, get a regex-compatible source string for matching ranges. Fast compiler, optimized regex, and validated against more than 2.78 million test assertions. Useful for creating regular expressions to validate numbers, ranges, years, etc.
Stars: ✭ 97 (-18.49%)
Mutual labels:  regular-expression
Code Checker
✅ A simple tool to check source code against a set of Nette coding standards.
Stars: ✭ 76 (-36.13%)
Mutual labels:  nette
Anabelle
👸 API documentation generator (JSON-RPC / REST)
Stars: ✭ 20 (-83.19%)
Mutual labels:  nette-framework
Regex
Regular expressions for swift
Stars: ✭ 306 (+157.14%)
Mutual labels:  regular-expression
Sentences
A multilingual command line sentence tokenizer in Golang
Stars: ✭ 293 (+146.22%)
Mutual labels:  tokenizer
Sacremoses
Python port of Moses tokenizer, truecaser and normalizer
Stars: ✭ 293 (+146.22%)
Mutual labels:  tokenizer
Cols Agent Tasks
Colin's ALM Corner Custom Build Tasks
Stars: ✭ 70 (-41.18%)
Mutual labels:  tokenizer
Forms Multiplier
🔁 Form multiplier & replicator for Nette Framework
Stars: ✭ 11 (-90.76%)
Mutual labels:  nette
Anymatch
‼️ Matches strings against configurable strings, globs, regular expressions, and/or functions
Stars: ✭ 289 (+142.86%)
Mutual labels:  regular-expression
61-120 of 298 similar projects