
Yoctol / text-normalizer

Licence: MIT License
Normalize text string

Programming Languages

python
Makefile


text-normalizer


Normalize your text strings. text-normalizer is a Python package that helps you normalize text data and then recover the original text from the normalized form.

Install

Use Python 3.

> pip install text-normalizer

Usage

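from text_normalizer.text_normalizer_collection_library import chinese_charactor_text_normalizer_collection_2

# Raw input with surrounding whitespace, numbers, mixed case, and CJK punctuation.
input_sentence = "   我在85.33度C買了一杯900──1000元的咖啡    《ohoh》?? m_m"

# normalize() returns the normalized sentence and the metadata needed to undo it.
nor_sentence, meta = chinese_charactor_text_normalizer_collection_2.normalize(input_sentence)
print(nor_sentence)
# > "我在_float_度c買了一杯_int_-_int_元的咖啡 <ohoh>?? m_m"

# denormalize() takes the normalized sentence and the metadata and restores the original text.
de_sentence = chinese_charactor_text_normalizer_collection_2.denormalize(nor_sentence, meta)
print(de_sentence)
# > "我在85.33度C買了一杯900──1000元的咖啡 《ohoh》?? m_m"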
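To give a feel for why normalize returns a second value, here is a minimal sketch of the normalize/denormalize round trip for numbers only. It is not the package's implementation: the function names `normalize_numbers` and `denormalize_numbers`, the regular expression, and the list-based metadata are all assumptions made for illustration. Only the `_int_` and `_float_` placeholder tokens come from the usage output above.

```python
import re
from typing import List, Tuple

# Hypothetical pattern (an assumption): a float such as 85.33, or a plain integer.
_NUMBER = re.compile(r"\d+\.\d+|\d+")


def normalize_numbers(text: str) -> Tuple[str, List[str]]:
    """Replace each number with _float_ or _int_ and record the originals in order."""
    originals: List[str] = []

    def _swap(match: "re.Match") -> str:
        originals.append(match.group(0))
        return "_float_" if "." in match.group(0) else "_int_"

    return _NUMBER.sub(_swap, text), originals


def denormalize_numbers(text: str, originals: List[str]) -> str:
    """Put the recorded numbers back into the placeholders, left to right."""
    remaining = iter(originals)
    return re.sub(r"_float_|_int_", lambda _m: next(remaining), text)


sentence = "我在85.33度C買了一杯900-1000元的咖啡"
normalized, meta = normalize_numbers(sentence)
restored = denormalize_numbers(normalized, meta)
print(normalized)
print(restored)
```

The metadata here is just the list of replaced substrings, which is enough to restore them in order. This toy version breaks if the input already contains a literal `_int_` or `_float_`; a real normalizer would presumably need richer metadata, such as positions, to handle that case and to undo other changes such as case folding or whitespace trimming.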