class-normClass Normalization for Continual Zero-Shot Learning
Stars: ✭ 34 (+25.93%)
Text DetectorTool which allow you to detect and translate text.
Stars: ✭ 173 (+540.74%)
lametaThe Metadata Editor for Transparent Archiving of language document materials
Stars: ✭ 18 (-33.33%)
NlprePython library for Natural Language Preprocessing (NLPre)
Stars: ✭ 158 (+485.19%)
readabilityFast readability scores for text data
Stars: ✭ 22 (-18.52%)
lambda-notebookLambda Notebook: Formal Semantics in Jupyter
Stars: ✭ 16 (-40.74%)
aera-workshopThis workshop introduces participants to the Learning Analytics (LA), and provides a brief overview of LA methodologies, literature, applications, and ethical issues as they relate to STEM education.
Stars: ✭ 14 (-48.15%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+425.93%)
WonderfulPolishLanguageThis is a repository created for the list of resources for learning and exploring Wonderful Polish language.
Stars: ✭ 31 (+14.81%)
PrenlpPreprocessing Library for Natural Language Processing
Stars: ✭ 130 (+381.48%)
sparklanesA lightweight data processing framework for Apache Spark
Stars: ✭ 17 (-37.04%)
LibasciidocA Golang library for processing Asciidoc files.
Stars: ✭ 129 (+377.78%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+359.26%)
RainNet[CVPR 2021] Region-aware Adaptive Instance Normalization for Image Harmonization
Stars: ✭ 125 (+362.96%)
BplBinary Processing Language
Stars: ✭ 103 (+281.48%)
MtpMulti-lingual Text Processing
Stars: ✭ 87 (+222.22%)
TextrudeCode generation from YAML/JSON/CSV models via SCRIBAN templates
Stars: ✭ 79 (+192.59%)
Awesome-CyberSec-ResourcesAn awesome collection of curated Cyber Security resources(Books, Tutorials, Blogs, Podcasts, ...)
Stars: ✭ 273 (+911.11%)
KefirbbA flexible Java text processor. BB, BBCode, BB-code, HTML, Textile, Markdown, parser, translator, converter.
Stars: ✭ 83 (+207.41%)
malay-datasetText corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html
Stars: ✭ 189 (+600%)
VirastarCleaning-up Persian Texts!
Stars: ✭ 77 (+185.19%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+703.7%)
named-entity-recognitionNotebooks for teaching Named Entity Recognition at the Cultural Heritage Data School, run by Cambridge Digital Humanities
Stars: ✭ 18 (-33.33%)
Lingua FrancaMycroft's multilingual text parsing and formatting library
Stars: ✭ 51 (+88.89%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+662.96%)
Qp Trie RsAn idiomatic and fast QP-trie implementation in pure Rust.
Stars: ✭ 47 (+74.07%)
learn perl onelinersExample based guide for text processing with perl from the command line
Stars: ✭ 63 (+133.33%)
Concise Ipython Notebooks For Deep LearningIpython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-14.81%)
GohnHatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-37.04%)
tapText Analytics Pipeline (TAP)
Stars: ✭ 17 (-37.04%)
Python NameparserA simple Python module for parsing human names into their individual components
Stars: ✭ 462 (+1611.11%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+607.41%)
Open Korean TextOpen Korean Text Processor - An Open-source Korean Text Processor
Stars: ✭ 438 (+1522.22%)
Aho CorasickA fast implementation of Aho-Corasick in Rust.
Stars: ✭ 424 (+1470.37%)
TextheroText preprocessing, representation and visualization from zero to hero.
Stars: ✭ 2,407 (+8814.81%)
TextpipeTextpipe: clean and extract metadata from text
Stars: ✭ 284 (+951.85%)
lda2vecMixing Dirichlet Topic Models and Word Embeddings to Make lda2vec from this paper https://arxiv.org/abs/1605.02019
Stars: ✭ 27 (+0%)
daachorse🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure.
Stars: ✭ 75 (+177.78%)
Multi rakeMultilingual Rapid Automatic Keyword Extraction (RAKE) for Python
Stars: ✭ 162 (+500%)
PubMed-Best-MatchMachine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches
Stars: ✭ 36 (+33.33%)
typ3r.js🍟 [Library] dA aNn0Y1Ng t3Xt g3NeRa7or
Stars: ✭ 22 (-18.52%)
LazynlpLibrary to scrape and clean web pages to create massive datasets.
Stars: ✭ 1,985 (+7251.85%)
stringxDrop-in replacements for base R string functions powered by stringi
Stars: ✭ 14 (-48.15%)
textreadrTools to uniformly read in text data including semi-structured transcripts
Stars: ✭ 65 (+140.74%)
SwitchNorm DetectionThe code of Switchable Normalization for object detection based on Detectron.pytorch.
Stars: ✭ 79 (+192.59%)
AdjutantRuns a pubmed query, returns results and allows user to explore high-level structure of returned documents
Stars: ✭ 59 (+118.52%)
readerDistant Reader, a tool for using & understanding a corpus
Stars: ✭ 18 (-33.33%)
dmriprepdMRIPrep is a robust and easy-to-use pipeline for preprocessing of diverse dMRI data. The transparent workflow dispenses of manual intervention, thereby ensuring the reproducibility of the results.
Stars: ✭ 55 (+103.7%)
event-embedding-multitask*SEM 2018: Learning Distributed Event Representations with a Multi-Task Approach
Stars: ✭ 22 (-18.52%)
sensimSentence Similarity Estimator (SenSim)
Stars: ✭ 15 (-44.44%)