XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+146.67%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+51.67%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+91.67%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+525%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+480%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+108.33%)
estrattoparsing fixed width files content made easy
Stars: ✭ 12 (-80%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-73.33%)
persianSimple Python tool for Persian language localization.
Stars: ✭ 141 (+135%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (-25%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (-55%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+111.67%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+136.67%)
Textractextract text from any document. no muss. no fuss.
Stars: ✭ 3,165 (+5175%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (-45%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+496.67%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-45%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (+91.67%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+218.33%)
HdltexHDLTex: Hierarchical Deep Learning for Text Classification
Stars: ✭ 191 (+218.33%)
perstemPersian stemmer and morphological analyzer
Stars: ✭ 18 (-70%)
persianSome utilities for Persian language in Go (Golang)
Stars: ✭ 65 (+8.33%)
VirastarCleaning-up Persian Texts!
Stars: ✭ 77 (+28.33%)
Saaghar“Saaghar” (ساغر) is a Persian poetry software written by C++ under Qt framework, it uses "ganjoor" database as its database. It has tab feature in both its “Viewer” and its “Search” page that cause it be suitable for research goals.
Stars: ✭ 42 (-30%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1323.33%)
ECG analysisNo description or website provided.
Stars: ✭ 32 (-46.67%)
finglishA Finglish to Persian converter.
Stars: ✭ 60 (+0%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (-70%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+63.33%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (-33.33%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (-20%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+261.67%)
bookworm📚 social networks from novels
Stars: ✭ 72 (+20%)
learning2hash.github.ioWebsite for "A survey of learning to hash for Computer Vision" https://learning2hash.github.io
Stars: ✭ 14 (-76.67%)
py-persian-toolsAn anthology of a variety of tools for the Persian language in Python
Stars: ✭ 106 (+76.67%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (+5%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+243.33%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (-23.33%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (+48.33%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+106.67%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (-5%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-58.33%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+21171.67%)
frogFrog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.
Stars: ✭ 70 (+16.67%)
ArabicProcessingCogA Python package that do stemming, tokenization, sentence breaking, segmentation, normalization, POS tagging for Arabic language.
Stars: ✭ 19 (-68.33%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-28.33%)
evildorkEvildork targeting your fiancee👁️
Stars: ✭ 46 (-23.33%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+22198.33%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-80%)