Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+2075%)
Open Semantic SearchOpen Source research tool to search, browse, analyze and explore large document collections by Semantic Search Engine and Open Source Text Mining & Text Analytics platform (Integrates ETL for document processing, OCR for images & PDF, named entity recognition for persons, organizations & locations, metadata management by thesaurus & ontologies, search user interface & search apps for fulltext search, faceted search & knowledge graph)
Stars: ✭ 386 (+2312.5%)
TRUNAJOD2.0An easy-to-use library to extract indices from texts.
Stars: ✭ 18 (+12.5%)
GraphbrainLanguage, Knowledge, Cognition
Stars: ✭ 294 (+1737.5%)
support-tickets-classificationThis case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+787.5%)
teanaps자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.
Stars: ✭ 91 (+468.75%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (+0%)
TextDatasetCleaner🔬 Очистка датасетов от мусора (нормализация, препроцессинг)
Stars: ✭ 27 (+68.75%)
perkeA keyphrase extractor for Persian
Stars: ✭ 60 (+275%)
XiocExtract indicators of compromise from text, including "escaped" ones.
Stars: ✭ 148 (+825%)
Text mining resourcesResources for learning about Text Mining and Natural Language Processing
Stars: ✭ 358 (+2137.5%)
Cogcomp NlpyCogComp's light-weight Python NLP annotators
Stars: ✭ 115 (+618.75%)
text-analysisWeaving analytical stories from text data
Stars: ✭ 12 (-25%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (+1668.75%)
Knowage ServerKnowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (+1625%)
Cogcomp NlpCogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (+2462.5%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (+756.25%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+5237.5%)
woollyThe Text Mining Elixir
Stars: ✭ 48 (+200%)
estrattoparsing fixed width files content made easy
Stars: ✭ 12 (-25%)
Text-Classification-LSTMs-PyTorchThe aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.
Stars: ✭ 45 (+181.25%)
SparseLSHA Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.
Stars: ✭ 127 (+693.75%)
ConTextoLibrería en Python para minería de texto y NLP
Stars: ✭ 43 (+168.75%)
DaDengAndHisPython【微信公众号:大邓和他的python】, Python语法快速入门https://www.bilibili.com/video/av44384851 Python网络爬虫快速入门https://www.bilibili.com/video/av72010301, 我的联系邮箱
[email protected] Stars: ✭ 59 (+268.75%)
kwxBERT, LDA, and TFIDF based keyword extraction in Python
Stars: ✭ 33 (+106.25%)
Text-AnalysisExplaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.
Stars: ✭ 48 (+200%)
big-data-upfRECSM-UPF Summer School: Social Media and Big Data Research
Stars: ✭ 21 (+31.25%)
TextvecText vectorization tool to outperform TFIDF for classification tasks
Stars: ✭ 167 (+943.75%)
JekyllJekyll-based static site for The Programming Historian
Stars: ✭ 387 (+2318.75%)
Stanza OldStanford NLP group's shared Python tools.
Stars: ✭ 142 (+787.5%)
VizukaExplore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (+525%)
Graph samplingGraph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.
Stars: ✭ 99 (+518.75%)
PadatiousA neural network intent parser
Stars: ✭ 124 (+675%)
tf-idf-pythonTerm frequency–inverse document frequency for Chinese novel/documents implemented in python.
Stars: ✭ 98 (+512.5%)
deduceDeduce: de-identification method for Dutch medical text
Stars: ✭ 40 (+150%)
nyt-first-saidTweets when words are published for the first time in the NYT
Stars: ✭ 222 (+1287.5%)
Metasra PipelineMetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (+106.25%)
Gsoc2018 3gm💫 Automated codification of Greek Legislation with NLP
Stars: ✭ 36 (+125%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (+418.75%)
R Text DataList of textual data sources to be used for text mining in R
Stars: ✭ 85 (+431.25%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+2243.75%)
PipeitPipeIt is a text transformation, conversion, cleansing and extraction tool.
Stars: ✭ 57 (+256.25%)
TadwAn implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (+168.75%)
Pyss3A Python package implementing a new machine learning model for text classification with visualization tools for Explainable AI
Stars: ✭ 191 (+1093.75%)
QminerAnalytic platform for real-time large-scale streams containing structured and unstructured data.
Stars: ✭ 206 (+1187.5%)
QdapQuantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis
Stars: ✭ 146 (+812.5%)
TextpipeTextpipe: clean and extract metadata from text
Stars: ✭ 284 (+1675%)
Textractextract text from any document. no muss. no fuss.
Stars: ✭ 3,165 (+19681.25%)
Textcluster短文本聚类预处理模块 Short text cluster
Stars: ✭ 115 (+618.75%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (+1256.25%)
pfootprintPolitical Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (+25%)