All Projects → textreadr → Similar Projects or Alternatives

270 Open source projects that are alternatives of or similar to textreadr

Best PDF Converter! PDF to any format, pdf2word/excel/xml/html/txt...

Stars: ✭ 94 (+44.62%)

Mutual labels: docx

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.

Stars: ✭ 181 (+178.46%)

Mutual labels: text-mining

malay-dataset

Text corpus for Bahasa Malaysia, https://malaya.readthedocs.io/en/latest/Dataset.html

Stars: ✭ 189 (+190.77%)

Mutual labels: text-mining

Tokenizers

Fast, Consistent Tokenization of Natural Language Text

Stars: ✭ 161 (+147.69%)

Mutual labels: text-mining

woolly

The Text Mining Elixir

Stars: ✭ 48 (-26.15%)

Mutual labels: text-mining

Udpipe

R package for Tokenization, Parts of Speech Tagging, Lemmatization and Dependency Parsing Based on the UDPipe Natural Language Processing Toolkit

Stars: ✭ 160 (+146.15%)

Mutual labels: text-mining

opentbs

With OpenTBS you can merge OpenOffice - LibreOffice and Ms Office documents with PHP using the TinyButStrong template engine. Simple use OpenOffice - LibreOffice or Ms Office to edit your templates : DOCX, XLSX, PPTX, ODT, OSD, ODP and other formats. That is the Natural Template philosophy.

Stars: ✭ 48 (-26.15%)

Mutual labels: docx

Awesome Nlp

📖 A curated list of resources dedicated to Natural Language Processing (NLP)

Stars: ✭ 12,626 (+19324.62%)

Mutual labels: text-mining

corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Stars: ✭ 16 (-75.38%)

Mutual labels: text-mining

Textfeatures

👷‍♂️ A simple package for extracting useful features from character objects 👷‍♀️

Stars: ✭ 148 (+127.69%)

Mutual labels: text-mining

eoffice

Export and import graphics and tables to MicroSoft office

Stars: ✭ 19 (-70.77%)

Mutual labels: docx

Qdap

Quantitative Discourse Analysis Package: Bridging the gap between qualitative data and quantitative analysis

Stars: ✭ 146 (+124.62%)

Mutual labels: text-mining

intertext

Detect and visualize text reuse

Stars: ✭ 97 (+49.23%)

Mutual labels: text-mining

Kate

Code & data accompanying the KDD 2017 paper "KATE: K-Competitive Autoencoder for Text"

Stars: ✭ 135 (+107.69%)

Mutual labels: text-mining

odinson

Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.

Stars: ✭ 59 (-9.23%)

Mutual labels: text-mining

Khcoder

KH Coder: for Quantitative Content Analysis or Text Mining

Stars: ✭ 126 (+93.85%)

Mutual labels: text-mining

my-writing-workflow

Tutorial for converting markdown files in to APA-formatted docs, based on my workflow.

Stars: ✭ 35 (-46.15%)

Mutual labels: docx

Keywords2vec

Stars: ✭ 121 (+86.15%)

Mutual labels: text-mining

soldoc

A solidity documentation generator, based in NatSpec format. 📃 with standalone HTML, pdf, gitbook and docsify output ✏️ just plug and play.

Stars: ✭ 54 (-16.92%)

Mutual labels: doc

Cogcomp Nlpy

CogComp's light-weight Python NLP annotators

Stars: ✭ 115 (+76.92%)

Mutual labels: text-mining

crminer

⛔ ARCHIVED ⛔ Fetch 'Scholary' Full Text from 'Crossref'

Stars: ✭ 17 (-73.85%)

Mutual labels: text-mining

Genius

Easily access song lyrics from Genius in a tibble.

Stars: ✭ 111 (+70.77%)

Mutual labels: text-mining

Udacity-Data-Analyst-Nanodegree

Repository for the projects needed to complete the Data Analyst Nanodegree.

Stars: ✭ 31 (-52.31%)

Mutual labels: text-mining

Text predictor

Char-level RNN LSTM text generator📄.

Stars: ✭ 99 (+52.31%)

Mutual labels: text-mining

Text-Classification-LSTMs-PyTorch

The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.

Stars: ✭ 45 (-30.77%)

Mutual labels: text-mining

Lexicon

A data package containing lexicons and dictionaries for text analysis

Stars: ✭ 87 (+33.85%)

Mutual labels: text-mining

DocX

Convert NSAttributedString / AttributedString to .docx Word files on iOS and macOS

Stars: ✭ 41 (-36.92%)

Mutual labels: docx

Orange3 Text

🍊 📄 Text Mining add-on for Orange3

Stars: ✭ 83 (+27.69%)

Mutual labels: text-mining

perke

A keyphrase extractor for Persian

Stars: ✭ 60 (-7.69%)

Mutual labels: text-mining

Pyphonetics

A Python 3 phonetics library.

Stars: ✭ 61 (-6.15%)

Mutual labels: text-mining

JoSH

[KDD 2020] Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding

Stars: ✭ 55 (-15.38%)

Mutual labels: text-mining

Applied Text Mining In Python

Repo for Applied Text Mining in Python (coursera) by University of Michigan

Stars: ✭ 59 (-9.23%)

Mutual labels: text-mining

palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Stars: ✭ 32 (-50.77%)

Mutual labels: text-mining

Pipeit

PipeIt is a text transformation, conversion, cleansing and extraction tool.

Stars: ✭ 57 (-12.31%)

Mutual labels: text-mining

teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Stars: ✭ 91 (+40%)

Mutual labels: text-mining

Spark Nkp

Natural Korean Processor for Apache Spark

Stars: ✭ 50 (-23.08%)

Mutual labels: text-mining

docx-to-pdf-on-AWS-Lambda

Microsoft Word doc/docx to PDF conversion on AWS Lambda using Node.js

Stars: ✭ 42 (-35.38%)

Mutual labels: docx

Friend.ly

A social media platform with a friend recommendation engine based on personality trait extraction

Stars: ✭ 41 (-36.92%)

Mutual labels: text-mining

Blue Brain text mining toolbox for semantic search and structured information extraction

Stars: ✭ 26 (-60%)

Mutual labels: text-mining

Tidytext

Text mining using tidy tools ✨📄✨

Stars: ✭ 975 (+1400%)

Mutual labels: text-mining

koshort

(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.

Stars: ✭ 62 (-4.62%)

Mutual labels: text-mining

Uc Davis Cs Exams Analysis

📈 Regression and Classification with UC Davis student quiz data and exam data

Stars: ✭ 33 (-49.23%)

Mutual labels: text-mining

documentspark

💖 DocumentSpark - Simple secure document viewing server. Converts a document to a picture of its pages. Content disarm and reconstruction. CDR. Formerly p2. The CDR solution for ViewFinder remote browser.

Stars: ✭ 211 (+224.62%)

Mutual labels: docx

Nlppln

NLP pipeline software using common workflow language

Stars: ✭ 31 (-52.31%)

Mutual labels: text-mining

Text Mining

Text Mining in Python

Stars: ✭ 18 (-72.31%)

Mutual labels: text-mining

docx2csv

Extracts tables from .docx files and saves them as .csv or .xls files

Stars: ✭ 42 (-35.38%)

Mutual labels: docx

Autophrase

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

Stars: ✭ 835 (+1184.62%)

Mutual labels: text-mining

Aravec

AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP research community with free to use and powerful word embedding models.

Stars: ✭ 239 (+267.69%)

Mutual labels: text-mining

Rake Nltk

Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.

Stars: ✭ 793 (+1120%)

Mutual labels: text-mining

Nlp In Practice

Starter code to solve real world text data problems. Includes: Gensim Word2Vec, phrase embeddings, Text Classification with Logistic Regression, word count with pyspark, simple text preprocessing, pre-trained embeddings and more.

Stars: ✭ 790 (+1115.38%)

Mutual labels: text-mining

TableDisentangler

Functional and structural analysis of tables in research papers (Table disentangling)

Stars: ✭ 21 (-67.69%)

Mutual labels: text-mining

Gwu data mining

Materials for GWU DNSC 6279 and DNSC 6290.

Stars: ✭ 217 (+233.85%)

Mutual labels: text-mining

Text2vec

Fast vectorization, topic modeling, distances and GloVe word embeddings in R.