All Projects → deduce → Similar Projects or Alternatives

335 Open source projects that are alternatives of or similar to deduce

The aim of this repository is to show a baseline model for text classification by implementing a LSTM-based model coded in PyTorch. In order to provide a better understanding of the model, it will be used a Tweets dataset provided by Kaggle.

Stars: ✭ 45 (+12.5%)

Mutual labels: text-mining, text-processing

advanced-text-mining

TEANAPS 라이브러리를 활용한 자연어 처리와 텍스트 분석 방법론에 대해 다룹니다.

Stars: ✭ 15 (-62.5%)

Mutual labels: text-mining, text-processing

text-analysis

Weaving analytical stories from text data

Stars: ✭ 12 (-70%)

Mutual labels: text-mining, text-processing

Dan Jurafsky Chris Manning Nlp

My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.

Stars: ✭ 124 (+210%)

Mutual labels: information-extraction, text-processing

support-tickets-classification

This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en

Stars: ✭ 142 (+255%)

Mutual labels: text-mining, text-processing

perke

A keyphrase extractor for Persian

Stars: ✭ 60 (+50%)

Mutual labels: text-mining, text-processing

TableDisentangler

Functional and structural analysis of tables in research papers (Table disentangling)

Stars: ✭ 21 (-47.5%)

Mutual labels: text-mining, information-extraction

palladian

Palladian is a Java-based toolkit with functionality for text processing, classification, information extraction, and data retrieval from the Web.

Stars: ✭ 32 (-20%)

Mutual labels: text-mining, information-extraction

odinson

Odinson is a powerful and highly optimized open-source framework for rule-based information extraction. Odinson couples a simple, yet powerful pattern language that can operate over multiple representations of text, with a runtime system that operates in near real time.

Stars: ✭ 59 (+47.5%)

Mutual labels: text-mining, information-extraction

TextDatasetCleaner

🔬 Очистка датасетов от мусора (нормализация, препроцессинг)

Stars: ✭ 27 (-32.5%)

Mutual labels: text-mining, text-processing

teanaps

자연어 처리와 텍스트 분석을 위한 오픈소스 파이썬 라이브러리 입니다.

Stars: ✭ 91 (+127.5%)

Mutual labels: text-mining, text-processing

Cogcomp Nlpy

CogComp's light-weight Python NLP annotators

Stars: ✭ 115 (+187.5%)

Mutual labels: text-mining, text-processing

Pipeit

PipeIt is a text transformation, conversion, cleansing and extraction tool.

Stars: ✭ 57 (+42.5%)

Mutual labels: text-mining, text-processing

TabInOut

Framework for information extraction from tables

Stars: ✭ 37 (-7.5%)

Mutual labels: text-mining, information-extraction

Xioc

Extract indicators of compromise from text, including "escaped" ones.

Stars: ✭ 148 (+270%)

Mutual labels: text-mining, text-processing

Textcluster

短文本聚类预处理模块 Short text cluster

Stars: ✭ 115 (+187.5%)

Mutual labels: text-mining, text-processing

Artificial Adversary

🗣️ Tool to generate adversarial text examples and test machine learning models against them

Stars: ✭ 348 (+770%)

Mutual labels: text-mining, text-processing

Text Mining

Text Mining in Python

Stars: ✭ 18 (-55%)

Mutual labels: text-mining, text-processing

Applied Text Mining In Python

Repo for Applied Text Mining in Python (coursera) by University of Michigan

Stars: ✭ 59 (+47.5%)

Mutual labels: text-mining, text-processing

estratto

parsing fixed width files content made easy

Stars: ✭ 12 (-70%)

Mutual labels: text-mining, text-processing

frog

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. All NLP modules are based on Timbl, the Tilburg memory-based learning software package.

Stars: ✭ 70 (+75%)

Mutual labels: text-processing, dutch

TRUNAJOD2.0

An easy-to-use library to extract indices from texts.

Stars: ✭ 18 (-55%)

Mutual labels: text-mining, text-processing

neji

Flexible and powerful platform for biomedical information extraction from text

Stars: ✭ 37 (-7.5%)

Mutual labels: text-mining, information-extraction

corpusexplorer2.0

Korpuslinguistik war noch nie so einfach...

Stars: ✭ 16 (-60%)

Mutual labels: text-mining, text-processing

Text-Analysis

Explaining textual analysis tools in Python. Including Preprocessing, Skip Gram (word2vec), and Topic Modelling.

Stars: ✭ 48 (+20%)

Mutual labels: text-mining, text-processing

Awesome Hungarian Nlp

A curated list of NLP resources for Hungarian

Stars: ✭ 121 (+202.5%)

Mutual labels: text-mining, information-extraction

Chemdataextractor

Automatically extract chemical information from scientific documents

Stars: ✭ 152 (+280%)

Mutual labels: text-mining, information-extraction

syn

syn - the thesaurus

Stars: ✭ 45 (+12.5%)

Mutual labels: text-processing

Emotion-recognition-from-tweets

A comprehensive approach on recognizing emotion (sentiment) from a certain tweet. Supervised machine learning.

Stars: ✭ 17 (-57.5%)

Mutual labels: text-processing

BioMedical-NLP-corpus

Biomedical NLP Corpus or Datasets.

Stars: ✭ 44 (+10%)

Mutual labels: text-mining

learn perl oneliners

Example based guide for text processing with perl from the command line

Stars: ✭ 63 (+57.5%)

Mutual labels: text-processing

awesome-document-understanding

A curated list of resources for Document Understanding (DU) topic

Stars: ✭ 620 (+1450%)

Mutual labels: information-extraction

crminer

⛔ ARCHIVED ⛔ Fetch 'Scholary' Full Text from 'Crossref'

Stars: ✭ 17 (-57.5%)

Mutual labels: text-mining

neural name tagging

Code for "Reliability-aware Dynamic Feature Composition for Name Tagging" (ACL2019)

Stars: ✭ 39 (-2.5%)

Mutual labels: information-extraction

text2video

Text to Video Generation Problem

Stars: ✭ 28 (-30%)

Mutual labels: text-processing

frangipanni

Program to convert lines of text into a tree structure.

Stars: ✭ 1,176 (+2840%)

Mutual labels: text-processing

TypeNet

A Hierarchical Type system for fine grained entity typing

Stars: ✭ 51 (+27.5%)

Mutual labels: information-extraction

batterydatabase

Tools for auto-generating the battery-materials database.

Stars: ✭ 29 (-27.5%)

Mutual labels: information-extraction

iis

Information Inference Service of the OpenAIRE system

Stars: ✭ 16 (-60%)

Mutual labels: text-mining

ReQuest

Indirect Supervision for Relation Extraction Using Question-Answer Pairs (WSDM'18)

Stars: ✭ 26 (-35%)

Mutual labels: information-extraction

text-mined-synthesis public

Codes for text-mined solid-state reactions dataset

Stars: ✭ 46 (+15%)

Mutual labels: text-mining

s3-concat

Concatenate Amazon S3 files remotely using flexible patterns

Stars: ✭ 32 (-20%)

Mutual labels: text-processing

ConTexto

Librería en Python para minería de texto y NLP

Stars: ✭ 43 (+7.5%)

Mutual labels: text-processing

PubMed-Best-Match

Machine-learning based pipeline relying on LambdaMART currently used in PubMed for relevance (Best Match) searches

Stars: ✭ 36 (-10%)

Mutual labels: text-mining

sliceslice-rs

A fast implementation of single-pattern substring search using SIMD acceleration.

Stars: ✭ 66 (+65%)

Mutual labels: text-processing

DocuNet

Code and dataset for the IJCAI 2021 paper "Document-level Relation Extraction as Semantic Segmentation".

Stars: ✭ 84 (+110%)

Mutual labels: information-extraction

r4strings

Handling Strings in R

Stars: ✭ 39 (-2.5%)

Mutual labels: text-processing

Answerable

Recommendation system for Stack Overflow unanswered questions

Stars: ✭ 13 (-67.5%)

Mutual labels: text-mining

Blue Brain text mining toolbox for semantic search and structured information extraction

Stars: ✭ 26 (-35%)

Mutual labels: text-mining

CogIE

CogIE: An Information Extraction Toolkit for Bridging Text and CogNet. ACL 2021

Stars: ✭ 47 (+17.5%)

Mutual labels: information-extraction

s3-utils

Utilities and tools based around Amazon S3 to provide convenience APIs in a CLI

Stars: ✭ 45 (+12.5%)

Mutual labels: text-processing

naacl2018-fever

Fact Extraction and VERification baseline published in NAACL2018

Stars: ✭ 109 (+172.5%)

Mutual labels: information-extraction

lima

The Libre Multilingual Analyzer, a Natural Language Processing (NLP) C++ toolkit.

Stars: ✭ 75 (+87.5%)

Mutual labels: information-extraction

readability

Fast readability scores for text data

Stars: ✭ 22 (-45%)

Mutual labels: text-mining

WeTextProcessing

Text Normalization & Inverse Text Normalization

Stars: ✭ 213 (+432.5%)

Mutual labels: text-processing

koshort

(deprecated) 🐱 koshort is a Python package for Korean internet spoken language crawling and processing... or maybe Korean domestic cat.

Stars: ✭ 62 (+55%)

Mutual labels: text-mining

vi-rs

Vietnamese Input Method library

Stars: ✭ 69 (+72.5%)

Mutual labels: text-processing

dif

'dif' is a Linux preprocessing front end to gvimdiff/meld/kompare

Stars: ✭ 18 (-55%)

Mutual labels: text-processing

TVGemist

An *Unofficial* Uitzending Gemist application for  TV

Stars: ✭ 23 (-42.5%)

Mutual labels: dutch

woolly

The Text Mining Elixir

Stars: ✭ 48 (+20%)

Mutual labels: text-mining

1-60 of 335 similar projects

›

next*5