Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+93.84%)
EventsRepository for *SEM Paper on Event Coreference Resolution in ECB+
Stars: ✭ 20 (-96.58%)
zinggScalable identity resolution, entity resolution, data mastering and deduplication using ML
Stars: ✭ 655 (+12.16%)
Cdqa⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (-14.38%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-90.75%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (-68.32%)
AbydosAbydos NLP/IR library for Python
Stars: ✭ 91 (-84.42%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+44.69%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+159.59%)
tika-similarityTika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Stars: ✭ 92 (-84.25%)
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+46.4%)
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-95.55%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-84.42%)
RefinrCluster and merge similar char values: an R implementation of Open Refine clustering algorithms
Stars: ✭ 91 (-84.42%)
MlA high-level machine learning and deep learning library for the PHP language.
Stars: ✭ 1,270 (+117.47%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+2085.45%)
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+219.86%)
NewsrecommenderA news recommendation system tailored for user communities
Stars: ✭ 164 (-71.92%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (-21.23%)
FingerprintsMake it easier to compare and cross-reference the names of companies and people by applying strong normalisation.
Stars: ✭ 91 (-84.42%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (-67.81%)
CatalystAccelerated deep learning R&D
Stars: ✭ 2,804 (+380.14%)
LibpostalA C library for parsing/normalizing street addresses around the world. Powered by statistical NLP and open geo data.
Stars: ✭ 3,312 (+467.12%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-84.76%)
splinkImplementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (-69.01%)
Deep Semantic Similarity ModelMy Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (-12.84%)
FuzzywuzzyJava fuzzy string matching implementation of the well known Python's fuzzywuzzy algorithm. Fuzzy search for Java
Stars: ✭ 506 (-13.36%)
Spacy Stanza💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
Stars: ✭ 508 (-13.01%)
Bert scoreBERT score for text generation
Stars: ✭ 568 (-2.74%)
RecordlinkageA toolkit for record linkage and duplicate detection in Python
Stars: ✭ 532 (-8.9%)
SeqganA simplified PyTorch implementation of "SeqGAN: Sequence Generative Adversarial Nets with Policy Gradient." (Yu, Lantao, et al.)
Stars: ✭ 502 (-14.04%)
DtaidistanceTime series distances: Dynamic Time Warping (DTW)
Stars: ✭ 499 (-14.55%)
LeakganThe codes of paper "Long Text Generation via Adversarial Training with Leaked Information" on AAAI 2018. Text generation using GAN and Hierarchical Reinforcement Learning.
Stars: ✭ 533 (-8.73%)
Xlnet PytorchSimple XLNet implementation with Pytorch Wrapper
Stars: ✭ 501 (-14.21%)
IowncodeA curated collection of iOS, ML, AR resources sprinkled with some UI additions
Stars: ✭ 499 (-14.55%)
Mycroft CoreMycroft Core, the Mycroft Artificial Intelligence platform.
Stars: ✭ 5,489 (+839.9%)
Ner LstmNamed Entity Recognition using multilayered bidirectional LSTM
Stars: ✭ 532 (-8.9%)
Ml paper notes📖 Notes and summaries of some Machine Learning / Computer Vision / NLP papers.
Stars: ✭ 496 (-15.07%)
DoccanoOpen source annotation tool for machine learning practitioners.
Stars: ✭ 5,600 (+858.9%)
Superpoint graphLarge-scale Point Cloud Semantic Segmentation with Superpoint Graphs
Stars: ✭ 533 (-8.73%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (-16.27%)
Neural Vqa❔ Visual Question Answering in Torch
Stars: ✭ 487 (-16.61%)
ClustergcnA PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" (KDD 2019).
Stars: ✭ 529 (-9.42%)
RnnlgRNNLG is an open source benchmark toolkit for Natural Language Generation (NLG) in spoken dialogue system application domains. It is released by Tsung-Hsien (Shawn) Wen from Cambridge Dialogue Systems Group under Apache License 2.0.
Stars: ✭ 487 (-16.61%)
Ml MiptOpen Machine Learning course at MIPT
Stars: ✭ 480 (-17.81%)
LopqTraining of Locally Optimized Product Quantization (LOPQ) models for approximate nearest neighbor search of high dimensional data in Python and Spark.
Stars: ✭ 530 (-9.25%)
Learn Data Science For FreeThis repositary is a combination of different resources lying scattered all over the internet. The reason for making such an repositary is to combine all the valuable resources in a sequential manner, so that it helps every beginners who are in a search of free and structured learning resource for Data Science. For Constant Updates Follow me in …
Stars: ✭ 4,757 (+714.55%)
Tensorflow BookAccompanying source code for Machine Learning with TensorFlow. Refer to the book for step-by-step explanations.
Stars: ✭ 4,448 (+661.64%)
CilantroA lean C++ library for working with point cloud data
Stars: ✭ 577 (-1.2%)
Fast abs rlCode for ACL 2018 paper: "Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting. Chen and Bansal"
Stars: ✭ 569 (-2.57%)
Hanlp中文分词 词性标注 命名实体识别 依存句法分析 成分句法分析 语义依存分析 语义角色标注 指代消解 风格转换 语义相似度 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁转换 自然语言处理
Stars: ✭ 24,626 (+4116.78%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (-9.42%)
StealthAn open source Ruby framework for text and voice chatbots. 🤖
Stars: ✭ 481 (-17.64%)
Textgan PytorchTextGAN is a PyTorch framework for Generative Adversarial Networks (GANs) based text generation models.
Stars: ✭ 479 (-17.98%)