Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models

Stars: ✭ 42 (-63.48%)

Mutual labels: nlp-machine-learning

Very-deep-cnn-tensorflow

Very deep CNN for text classification

Stars: ✭ 18 (-84.35%)

Mutual labels: nlp-machine-learning

knime-textprocessing

KNIME - Text Processing Extension (Labs)

Stars: ✭ 17 (-85.22%)

Mutual labels: nlp-machine-learning

Deception-Detection-on-Amazon-reviews-dataset

A SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.

Stars: ✭ 42 (-63.48%)

Mutual labels: nlp-machine-learning

CVAE Dial

CVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity"

Stars: ✭ 16 (-86.09%)

Mutual labels: nlp-machine-learning

pyspark-algorithms

PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2

Stars: ✭ 72 (-37.39%)

Mutual labels: pyspark

mlconjug3

A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

Stars: ✭ 47 (-59.13%)

Mutual labels: nlp-machine-learning

topic modelling financial news

Topic modelling on financial news with Natural Language Processing

Stars: ✭ 51 (-55.65%)

Mutual labels: nlp-machine-learning

nlp classification workshop

NLP Classification Workshop

Stars: ✭ 22 (-80.87%)

Mutual labels: nlp-machine-learning

fake-news

This is a further development of the kdnuggets article on fake news classification by George McIntyre

Stars: ✭ 15 (-86.96%)

Mutual labels: nlp-machine-learning

datalake-etl-pipeline

Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations

Stars: ✭ 39 (-66.09%)

Mutual labels: pyspark

lidtk

Language Identification Toolkit

Stars: ✭ 17 (-85.22%)

Mutual labels: nlp-machine-learning

soda-spark

Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes

Stars: ✭ 58 (-49.57%)

Mutual labels: pyspark

pytorch-translm

An implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.

Stars: ✭ 22 (-80.87%)

Mutual labels: nlp-machine-learning

ShortText-Fasttext

ShortText classification

Stars: ✭ 12 (-89.57%)

Mutual labels: nlp-machine-learning

alter-nlu

Natural language understanding library for chatbots with intent recognition and entity extraction.

Stars: ✭ 45 (-60.87%)

Mutual labels: nlp-machine-learning

Naive-Bayes-Evening-Workshop

Companion code for Introduction to Python for Data Science: Coding the Naive Bayes Algorithm evening workshop

Stars: ✭ 23 (-80%)

Mutual labels: nlp-machine-learning

Question-Answering-based-on-SQuAD

Question Answering System using BiDAF Model on SQuAD v2.0

Stars: ✭ 20 (-82.61%)

Mutual labels: nlp-machine-learning

Multi-Type-TD-TSR

Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:

Stars: ✭ 174 (+51.3%)

Mutual labels: nlp-machine-learning

ake-datasets

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

Stars: ✭ 125 (+8.7%)

Mutual labels: nlp-machine-learning

anuvada

Interpretable Models for NLP using PyTorch

Stars: ✭ 102 (-11.3%)

Mutual labels: nlp-machine-learning

vnla

Code accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155

Stars: ✭ 60 (-47.83%)

Mutual labels: nlp-machine-learning

Quora QuestionPairs DL

Kaggle Competition: Using deep learning to solve quora's question pairs problem

Stars: ✭ 54 (-53.04%)

Mutual labels: nlp-machine-learning

RadiologyReportEmbedding

Intelligent Word Embeddings of Free-Text Radiology Reports

Stars: ✭ 22 (-80.87%)

Mutual labels: nlp-machine-learning

Engine

The Centrifuge process, filter and saves the relevant documents as recommendations to the relevant users

Stars: ✭ 20 (-82.61%)

Mutual labels: nlp-machine-learning

elastic transformers

Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers

Stars: ✭ 153 (+33.04%)

Mutual labels: nlp-machine-learning

deep-semantic-code-search

Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search application

Stars: ✭ 63 (-45.22%)

Mutual labels: nlp-machine-learning

pyspark-k8s-boilerplate

Boilerplate for PySpark on Cloud Kubernetes

Stars: ✭ 24 (-79.13%)

Mutual labels: pyspark

DeepLearningReading

Deep Learning and Machine Learning mini-projects. Current Project: Deepmind Attentive Reader (rc-data)

Stars: ✭ 78 (-32.17%)

Mutual labels: nlp-machine-learning

pyspark-ML-in-Colab

Pyspark in Google Colab: A simple machine learning (Linear Regression) model

Stars: ✭ 32 (-72.17%)

Mutual labels: pyspark

Entity Embedding

Reference implementation of the paper "Word Embeddings for Entity-annotated Texts"

Stars: ✭ 19 (-83.48%)

Mutual labels: nlp-machine-learning

python mozetl

ETL jobs for Firefox Telemetry

Stars: ✭ 25 (-78.26%)

Mutual labels: pyspark

kafka-twitter-spark-streaming

Counting Tweets Per User in Real-Time

Stars: ✭ 38 (-66.96%)

Mutual labels: pyspark

pyspark-for-data-processing

Code for my presentation: Using PySpark to Process Boat Loads of Data

Stars: ✭ 20 (-82.61%)

Mutual labels: pyspark

Sumrized

Automatic Text Summarization (English/Arabic).

Stars: ✭ 37 (-67.83%)

Mutual labels: nlp-machine-learning

Quora question pairs NLP Kaggle

Quora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training

Stars: ✭ 17 (-85.22%)

Mutual labels: nlp-machine-learning

spark-twitter-sentiment-analysis

Sentiment Analysis of a Twitter Topic with Spark Structured Streaming

Stars: ✭ 55 (-52.17%)

Mutual labels: pyspark

oshinko-s2i

This is a place to put s2i images and utilities for spark application builders for openshift

Stars: ✭ 16 (-86.09%)

Mutual labels: pyspark

learn-by-examples

Real-world Spark pipelines examples

Stars: ✭ 84 (-26.96%)

Mutual labels: pyspark

arabic-tagger

AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training

Stars: ✭ 38 (-66.96%)

Mutual labels: nlp-machine-learning

Conditional-SeqGAN-Tensorflow

Conditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow

Stars: ✭ 47 (-59.13%)

Mutual labels: nlp-machine-learning

lingvo--Ner-ru

Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке

Stars: ✭ 38 (-66.96%)

Mutual labels: nlp-machine-learning

text-preprocess-python

Text preprocessing tools in python.

Stars: ✭ 22 (-80.87%)

Mutual labels: nlp-machine-learning

embeddings

Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language

Stars: ✭ 27 (-76.52%)

Mutual labels: nlp-machine-learning

Machine-learning

This repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates

Stars: ✭ 27 (-76.52%)

Mutual labels: nlp-machine-learning

anovos

Anovos - An Open Source Library for Scalable feature engineering Using Apache-Spark

Stars: ✭ 77 (-33.04%)

Mutual labels: pyspark

1-60 of 264 similar projects

›