OSCIOpen Source Contributor Index
Stars: ✭ 107 (-6.96%)
Machine-Learning-ModelsIn This repository I made some simple to complex methods in machine learning. Here I try to build template style code.
Stars: ✭ 30 (-73.91%)
kexKex is a python library for unsupervised keyword extraction from a document, providing an easy interface and benchmarks on 15 public datasets.
Stars: ✭ 46 (-60%)
scicle-stopclickbaitUserscript that changes Clickbait headlines by headlines more honest to the news it links to.
Stars: ✭ 16 (-86.09%)
cejaPySpark phonetic and string matching algorithms
Stars: ✭ 24 (-79.13%)
empythyAutomated NLP sentiment predictions- batteries included, or use your own data
Stars: ✭ 17 (-85.22%)
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (-55.65%)
NLP-Flask-WebsiteA simple Flask website for all NLP tasks which includes Text Preprocessing, Keyword Extraction, Text Summarization etc. Created Date: 30 Jan 2019
Stars: ✭ 43 (-62.61%)
vlainic.github.ioMy GitHub blog: things you might be interested, and probably not...
Stars: ✭ 26 (-77.39%)
brand-sentiment-analysisScripts utilizing Heartex platform to build brand sentiment analysis from the news
Stars: ✭ 21 (-81.74%)
TextFeatureSelectionPython library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (-63.48%)
Deception-Detection-on-Amazon-reviews-datasetA SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-63.48%)
CVAE DialCVAE_XGate model in paper "Xu, Dusek, Konstas, Rieser. Better Conversations by Modeling, Filtering, and Optimizing for Coherence and Diversity"
Stars: ✭ 16 (-86.09%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (-37.39%)
mlconjug3A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Stars: ✭ 47 (-59.13%)
fake-newsThis is a further development of the kdnuggets article on fake news classification by George McIntyre
Stars: ✭ 15 (-86.96%)
datalake-etl-pipelineSimplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validation, Column extensions, SQL functions, and DataFrame transformations
Stars: ✭ 39 (-66.09%)
lidtkLanguage Identification Toolkit
Stars: ✭ 17 (-85.22%)
soda-sparkSoda Spark is a PySpark library that helps you with testing your data in Spark Dataframes
Stars: ✭ 58 (-49.57%)
pytorch-translmAn implementation of transformer-based language model for sentence rewriting tasks such as summarization, simplification, and grammatical error correction.
Stars: ✭ 22 (-80.87%)
alter-nluNatural language understanding library for chatbots with intent recognition and entity extraction.
Stars: ✭ 45 (-60.87%)
Naive-Bayes-Evening-WorkshopCompanion code for Introduction to Python for Data Science: Coding the Naive Bayes Algorithm evening workshop
Stars: ✭ 23 (-80%)
Multi-Type-TD-TSRExtracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition:
Stars: ✭ 174 (+51.3%)
ake-datasetsLarge, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.
Stars: ✭ 125 (+8.7%)
anuvadaInterpretable Models for NLP using PyTorch
Stars: ✭ 102 (-11.3%)
vnlaCode accompanying the CVPR 2019 paper: https://arxiv.org/abs/1812.04155
Stars: ✭ 60 (-47.83%)
Quora QuestionPairs DLKaggle Competition: Using deep learning to solve quora's question pairs problem
Stars: ✭ 54 (-53.04%)
EngineThe Centrifuge process, filter and saves the relevant documents as recommendations to the relevant users
Stars: ✭ 20 (-82.61%)
elastic transformersMaking BERT stretchy. Semantic Elasticsearch with Sentence Transformers
Stars: ✭ 153 (+33.04%)
deep-semantic-code-searchDeep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search application
Stars: ✭ 63 (-45.22%)
DeepLearningReadingDeep Learning and Machine Learning mini-projects. Current Project: Deepmind Attentive Reader (rc-data)
Stars: ✭ 78 (-32.17%)
pyspark-ML-in-ColabPyspark in Google Colab: A simple machine learning (Linear Regression) model
Stars: ✭ 32 (-72.17%)
Entity EmbeddingReference implementation of the paper "Word Embeddings for Entity-annotated Texts"
Stars: ✭ 19 (-83.48%)
python mozetlETL jobs for Firefox Telemetry
Stars: ✭ 25 (-78.26%)
SumrizedAutomatic Text Summarization (English/Arabic).
Stars: ✭ 37 (-67.83%)
Quora question pairs NLP KaggleQuora Kaggle Competition : Natural Language Processing using word2vec embeddings, scikit-learn and xgboost for training
Stars: ✭ 17 (-85.22%)
oshinko-s2iThis is a place to put s2i images and utilities for spark application builders for openshift
Stars: ✭ 16 (-86.09%)
arabic-taggerAQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training
Stars: ✭ 38 (-66.96%)
Conditional-SeqGAN-TensorflowConditional Sequence Generative Adversarial Network trained with policy gradient, Implementation in Tensorflow
Stars: ✭ 47 (-59.13%)
lingvo--Ner-ruNamed entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-66.96%)
embeddingsEmbeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Stars: ✭ 27 (-76.52%)
Machine-learningThis repository will contain all the stuffs required for beginners in ML and DL do follow and star this repo for regular updates
Stars: ✭ 27 (-76.52%)
anovosAnovos - An Open Source Library for Scalable feature engineering Using Apache-Spark
Stars: ✭ 77 (-33.04%)