Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+791.34%)
converseConversational text Analysis using various NLP techniques
Stars: ✭ 147 (+15.75%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-76.38%)
resume tailorAn unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
Stars: ✭ 15 (-88.19%)
Practical Machine Learning With PythonMaster the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+1370.87%)
Adam qasADAM - A Question Answering System. Inspired from IBM Watson
Stars: ✭ 330 (+159.84%)
nlpbuddyA text analysis application for performing common NLP tasks through a web dashboard interface and an API
Stars: ✭ 115 (-9.45%)
backpropBackprop makes it simple to use, finetune, and deploy state-of-the-art ML models.
Stars: ✭ 229 (+80.31%)
Crime AnalysisAssociation Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-84.25%)
TwitterldatopicmodelingUses topic modeling to identify context between follower relationships of Twitter users
Stars: ✭ 48 (-62.2%)
CltkThe Classical Language Toolkit
Stars: ✭ 650 (+411.81%)
Sense2vec🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+832.28%)
nlp-cheat-sheet-pythonNLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition
Stars: ✭ 69 (-45.67%)
wechselCode for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Stars: ✭ 39 (-69.29%)
Spacy Transformers🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
Stars: ✭ 919 (+623.62%)
oreilly-bert-nlpThis repository contains code for the O'Reilly Live Online Training for BERT
Stars: ✭ 19 (-85.04%)
Ryuzaki botSimple chatbot in Python using NLTK and scikit-learn
Stars: ✭ 28 (-77.95%)
tweets-preprocessorRepo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
Stars: ✭ 26 (-79.53%)
Arch-Data-ScienceArchlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-27.56%)
anonymisationAnonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (-33.07%)
ParsBigBirdPersian Bert For Long-Range Sequences
Stars: ✭ 58 (-54.33%)
adaptAwesome Domain Adaptation Python Toolbox
Stars: ✭ 46 (-63.78%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+2584.25%)
AutogluonAutoGluon: AutoML for Text, Image, and Tabular Data
Stars: ✭ 3,920 (+2986.61%)
udacity-cvnd-projectsMy solutions to the projects assigned for the Udacity Computer Vision Nanodegree
Stars: ✭ 36 (-71.65%)
Doc2vec📓 Long(er) text representation and classification using Doc2Vec embeddings
Stars: ✭ 92 (-27.56%)
Bert Sklearna sklearn wrapper for Google's BERT model
Stars: ✭ 182 (+43.31%)
ShallowlearnAn experiment about re-implementing supervised learning models based on shallow neural network approaches (e.g. fastText) with some additional exclusive features and nice API. Written in Python and fully compatible with Scikit-learn.
Stars: ✭ 196 (+54.33%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-43.31%)
Text ClassificationMachine Learning and NLP: Text Classification using python, scikit-learn and NLTK
Stars: ✭ 239 (+88.19%)
pygramsExtracts key terminology (n-grams) from any large collection of documents (>1000) and forecasts emergence
Stars: ✭ 52 (-59.06%)
Skin Lesions Classification DCNNsTransfer Learning with DCNNs (DenseNet, Inception V3, Inception-ResNet V2, VGG16) for skin lesions classification
Stars: ✭ 47 (-62.99%)
cups-rlCustomisable Unified Physical Simulations (CUPS) for Reinforcement Learning. Experiments run on the ai2thor environment (http://ai2thor.allenai.org/) e.g. using A3C, RainbowDQN and A3C_GA (Gated Attention multi-modal fusion) for Task-Oriented Language Grounding (tasks specified by natural language instructions) e.g. "Pick up the Cup or else"
Stars: ✭ 38 (-70.08%)
website-fingerprintingDeanonymizing Tor or VPN users with website fingerprinting and machine learning.
Stars: ✭ 59 (-53.54%)
uniformer-pytorchImplementation of Uniformer, a simple attention and 3d convolutional net that achieved SOTA in a number of video classification tasks, debuted in ICLR 2022
Stars: ✭ 90 (-29.13%)
image-background-remove-tool✂️ Automated high-quality background removal framework for an image using neural networks. ✂️
Stars: ✭ 767 (+503.94%)
doc2vec-apidocument embedding and machine learning script for beginners
Stars: ✭ 92 (-27.56%)
Basic-UI-for-GPT-J-6B-with-low-vramA repository to run gpt-j-6b on low vram machines (4.2 gb minimum vram for 2000 token context, 3.5 gb for 1000 token context). Model loading takes 12gb free ram.
Stars: ✭ 90 (-29.13%)
Transformer-in-PyTorchTransformer/Transformer-XL/R-Transformer examples and explanations
Stars: ✭ 21 (-83.46%)
ml webappExplore machine learning models. Leveraging scikit-learn's models and exposing their behaviour through API
Stars: ✭ 29 (-77.17%)
Land-Cover-Classification-using-Sentinel-2-DatasetApplication of deep learning on Satellite Imagery of Sentinel-2 satellite that move around the earth from June, 2015. This image patches can be trained and classified using transfer learning techniques.
Stars: ✭ 36 (-71.65%)
sign2textReal-time AI-powered translation of American sign language to text
Stars: ✭ 132 (+3.94%)
namebotA company/project name generator for Python. Uses NLTK and diverse techniques derived from existing corporate etymologies and naming agencies for sophisticated word generation and ideation.
Stars: ✭ 44 (-65.35%)
TextClassification基于scikit-learn实现对新浪新闻的文本分类,数据集为100w篇文档,总计10类,测试集与训练集1:1划分。分类算法采用SVM和Bayes,其中Bayes作为baseline。
Stars: ✭ 86 (-32.28%)
ASD-ML-APIThis project has 3 goals: To find out the best machine learning pipeline for predicting ASD cases using genetic algorithms, via the TPOT library. (Classification Problem) Compare the accuracy of the accuracy of the determined pipeline, with a standard Naive-Bayes classifier. Saving the classifier as an external file, and use this file in a Flask…
Stars: ✭ 14 (-88.98%)
Deception-Detection-on-Amazon-reviews-datasetA SVM model that classifies the reviews as real or fake. Used both the review text and the additional features contained in the data set to build a model that predicted with over 85% accuracy without using any deep learning techniques.
Stars: ✭ 42 (-66.93%)
NLP QuickbookNLP in Python with Deep Learning
Stars: ✭ 516 (+306.3%)
emoji-prediction🤓🔮🔬 Emoji prediction from a text using machine learning
Stars: ✭ 41 (-67.72%)
pycobrapython library implementing ensemble methods for regression, classification and visualisation tools including Voronoi tesselations.
Stars: ✭ 111 (-12.6%)
jax-modelsUnofficial JAX implementations of deep learning research papers
Stars: ✭ 108 (-14.96%)