All Projects β†’ wayfair-incubator β†’ extra-model

wayfair-incubator / extra-model

Licence: MIT license
Code to run the ExtRA algorithm for unsupervised topic/aspect extraction on English texts.

Programming Languages

python
139335 projects - #7 most used programming language
shell
77523 projects
Dockerfile
14818 projects

Projects that are alternatives of or similar to extra-model

simple NER
simple rule based named entity recognition
Stars: ✭ 29 (-32.56%)
Mutual labels:  nlp-library, nlp-keywords-extraction
Handwritten-Digits-Classification-Using-KNN-Multiclass Perceptron-SVM
πŸ† A Comparative Study on Handwritten Digits Recognition using Classifiers like K-Nearest Neighbours (K-NN), Multiclass Perceptron/Artificial Neural Network (ANN) and Support Vector Machine (SVM) discussing the pros and cons of each algorithm and providing the comparison results in terms of accuracy and efficiecy of each algorithm.
Stars: ✭ 42 (-2.33%)
Mutual labels:  machine-learning-algorithms
Intention-Mining-
Intention Mining in Social Networking. It Mines Emotions and polarity for the given keyword . For the keyword it searchers the twitter for the comments and analyzes the results for various events such as Election results, Sports prediction Movie ratings, Breaking news events such as demonetisation and many more. Bayes , Maximum Entropy and Hidde…
Stars: ✭ 19 (-55.81%)
Mutual labels:  machine-learning-algorithms
Clustering-Python
Python Clustering Algorithms
Stars: ✭ 23 (-46.51%)
Mutual labels:  machine-learning-algorithms
OpenPrompt
An Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+4013.95%)
Mutual labels:  nlp-library
ExCon
ExCon: Explanation-driven Supervised Contrastive Learning
Stars: ✭ 17 (-60.47%)
Mutual labels:  machine-learning-algorithms
RapLyrics-Back
Model training, custom generative function and training for raplyrics.eu - A rap music lyrics generation project
Stars: ✭ 14 (-67.44%)
Mutual labels:  machine-learning-algorithms
tensorflow-rbm
Tensorflow implementation of the Restricted Boltzmann Machine
Stars: ✭ 308 (+616.28%)
Mutual labels:  machine-learning-algorithms
Scon-ABSA
[CIKM 2021] Enhancing Aspect-Based Sentiment Analysis with Supervised Contrastive Learning
Stars: ✭ 17 (-60.47%)
Mutual labels:  aspect-based-sentiment-analysis
Subjective-Answer-Evaluation
Subjective answer evaluation using machine learning. Give your answers at this link ->
Stars: ✭ 22 (-48.84%)
Mutual labels:  machine-learning-algorithms
NLP-Natural-Language-Processing
Projects and useful articles / links
Stars: ✭ 149 (+246.51%)
Mutual labels:  nlp-library
BoostSRL
BoostSRL: "Boosting for Statistical Relational Learning." A gradient-boosting based approach for learning different types of SRL models.
Stars: ✭ 31 (-27.91%)
Mutual labels:  machine-learning-algorithms
online-course-recommendation-system
Built on data from Pluralsight's course API fetched results. Works with model trained with K-means unsupervised clustering algorithm.
Stars: ✭ 31 (-27.91%)
Mutual labels:  machine-learning-algorithms
ugtm
ugtm: a Python package for Generative Topographic Mapping
Stars: ✭ 34 (-20.93%)
Mutual labels:  machine-learning-algorithms
bonsai-dt
Programmable Decision Tree Framework
Stars: ✭ 34 (-20.93%)
Mutual labels:  machine-learning-algorithms
random-fourier-features
Implementation of random Fourier features for kernel method, like support vector machine and Gaussian process model
Stars: ✭ 50 (+16.28%)
Mutual labels:  machine-learning-algorithms
minie
An open information extraction system that provides compact extractions
Stars: ✭ 83 (+93.02%)
Mutual labels:  nlp-library
PROSAC
PROSAC algorithm in python
Stars: ✭ 19 (-55.81%)
Mutual labels:  machine-learning-algorithms
perceptron
The simplest Perceptron you'll ever see
Stars: ✭ 45 (+4.65%)
Mutual labels:  machine-learning-algorithms
Self-Driving-Car
Implemented a Convolutional Neural Network for end-to-end driving in a simulator using Tensorflow and Keras. The project involves training over 13,000 images in a unity3d simulator to steer the car successfully throughout the track
Stars: ✭ 29 (-32.56%)
Mutual labels:  machine-learning-algorithms

codecov Code style: black Imports: isort GitHub license PyPI

Table of Contents

extra-model

Code to run the Extra algorithm for the unsupervised topic/aspect extraction on English texts.

Read the Official Documentation here

Quick start

IMPORTANT:

  1. When running Extra inside docker-container, make sure that Docker process has enough resources. For example, on Mac/Windows it should have at least 8 Gb of RAM available to it. Read More about RAM Requirements
  2. GitHub repo does not come with Glove Embeddings. See section Downloading Embeddings for how to download the required embeddings.

Using docker-compose

This is a preferred way to run extra-model. You can find instructions on how to run extra-model using CLI or as a Python package here

First, build the image:

docker-compose build

Then, run following command to make sure that extra-model was installed correctly:

docker-compose run test

Downloading Embeddings

Next step is to download the embeddings (we use Glove from Stanford in this project).

To download the required embeddings, run the following command:

docker-compose run --rm setup

The embeddings will be downloaded, unzipped and formatted into a space-efficient format. Files will be saved in the embeddings/ directory in the root of the project directory. If the process fails, it can be safely restarted. If you want to restart the process with new files, delete all files except README.md in the embeddings/ directory.

[Optional] Run docker-compose build again

After you've downloaded the embeddings, you may want to run docker-compose build again. This will build an image with embeddings already present inside the image.

The tradeoff here is that the image will be much bigger, but you won't spend ~2 minutes each time you run extra-model waiting for embeddings to be mounted into the container. On the other hand, building an image with embeddings in the context will increase build time from ~3 minutes to ~10 minutes.

Run extra-model

Finally, running extra-model is as simple as:

docker-compose run extra-model /package/tests/resources/100_comments.csv

NOTE: when using this approach, input file should be mounted inside the container. By default, everything from extra-model folder will be mounted to /package/ folder. This can be changed in docker-compose.yaml

This will produce a result.csv file in /io/ (default setting) folder.

There are multiple flags you can set to change input/outputs of extra. You can find them by running:

docker-compose run extra-model --help

Learn more

Our official documentation is the best place to continue learning about extra-model:

  1. Explanation of inputs/outputs
  2. Step-by-step workflow of what happens inside of extra-model
  3. Examples of how extra-model can be used in downstream applications
  4. Detailed explanation of how to run extra-model using different interfaces (via docker-compose, via CLI, as a Python package).

Authors

extra-model was written by [email protected], [email protected].

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].