ten-blue-links / Fxt

Licence: mit
A large scale feature extraction tool for text-based machine learning

Projects that are alternatives of or similar to Fxt

perke
A keyphrase extractor for Persian
Stars: ✭ 60 (+140%)
Mutual labels:  information-retrieval, text-processing
Dan Jurafsky Chris Manning Nlp
My solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (+396%)
Mutual labels:  information-retrieval, text-processing
Deep Semantic Similarity Model
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+1936%)
Mutual labels:  information-retrieval
Gohn
Hatena Notation (はてな記法) Parser written in Go
Stars: ✭ 17 (-32%)
Mutual labels:  text-processing
Efficientnet Pytorch
A PyTorch implementation of EfficientNet and EfficientNetV2 (coming soon!)
Stars: ✭ 6,685 (+26640%)
Mutual labels:  feature-extraction
Resin
Hardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+2016%)
Mutual labels:  information-retrieval
Pykaldi
A Python wrapper for Kaldi
Stars: ✭ 756 (+2924%)
Mutual labels:  feature-extraction
Machinelearnjs
Machine Learning library for the web and Node.
Stars: ✭ 498 (+1892%)
Mutual labels:  feature-extraction
Relevancyfeedback
Dice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-24%)
Mutual labels:  information-retrieval
Tsfresh
Automatic extraction of relevant features from time series:
Stars: ✭ 6,077 (+24208%)
Mutual labels:  feature-extraction
Speechpy
💬 SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/
Stars: ✭ 833 (+3232%)
Mutual labels:  feature-extraction
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+2236%)
Mutual labels:  information-retrieval
Feature Selection
Features selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (+2036%)
Mutual labels:  feature-extraction
Meyda
Audio feature extraction for JavaScript.
Stars: ✭ 792 (+3068%)
Mutual labels:  feature-extraction
Feature Engineering And Feature Selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Stars: ✭ 526 (+2004%)
Mutual labels:  feature-extraction
Chr
🔤 Lightweight R package for manipulating [string] characters
Stars: ✭ 18 (-28%)
Mutual labels:  text-processing
Cdqa
⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (+1900%)
Mutual labels:  information-retrieval
Anserini
A Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+2192%)
Mutual labels:  information-retrieval
Awesome Neural Models For Semantic Match
A curated list of papers dedicated to neural text (semantic) matching.
Stars: ✭ 669 (+2576%)
Mutual labels:  information-retrieval
Concise Ipython Notebooks For Deep Learning
Ipython Notebooks for solving problems like classification, segmentation, generation using latest Deep learning algorithms on different publicly available text and image data-sets.
Stars: ✭ 23 (-8%)
Mutual labels:  text-processing

Fxt

A large scale feature extraction tool for text-based machine learning.

Building from Source

  1. Make sure you have installed the dependencies:

    • A recent version of g++ or clang
    • GNU make
    • cmake 3.0 or later
    • git
  2. Clone the source with git:

    $ git clone https://github.com/ten-blue-links/fxt.git
    $ cd fxt
    
  1. Build and install:

    git submodule update --init --recursive --progress
    mkdir build
    cd build
    cmake ..
    make
    

Features

The toolkit offers a large number of text-based features that can be configured for extraction. See the feature list for the feature types and descriptions.

Documentation

For a quick tour see the quick start guide. Then refer to the main documentation for specific topics in more detail.

License

Fxt is distributed under the terms of the MIT license.

See LICENSE for details.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].