508 open source projects that are alternatives to or similar to Wordtokenizers.jl

Easyocr
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
Stars: ✭ 13,379 (+21136.51%)
perke
A keyphrase extractor for Persian
Stars: ✭ 60 (-4.76%)
AILA-Artificial-Intelligence-for-Legal-Assistance
Python implementations of the various methods used in the FIRE 2019 conference.
Stars: ✭ 39 (-38.1%)
bookworm
📚 social networks from novels
Stars: ✭ 72 (+14.29%)
Gensim
Topic Modelling for Humans
Stars: ✭ 12,763 (+20158.73%)
ml-nlp-services
Machine learning, deep learning, natural language processing
Stars: ✭ 23 (-63.49%)
Rmdl
RMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+495.24%)
Drl4nlp.scratchpad
Notes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-58.73%)
Mutual labels:  information-retrieval
Metasra Pipeline
MetaSRA: normalized sample-specific metadata for the Sequence Read Archive
Stars: ✭ 33 (-47.62%)
Mutual labels:  data-mining
Mico
Mico ("Monkey" in Catalan). An implementation of the Monkey language in C++. https://interpreterbook.com/
Stars: ✭ 19 (-69.84%)
Mutual labels:  lexer
Pyclustering
pyclustering is a Python and C++ data mining library.
Stars: ✭ 806 (+1179.37%)
Mutual labels:  data-mining
Date Info
An API that lets users fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-88.89%)
Mutual labels:  information-retrieval
Mldm
Lecture course "Machine Learning and Data Mining" at the Faculty of Computational Mathematics and Cybernetics (CMC), Lomonosov Moscow State University
Stars: ✭ 35 (-44.44%)
Mutual labels:  data-mining
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-65.08%)
Mutual labels:  data-mining
Scdv
Text classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-14.29%)
Mutual labels:  information-retrieval
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-71.43%)
Mutual labels:  data-mining
Nprf
NPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-50.79%)
Mutual labels:  information-retrieval
Ail Framework
AIL framework - Analysis Information Leak framework
Stars: ✭ 1,091 (+1631.75%)
Mutual labels:  data-mining
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+1017.46%)
Mutual labels:  data-mining
Cgnn
Crystal Graph Neural Networks
Stars: ✭ 48 (-23.81%)
Mutual labels:  data-mining
Subdue
The Subdue graph miner discovers highly-compressing patterns in an input graph.
Stars: ✭ 20 (-68.25%)
Mutual labels:  data-mining
Dataproofer
A proofreader for your data
Stars: ✭ 628 (+896.83%)
Mutual labels:  data-mining
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (+873.02%)
Mutual labels:  data-mining
En Data mining
Data Mining Historical Newspaper Metadata (METS/ALTO formats)
Stars: ✭ 14 (-77.78%)
Mutual labels:  data-mining
Talisman
Straightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+826.98%)
Mutual labels:  information-retrieval
Data Science With Ruby
Practical Data Science with Ruby based tools.
Stars: ✭ 549 (+771.43%)
Mutual labels:  data-mining
Awesome Fraud Detection Papers
A curated list of data mining papers about fraud detection.
Stars: ✭ 843 (+1238.1%)
Mutual labels:  data-mining
Helioml
A book about machine learning, statistics, and data mining for heliophysics
Stars: ✭ 36 (-42.86%)
Mutual labels:  data-mining
Twitter Get Old Tweets Scraper
A data scraper for retrieving old tweets from Twitter using Python 3.
Stars: ✭ 27 (-57.14%)
Mutual labels:  data-mining
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+1607.94%)
Mutual labels:  data-mining
Fxt
A large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-60.32%)
Mutual labels:  information-retrieval
Drugs Recommendation Using Reviews
Analyzes drug descriptions, conditions, and reviews, then recommends drugs for each patient health condition using deep learning models.
Stars: ✭ 35 (-44.44%)
Mutual labels:  data-mining
Snl Compiler
SNL (Small Nested Language) Compiler. Maven, jUnit, Tokenizer, Lexer, Syntax Parser. Compiler principles, lexical analysis, syntax analysis.
Stars: ✭ 19 (-69.84%)
Mutual labels:  lexer
Gendis
Contains an implementation (sklearn API) of the algorithm proposed in "GENDIS: GEnetic DIscovery of Shapelets" and code to reproduce all experiments.
Stars: ✭ 59 (-6.35%)
Mutual labels:  data-mining
Relevancyfeedback
Dice.com's relevancy feedback Solr plugin, created by Simon Hughes (Dice). Contains request handlers for MLT-style recommendations, conceptual search, semantic search, and personalized search.
Stars: ✭ 19 (-69.84%)
Mutual labels:  information-retrieval
Domain discovery tool
This repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (-47.62%)
Mutual labels:  information-retrieval
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-71.43%)
Mutual labels:  data-mining
Php Ml
PHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+12439.68%)
Mutual labels:  data-mining
Stocktalk
Data collection tool for social media analytics
Stars: ✭ 765 (+1114.29%)
Mutual labels:  data-mining
Invoice2data
Extract structured data from PDF invoices
Stars: ✭ 943 (+1396.83%)
Mutual labels:  data-mining
Awesome Neural Models For Semantic Match
A curated list of papers dedicated to neural text (semantic) matching.
Stars: ✭ 669 (+961.9%)
Mutual labels:  information-retrieval
Gaanaapi
Unofficial Gaana API
Stars: ✭ 59 (-6.35%)
Mutual labels:  information-retrieval
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (+887.3%)
Mutual labels:  data-mining
Clevercsv
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+1307.94%)
Mutual labels:  data-mining
Research
novel deep learning research works with PaddlePaddle
Stars: ✭ 609 (+866.67%)
Mutual labels:  data-mining
Tadw
An implementation of "Network Representation Learning with Rich Text Information" (IJCAI '15).
Stars: ✭ 43 (-31.75%)
Mutual labels:  data-mining
Anserini
A Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+809.52%)
Mutual labels:  information-retrieval
Data mining
The Ruby DataMining Gem is a small collection of several data mining algorithms.
Stars: ✭ 10 (-84.13%)
Mutual labels:  data-mining
Bert Vietnamese Question Answering
Vietnamese question answering system with BERT
Stars: ✭ 57 (-9.52%)
Mutual labels:  information-retrieval
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+758.73%)
Mutual labels:  data-mining
Logos
Create ridiculously fast Lexers
Stars: ✭ 1,001 (+1488.89%)
Mutual labels:  lexer
Pke
Python Keyphrase Extraction module
Stars: ✭ 855 (+1257.14%)
Mutual labels:  information-retrieval
Interpretable machine learning with python
Examples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (+741.27%)
Mutual labels:  data-mining
Resin
Hardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+739.68%)
Mutual labels:  information-retrieval
Vectorbt
Ultimate Python library for time series analysis and backtesting at scale
Stars: ✭ 855 (+1257.14%)
Mutual labels:  data-mining
Feature Engineering And Feature Selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Stars: ✭ 526 (+734.92%)
Mutual labels:  data-mining
Libfsm
DFA regular expression library & friends
Stars: ✭ 512 (+712.7%)
Mutual labels:  lexer
Tox
misc parsers in rust
Stars: ✭ 40 (-36.51%)
Mutual labels:  lexer
Dataflowjavasdk
Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1255.56%)
Mutual labels:  data-mining
Deep Semantic Similarity Model
My Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+707.94%)
Mutual labels:  information-retrieval