All Projects → SekouDiaoNlp → mlconjug3

SekouDiaoNlp / mlconjug3

Licence: MIT license
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to mlconjug3

verbecc
Complete Conjugation of any Verb using Machine Learning for French, Spanish, Portuguese, Italian and Romanian
Stars: ✭ 45 (-4.26%)
Mutual labels:  linguistics, conjugation, conjugator
Nuts
自然语言处理常见任务(主要包括文本分类,序列标注,自动问答等)解决方案试验田
Stars: ✭ 21 (-55.32%)
Mutual labels:  nlp-library, nlp-machine-learning
NLP-Natural-Language-Processing
Projects and useful articles / links
Stars: ✭ 149 (+217.02%)
Mutual labels:  nlp-library, nlp-machine-learning
Tika Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+2021.28%)
Mutual labels:  nlp-library, nlp-machine-learning
Contextualized Topic Models
A python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+576.6%)
Mutual labels:  nlp-library, nlp-machine-learning
OpenPrompt
An Open-Source Framework for Prompt-Learning.
Stars: ✭ 1,769 (+3663.83%)
Mutual labels:  nlp-library, nlp-machine-learning
Lingua
👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+625.53%)
Mutual labels:  nlp-library, nlp-machine-learning
Pynlpl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+806.38%)
Mutual labels:  linguistics, nlp-library
lingvo--Ner-ru
Named entity recognition (NER) in Russian texts / Определение именованных сущностей (NER) в тексте на русском языке
Stars: ✭ 38 (-19.15%)
Mutual labels:  linguistics, nlp-machine-learning
Nlp profiler
A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Stars: ✭ 181 (+285.11%)
Mutual labels:  nlp-library, nlp-machine-learning
TextFeatureSelection
Python library for feature selection for text features. It has filter method, genetic algorithm and TextFeatureSelectionEnsemble for improving text classification models. Helps improve your machine learning models
Stars: ✭ 42 (-10.64%)
Mutual labels:  nlp-library, nlp-machine-learning
schrutepy
The Entire Transcript from the Office in Tidy Format
Stars: ✭ 22 (-53.19%)
Mutual labels:  nlp-library, nlp-machine-learning
Lingo
package lingo provides the data structures and algorithms required for natural language processing
Stars: ✭ 113 (+140.43%)
Mutual labels:  nlp-library, nlp-machine-learning
pfootprint
Political Discourse Analysis Using Pre-Trained Word Vectors.
Stars: ✭ 20 (-57.45%)
Mutual labels:  linguistics, nlp-machine-learning
empythy
Automated NLP sentiment predictions- batteries included, or use your own data
Stars: ✭ 17 (-63.83%)
Mutual labels:  nlp-library, nlp-machine-learning
react-native-unit-tests
Example how to test React Native components
Stars: ✭ 79 (+68.09%)
Mutual labels:  test-driven-development
KoParadigm
KoParadigm: Korean Inflectional Paradigm Generator
Stars: ✭ 48 (+2.13%)
Mutual labels:  linguistics
anuvada
Interpretable Models for NLP using PyTorch
Stars: ✭ 102 (+117.02%)
Mutual labels:  nlp-machine-learning
PixelTest
Fast, modern, simple iOS snapshot testing written purely in Swift.
Stars: ✭ 56 (+19.15%)
Mutual labels:  test-driven-development
langua
A suite of language tools
Stars: ✭ 29 (-38.3%)
Mutual labels:  linguistics
mlconjug3 PyPi Home Page

MLCONJUG3

Package Maintenance Status Package Maintener Build status on Windows, MacOs and Linux Pypi Python Package Index Status Anaconda Package Index Status Compatible Python versions Supported platforms Documentation Status Dependencies status Code Coverage Status Code Vulnerability Status PyPI Downloads Conda
A Python library to conjugate verbs in French, English, Spanish, Italian, Portuguese and Romanian (more soon) using Machine Learning techniques.
Any verb in one of the supported language can be conjugated, as the module contains a Machine Learning model of how the verbs behave.
Even completely new or made-up verbs can be successfully conjugated in this manner.
The supplied pre-trained models are composed of:
  • a binary feature extractor,
  • a feature selector using Linear Support Vector Classification,
  • a classifier using Stochastic Gradient Descent.
MLConjug3 uses scikit-learn to implement the Machine Learning algorithms.
Users of the library can use any compatible classifiers from scikit-learn to modify and retrain the models.
The training data for the french model is based on Verbiste https://perso.b2b2c.ca/~sarrazip/dev/verbiste.html .
The training data for English, Spanish, Italian, Portuguese and Romanian was generated using unsupervised learning techniques using the French model as a model to query during the training.

Warning

MLCONJUG3 now only supports Python 3.x as Python 2.x has been deprecated in 2020.

Supported Languages

  • French
  • English
  • Spanish
  • Italian
  • Portuguese
  • Romanian

Features

  • Easy to use API.
  • Includes pre-trained models with 99% + accuracy in predicting conjugation class of unknown verbs.
  • Easily train new models or add new languages.
  • Easily integrate MLConjug in your own projects.
  • Can be used as a command line tool.

Academic publications citing mlconjug

Software projects using mlconjug

  • Translation flask API for the Helsinki NLP models available in the Huggingface Transformers library.
  • NLP Suite is a package of tools designed for non-specialists, for scholars with no knowledge or little knowledge of Natural Language Processing.
  • This project offers tools to visualize the gender bias in pre-trained language models to better understand the prejudices in the data.
  • This project uses language models to generate text that is well suited to the type of publication.
  • Dockerized microservice with REST API for conjugation of any verb in French and Spanish.
  • A tool to Manage and tansform HTML documents.
  • A Tux bot.
  • Tweets the words of the French language. Largely inspired by the @botducul (identical lexicon, but code in Python) and the @botsupervnr.
    Posts on @botduslip. Stores the position of the last tweeted word in a Redis database.
  • This project offers a tool to help learn differnt verbal forms.
  • A collection of common NLP tasks such as dataset parsing and explicit semantic extraction.
  • This project offers a model which recognizes covid-19 masks.
  • Need an excuse for why you can't show up in your Zoom lectures? Just generate one here!
  • Repository to store Natural Language Processing models.
  • This is a simple virtual assistant. With it, you can search the Internet, access websites, open programs, and more using just your voice.
    This virtual assistant supports the English and Portuguese languages and has many settings that you can adjust to your liking.
  • This python module responds to yes or no questions. It dishes out its advice at random.
    Disclaimer: Do not actually act on this advice ;)
  • Python+Flask web app that uses mlconjug to dynamically generate foreign language conjugation questions.
  • A dwarf-fortress adventure mode-inspired rogue-like Pygame Python3 game.
  • A WebApp to learn Spanish.
  • Application for German-French vocabulary with simple GUI.

BibTeX

If you want to cite mlconjug3 in an academic publication use this citation format:

@article{mlconjug3,
  title={mlconjug3},
  author={Sekou Diao},
  journal={GitHub. Note: https://github.com/SekouDiaoNlp/mlconjug3 Cited by},
  year={2021}
}

Credits

This package was created with the help of Verbiste and scikit-learn.

The logo was designed by Zuur.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].