WatcherWatcher - Open Source Cybersecurity Threat Hunting Platform. Developed with Django & React JS.
Stars: ✭ 324 (+268.18%)
clj-ducklingLanguage, engine, and tooling for expressing, testing, and evaluating composable language rules on input strings. (a duckling clojure fork)
Stars: ✭ 15 (-82.95%)
UndertheseaUnderthesea - Vietnamese NLP Toolkit
Stars: ✭ 823 (+835.23%)
EkphrasisEkphrasis is a text processing tool, geared towards text from social networks, such as Twitter or Facebook. Ekphrasis performs tokenization, word normalization, word segmentation (for splitting hashtags) and spell correction, using word statistics from 2 big corpora (english Wikipedia, twitter - 330mil english tweets).
Stars: ✭ 433 (+392.05%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-65.91%)
NatasPython 3 library for processing historical English
Stars: ✭ 28 (-68.18%)
Chatbot nerchatbot_ner: Named Entity Recognition for chatbots.
Stars: ✭ 273 (+210.23%)
TwitterldatopicmodelingUses topic modeling to identify context between follower relationships of Twitter users
Stars: ✭ 48 (-45.45%)
tweets-preprocessorRepo containing the Twitter preprocessor module, developed by the AUTH OSWinds team
Stars: ✭ 26 (-70.45%)
SudachiA Japanese Tokenizer for Business
Stars: ✭ 496 (+463.64%)
Nltk Book ResourceNotes and solutions to complement the official NLTK book
Stars: ✭ 54 (-38.64%)
Giveme5w1hExtraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?
Stars: ✭ 316 (+259.09%)
resume tailorAn unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
Stars: ✭ 15 (-82.95%)
Text Analytics With PythonLearn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+1186.36%)
WikiquizGenerates a quiz for a Wikipedia page using parts of speech and text chunking.
Stars: ✭ 778 (+784.09%)
summarize-webpageA small NLP SAAS project that summarize a webpage
Stars: ✭ 34 (-61.36%)
StocksightStock market analyzer and predictor using Elasticsearch, Twitter, News headlines and Python natural language processing and sentiment analysis
Stars: ✭ 1,037 (+1078.41%)
GitsuggestA tool to suggest github repositories based on the repositories you have shown interest in.
Stars: ✭ 636 (+622.73%)
KagomeSelf-contained Japanese Morphological Analyzer written in pure Go
Stars: ✭ 554 (+529.55%)
NLP ToolkitLibrary of state-of-the-art models (PyTorch) for NLP tasks
Stars: ✭ 92 (+4.55%)
Tika PythonTika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Stars: ✭ 997 (+1032.95%)
Spacy💫 Industrial-strength Natural Language Processing (NLP) in Python
Stars: ✭ 21,978 (+24875%)
Node OpennlpApache OpenNLP wrapper for Nodejs
Stars: ✭ 55 (-37.5%)
PynlplPyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl can be used for basic tasks such as the extraction of n-grams and frequency lists, and to build simple language model. There are also more complex data types and algorithms. Moreover, there are parsers for file formats common in NLP (e.g. FoLiA/Giza/Moses/ARPA/Timbl/CQL). There are also clients to interface with various NLP specific servers. PyNLPl most notably features a very extensive library for working with FoLiA XML (Format for Linguistic Annotation).
Stars: ✭ 426 (+384.09%)
Lingua👄 The most accurate natural language detection library for Java and the JVM, suitable for long and short text alike
Stars: ✭ 341 (+287.5%)
Farm🏡 Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
Stars: ✭ 1,140 (+1195.45%)
Contextualized Topic ModelsA python package to run contextualized topic modeling. CTMs combine BERT with topic models to get coherent topics. Also supports multilingual tasks. Cross-lingual Zero-shot model published at EACL 2021.
Stars: ✭ 318 (+261.36%)
Ryuzaki botSimple chatbot in Python using NLTK and scikit-learn
Stars: ✭ 28 (-68.18%)
Quick NlpPytorch NLP library based on FastAI
Stars: ✭ 279 (+217.05%)
NagisaA Japanese tokenizer based on recurrent neural networks
Stars: ✭ 260 (+195.45%)
Atr4sToolkit with state-of-the-art Automatic Terms Recognition methods in Scala
Stars: ✭ 23 (-73.86%)
NLP-toolsUseful python NLP tools (evaluation, GUI interface, tokenization)
Stars: ✭ 39 (-55.68%)
SimstringA Python implementation of the SimString, a simple and efficient algorithm for approximate string matching.
Stars: ✭ 79 (-10.23%)
billboard🎤 Lyrics/associated NLP data for Billboard's Top 100, 1950-2015.
Stars: ✭ 53 (-39.77%)
Rake NltkPython implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
Stars: ✭ 793 (+801.14%)
curso-IRIIntrodução à Recuperação de Informações
Stars: ✭ 14 (-84.09%)
classyclassy is a simple-to-use library for building high-performance Machine Learning models in NLP.
Stars: ✭ 61 (-30.68%)
KuromojiKuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Stars: ✭ 745 (+746.59%)
Giveme5WExtraction of the five journalistic W-questions (5W) from news articles
Stars: ✭ 16 (-81.82%)
ru punktRussian language support for NLTK's PunktSentenceTokenizer
Stars: ✭ 49 (-44.32%)
CltkThe Classical Language Toolkit
Stars: ✭ 650 (+638.64%)
PygermanetGermaNet API for Python
Stars: ✭ 42 (-52.27%)
JanomeJapanese morphological analysis engine written in pure Python
Stars: ✭ 630 (+615.91%)
Orange3 Text🍊 📄 Text Mining add-on for Orange3
Stars: ✭ 83 (-5.68%)
Python nlp tutorialThis repository provides everything to get started with Python for Text Mining / Natural Language Processing (NLP)
Stars: ✭ 72 (-18.18%)
TextblobSimple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
Stars: ✭ 7,991 (+8980.68%)
PythainlpThai Natural Language Processing in Python.
Stars: ✭ 582 (+561.36%)