text2textText2Text: Cross-lingual natural language processing and generation toolkit
Stars: ✭ 188 (+74.07%)
DefactonlpDeFactoNLP: An Automated Fact-checking System that uses Named Entity Recognition, TF-IDF vector comparison and Decomposable Attention models.
Stars: ✭ 30 (-72.22%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+352.78%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+235.19%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+389.81%)
GreynirThe greynir.is natural language processing website for Icelandic
Stars: ✭ 47 (-56.48%)
Sequence Semantic EmbeddingTools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+302.78%)
SoqalArabic Open Domain Question Answering System using Neural Reading Comprehension
Stars: ✭ 72 (-33.33%)
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-75.93%)
Elixir ScrapeScrape any website, article or RSS/Atom Feed with ease!
Stars: ✭ 306 (+183.33%)
TalismanStraightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+440.74%)
MovieboxMachine learning movie recommending system
Stars: ✭ 504 (+366.67%)
StringlifierStringlifier is on Opensource ML Library for detecting random strings in raw text. It can be used in sanitising logs, detecting accidentally exposed credentials and as a pre-processing step in unsupervised ML-based analysis of application text data.
Stars: ✭ 85 (-21.3%)
Telegram Scrapertelegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+316.67%)
Domain discovery toolThis repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (-69.44%)
Osi.igInformation Gathering Instagram.
Stars: ✭ 377 (+249.07%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-17.59%)
GetaltnameExtract subdomains from SSL certificates in HTTPS sites.
Stars: ✭ 320 (+196.3%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+682.41%)
PolyfuzzFuzzy string matching, grouping, and evaluation.
Stars: ✭ 292 (+170.37%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-41.67%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-76.85%)
TextminingPython文本挖掘系统 Research of Text Mining System
Stars: ✭ 268 (+148.15%)
ai-distilleryAutomatically modelling and distilling knowledge within AI. In other words, summarising the AI research firehose.
Stars: ✭ 20 (-81.48%)
FreediscoveryWeb Service for E-Discovery Analytics
Stars: ✭ 59 (-45.37%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+430.56%)
Pyndripyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (-21.3%)
Deep Semantic Similarity ModelMy Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+371.3%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-50%)
Cdqa⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (+362.96%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+325.93%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+3804.63%)
Textrank Keyword ExtractionKeyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and other techniques.
Stars: ✭ 79 (-26.85%)
Ip TracerTrack any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
Stars: ✭ 399 (+269.44%)
NprfNPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-71.3%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+247.22%)
SertSemantic Entity Retrieval Toolkit
Stars: ✭ 100 (-7.41%)
Nlp Projectsword2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Stars: ✭ 360 (+233.33%)
PkePython Keyphrase Extraction module
Stars: ✭ 855 (+691.67%)
ScreenfetchFetches system/theme information in terminal for Linux desktop screenshots.
Stars: ✭ 3,339 (+2991.67%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-34.26%)
NlpSelected Machine Learning algorithms for natural language processing and semantic analysis in Golang
Stars: ✭ 304 (+181.48%)
Date InfoAPI to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-93.52%)
AllrankallRank is a framework for training learning-to-rank neural models based on PyTorch.
Stars: ✭ 269 (+149.07%)
BitmagicBitMagic Library
Stars: ✭ 263 (+143.52%)
GeeseDBGraph Engine for Exploration and Search
Stars: ✭ 14 (-87.04%)
RelevancyfeedbackDice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-82.41%)
Ds2iA library of inverted index data structures
Stars: ✭ 104 (-3.7%)
FlexneuartFlexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (-8.33%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-20.37%)
GaanaapiUnofficial Gaana API
Stars: ✭ 59 (-45.37%)