PkePython Keyphrase Extraction module
Stars: ✭ 855 (+359.68%)
AnseriniA Lucene toolkit for replicable information retrieval research
Stars: ✭ 573 (+208.06%)
Ds2iA library of inverted index data structures
Stars: ✭ 104 (-44.09%)
Sequence Semantic EmbeddingTools and recipes to train deep learning models and build services for NLP tasks such as text classification, semantic search ranking and recall fetching, cross-lingual information retrieval, and question answering etc.
Stars: ✭ 435 (+133.87%)
Haystack🔍 Haystack is an open source NLP framework that leverages Transformer models. It enables developers to implement production-ready neural search, question answering, semantic document search and summarization for a wide range of applications.
Stars: ✭ 3,409 (+1732.8%)
FxtA large scale feature extraction tool for text-based machine learning
Stars: ✭ 25 (-86.56%)
Tutorial Utilizing KgResources for Tutorial on "Utilizing Knowledge Graphs in Text-centric Information Retrieval"
Stars: ✭ 148 (-20.43%)
PisaPISA: Performant Indexes and Search for Academia
Stars: ✭ 489 (+162.9%)
ForteForte is a flexible and powerful NLP builder FOR TExt. This is part of the CASL project: http://casl-project.ai/
Stars: ✭ 89 (-52.15%)
GaanaapiUnofficial Gaana API
Stars: ✭ 59 (-68.28%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+94.62%)
Dan Jurafsky Chris Manning NlpMy solution to the Natural Language Processing course made by Dan Jurafsky, Chris Manning in Winter 2012.
Stars: ✭ 124 (-33.33%)
Domain discovery toolThis repository contains the Domain Discovery Tool (DDT) project. DDT is an interactive system that helps users explore and better understand a domain (or topic) as it is represented on the Web.
Stars: ✭ 33 (-82.26%)
Date InfoAPI to let user fetch the events that happen(ed) on a specific date
Stars: ✭ 7 (-96.24%)
Pytrec evalpytrec_eval is an Information Retrieval evaluation tool for Python, based on the popular trec_eval.
Stars: ✭ 114 (-38.71%)
BooksBooks worth spreading
Stars: ✭ 161 (-13.44%)
Deep Semantic Similarity ModelMy Keras implementation of the Deep Semantic Similarity Model (DSSM)/Convolutional Latent Semantic Model (CLSM) described here: http://research.microsoft.com/pubs/226585/cikm2014_cdssm_final.pdf.
Stars: ✭ 509 (+173.66%)
FlexneuartFlexible classic and NeurAl Retrieval Toolkit
Stars: ✭ 99 (-46.77%)
Telegram Scrapertelegram group scraper tool. fetch all information about group members
Stars: ✭ 450 (+141.94%)
Osi.igInformation Gathering Instagram.
Stars: ✭ 377 (+102.69%)
Pyndripyndri is a Python interface to the Indri search engine.
Stars: ✭ 85 (-54.3%)
Wordtokenizers.jlHigh performance tokenizers for natural language processing and other related tasks
Stars: ✭ 63 (-66.13%)
Nlp Projectsword2vec, sentence2vec, machine reading comprehension, dialog system, text classification, pretrained language model (i.e., XLNet, BERT, ELMo, GPT), sequence labeling, information retrieval, information extraction (i.e., entity, relation and event extraction), knowledge graph, text generation, network embedding
Stars: ✭ 360 (+93.55%)
FoundryThe Cognitive Foundry is an open-source Java library for building intelligent systems using machine learning
Stars: ✭ 124 (-33.33%)
FreediscoveryWeb Service for E-Discovery Analytics
Stars: ✭ 59 (-68.28%)
GensimTopic Modelling for Humans
Stars: ✭ 12,763 (+6761.83%)
ScdvText classification with Sparse Composite Document Vectors.
Stars: ✭ 54 (-70.97%)
NprfNPRF: A Neural Pseudo Relevance Feedback Framework for Ad-hoc Information Retrieval
Stars: ✭ 31 (-83.33%)
RankingLearning to Rank in TensorFlow
Stars: ✭ 2,362 (+1169.89%)
Knowledge GraphsA collection of research on knowledge graphs
Stars: ✭ 845 (+354.3%)
Scilla🏴☠️ Information Gathering tool 🏴☠️ DNS / Subdomains / Ports / Directories enumeration
Stars: ✭ 116 (-37.63%)
Drl4nlp.scratchpadNotes on Deep Reinforcement Learning for Natural Language Processing papers
Stars: ✭ 26 (-86.02%)
PyseriniPython interface to the Anserini IR toolkit built on Lucene
Stars: ✭ 148 (-20.43%)
RelevancyfeedbackDice.com's relevancy feedback solr plugin created by Simon Hughes (Dice). Contains request handlers for doing MLT style recommendations, conceptual search, semantic search and personalized search
Stars: ✭ 19 (-89.78%)
VtextSimple NLP in Rust with Python bindings
Stars: ✭ 108 (-41.94%)
TalismanStraightforward fuzzy matching, information retrieval and NLP building blocks for JavaScript.
Stars: ✭ 584 (+213.98%)
NeuralqaNeuralQA: A Usable Library for Question Answering on Large Datasets with BERT
Stars: ✭ 185 (-0.54%)
ResinHardware-accelerated vector-based search engine. Available as a HTTP service or as an embedded library.
Stars: ✭ 529 (+184.41%)
SertSemantic Entity Retrieval Toolkit
Stars: ✭ 100 (-46.24%)
Cdqa⛔ [NOT MAINTAINED] An End-To-End Closed Domain Question Answering System.
Stars: ✭ 500 (+168.82%)
InvoicenetDeep neural network to extract intelligent information from invoice documents.
Stars: ✭ 1,886 (+913.98%)
Awesome Persian Nlp IrCurated List of Persian Natural Language Processing and Information Retrieval Tools and Resources
Stars: ✭ 460 (+147.31%)
Lucene SolrApache Lucene and Solr open-source search software
Stars: ✭ 4,217 (+2167.2%)
Bm25A Python implementation of the BM25 ranking function.
Stars: ✭ 159 (-14.52%)
Ip TracerTrack any ip address with IP-Tracer. IP-Tracer is developed for Linux and Termux. you can retrieve any ip address information using IP-Tracer.
Stars: ✭ 399 (+114.52%)
SolrpluginsDice Solr Plugins from Simon Hughes Dice.com
Stars: ✭ 86 (-53.76%)
RmdlRMDL: Random Multimodel Deep Learning for Classification
Stars: ✭ 375 (+101.61%)
EasyocrReady-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Stars: ✭ 13,379 (+7093.01%)
Textrank Keyword ExtractionKeyword extraction using TextRank algorithm after pre-processing the text with lemmatization, filtering unwanted parts-of-speech and other techniques.
Stars: ✭ 79 (-57.53%)
Vec4irWord Embeddings for Information Retrieval
Stars: ✭ 188 (+1.08%)
K NrmK-NRM: End-to-End Neural Ad-hoc Ranking with Kernel Pooling
Stars: ✭ 183 (-1.61%)
Sf1r LiteSearch Formula-1——A distributed high performance massive data engine for enterprise/vertical search
Stars: ✭ 158 (-15.05%)
Rated Ranking EvaluatorSearch Quality Evaluation Tool for Apache Solr & Elasticsearch search-based infrastructures
Stars: ✭ 134 (-27.96%)
VectorsinsearchDice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Searching with Vectors' talk from Haystack 2019 (US). Builds upon my conceptual search and semantic search work from 2015
Stars: ✭ 71 (-61.83%)