Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

Created with love in Canada, visit hostnodejs.com today

Feel like to post an Ad? Learn Details

All Projects → 5hirish → Adam_qas

5hirish / Adam_qas

Licence: gpl-3.0

ADAM - A Question Answering System. Inspired from IBM Watson

Programming Languages

python

139335 projects - #7 most used programming language

Labels

natural-language-processing elasticsearch pandas scikit-learn question-answering wikipedia spacy gensim

Projects that are alternatives of or similar to Adam qas

Practical Machine Learning With Python

Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.

Stars: ✭ 1,868 (+466.06%)

Mutual labels: pandas, natural-language-processing, scikit-learn, spacy

Text Analytics With Python

Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.

Stars: ✭ 1,132 (+243.03%)

Mutual labels: natural-language-processing, scikit-learn, spacy, gensim

Datacamp Python Data Science Track

All the slides, accompanying code and exercises all stored in this repo. 🎈

Stars: ✭ 250 (-24.24%)

Mutual labels: pandas, natural-language-processing, scikit-learn

Sense2vec

🦆 Contextually-keyed word vectors

Stars: ✭ 1,184 (+258.79%)

Mutual labels: natural-language-processing, spacy, gensim

Arch-Data-Science

Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision

Stars: ✭ 92 (-72.12%)

Mutual labels: scikit-learn, pandas, spacy

Crime Analysis

Association Rule Mining from Spatial Data for Crime Analysis

Stars: ✭ 20 (-93.94%)

Mutual labels: pandas, scikit-learn, gensim

Dat8

General Assembly's 2015 Data Science course in Washington, DC

Stars: ✭ 1,516 (+359.39%)

Mutual labels: pandas, natural-language-processing, scikit-learn

Eland

Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch

Stars: ✭ 235 (-28.79%)

Mutual labels: pandas, elasticsearch, scikit-learn

nlp workshop odsc europe20

Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…

Stars: ✭ 127 (-61.52%)

Mutual labels: scikit-learn, spacy, gensim

Winerama Recommender Tutorial

A wine recommender system tutorial using Python technologies such as Django, Pandas, or Scikit-learn, and others such as Bootstrap.

Stars: ✭ 324 (-1.82%)

Mutual labels: pandas, scikit-learn

A-Detector

⭐ An anomaly-based intrusion detection system.

Stars: ✭ 69 (-79.09%)

Mutual labels: scikit-learn, pandas

skippa

SciKIt-learn Pipeline in PAndas

Stars: ✭ 33 (-90%)

Mutual labels: scikit-learn, pandas

machine-learning-capstone-project

This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning

Stars: ✭ 28 (-91.52%)

Mutual labels: scikit-learn, pandas

five-minute-midas

Stars: ✭ 41 (-87.58%)

Mutual labels: scikit-learn, pandas

Product-Categorization-NLP

Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).

Stars: ✭ 30 (-90.91%)

Mutual labels: pandas, gensim

DS-Cookbook101

A jupyter notebook having all most frequent used code snippet for daily data scienceoperations

Stars: ✭ 59 (-82.12%)

Mutual labels: scikit-learn, pandas

AIPortfolio

Use AI to generate a optimized stock portfolio

Stars: ✭ 28 (-91.52%)

Mutual labels: scikit-learn, pandas

dataquest-guided-projects-solutions

My dataquest project solutions

Stars: ✭ 35 (-89.39%)

Mutual labels: scikit-learn, pandas

DataSciPy

Data Science with Python

Stars: ✭ 15 (-95.45%)

Mutual labels: scikit-learn, pandas

resume tailor

An unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field

Stars: ✭ 15 (-95.45%)

Mutual labels: scikit-learn, gensim

View All Similar Projects ➔

ADAM -- Question Answering System

A question answering system that extracts answers from Wikipedia to questions posed in natural language. Inspired by IBM Watson and START. We are currently focused on improving the accuracy of the extracted answers. Follow the creator's blog at shirishkadam.com for updates on progress.

Getting Started

Elasticsearch is being used to store and index the scrapped and parsed texts from Wikipedia. Elasticsearch 7.X installation guide can be found at Elasticsearch Documentation. You might have to start the elasticsearch search service.

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ pip install -r requirements.txt
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

Note: The above installation downloads the best-matching default english language model for spaCy. But to improve the model's accuracy you can install other models too. Read more at spaCy docs.

$ python -m spacy download en_core_web_md

Running with Docker

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ docker-compose up

Now both conntainers are up and running. Next step is to enter in the python container and run Adam:

$ docker exec -it $(docker ps -a -q  --filter ancestor=adam_qas_adam) bash
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

References

Find more in depth documentation about the system with its research paper and system architecture here

Requirements

Python Package dependencies listed in requirements.txt Upgrading Elasticsearch 6.X:

Rolling Update 6.2 to 6.8 > ref
Rolling Update 6.8 to 7.1 > ref

Features

Extract information from Wikipedia
Classify questions with regular expression (default)
Classify questions with a SVM (optional)
Vector space model used for answer extraction
Rank candidate answers
Merge top 5 answers into one response

Current Project State ?

GitHub Issue #36: Invalid Answers

TODO

[x] Replace Wikipedia APIs with custom scraper
[x] Storing extracted data in database (elasticsearch)
[x] SQLite test input data storage
[ ] Anaphora resolution in both questions and answers
[ ] Machine learning query constructor rather than rule-based
[ ] Improve vector space language model for answer extraction

Contributions

Please see our contributing documentation for some tips on getting started.

Maintainers

@5hirish - Shirish Kadam

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 330

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (9) 🔗