All Projects → 5hirish → Adam_qas

5hirish / Adam_qas

Licence: gpl-3.0
ADAM - A Question Answering System. Inspired from IBM Watson

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Adam qas

Practical Machine Learning With Python
Master the essential skills needed to recognize and solve complex real-world problems with Machine Learning and Deep Learning by leveraging the highly popular Python Machine Learning Eco-system.
Stars: ✭ 1,868 (+466.06%)
Mutual labels:  pandas, natural-language-processing, scikit-learn, spacy
Text Analytics With Python
Learn how to process, classify, cluster, summarize, understand syntax, semantics and sentiment of text data with the power of Python! This repository contains code and datasets used in my book, "Text Analytics with Python" published by Apress/Springer.
Stars: ✭ 1,132 (+243.03%)
Mutual labels:  natural-language-processing, scikit-learn, spacy, gensim
Datacamp Python Data Science Track
All the slides, accompanying code and exercises all stored in this repo. 🎈
Stars: ✭ 250 (-24.24%)
Mutual labels:  pandas, natural-language-processing, scikit-learn
Sense2vec
🦆 Contextually-keyed word vectors
Stars: ✭ 1,184 (+258.79%)
Mutual labels:  natural-language-processing, spacy, gensim
Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (-72.12%)
Mutual labels:  scikit-learn, pandas, spacy
Crime Analysis
Association Rule Mining from Spatial Data for Crime Analysis
Stars: ✭ 20 (-93.94%)
Mutual labels:  pandas, scikit-learn, gensim
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+359.39%)
Mutual labels:  pandas, natural-language-processing, scikit-learn
Eland
Python Client and Toolkit for DataFrames, Big Data, Machine Learning and ETL in Elasticsearch
Stars: ✭ 235 (-28.79%)
Mutual labels:  pandas, elasticsearch, scikit-learn
nlp workshop odsc europe20
Extensive tutorials for the Advanced NLP Workshop in Open Data Science Conference Europe 2020. We will leverage machine learning, deep learning and deep transfer learning to learn and solve popular tasks using NLP including NER, Classification, Recommendation \ Information Retrieval, Summarization, Classification, Language Translation, Q&A and T…
Stars: ✭ 127 (-61.52%)
Mutual labels:  scikit-learn, spacy, gensim
Winerama Recommender Tutorial
A wine recommender system tutorial using Python technologies such as Django, Pandas, or Scikit-learn, and others such as Bootstrap.
Stars: ✭ 324 (-1.82%)
Mutual labels:  pandas, scikit-learn
A-Detector
⭐ An anomaly-based intrusion detection system.
Stars: ✭ 69 (-79.09%)
Mutual labels:  scikit-learn, pandas
skippa
SciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (-90%)
Mutual labels:  scikit-learn, pandas
machine-learning-capstone-project
This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-91.52%)
Mutual labels:  scikit-learn, pandas
five-minute-midas
Predicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-87.58%)
Mutual labels:  scikit-learn, pandas
Product-Categorization-NLP
Multi-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (-90.91%)
Mutual labels:  pandas, gensim
DS-Cookbook101
A jupyter notebook having all most frequent used code snippet for daily data scienceoperations
Stars: ✭ 59 (-82.12%)
Mutual labels:  scikit-learn, pandas
AIPortfolio
Use AI to generate a optimized stock portfolio
Stars: ✭ 28 (-91.52%)
Mutual labels:  scikit-learn, pandas
dataquest-guided-projects-solutions
My dataquest project solutions
Stars: ✭ 35 (-89.39%)
Mutual labels:  scikit-learn, pandas
DataSciPy
Data Science with Python
Stars: ✭ 15 (-95.45%)
Mutual labels:  scikit-learn, pandas
resume tailor
An unsupervised analysis combining topic modeling and clustering to preserve an individuals work history and credentials while tailoring their resume towards a new career field
Stars: ✭ 15 (-95.45%)
Mutual labels:  scikit-learn, gensim

ADAM -- Question Answering System

License: GPL v3 Codacy Badge Codecov Build Status Gitter Twitter

A question answering system that extracts answers from Wikipedia to questions posed in natural language. Inspired by IBM Watson and START. We are currently focused on improving the accuracy of the extracted answers. Follow the creator's blog at shirishkadam.com for updates on progress.

Getting Started

Elasticsearch is being used to store and index the scrapped and parsed texts from Wikipedia. Elasticsearch 7.X installation guide can be found at Elasticsearch Documentation. You might have to start the elasticsearch search service.

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ pip install -r requirements.txt
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

Note: The above installation downloads the best-matching default english language model for spaCy. But to improve the model's accuracy you can install other models too. Read more at spaCy docs.

$ python -m spacy download en_core_web_md

Running with Docker

$ git clone https://github.com/5hirish/adam_qas.git
$ cd adam_qas
$ docker-compose up

Now both conntainers are up and running. Next step is to enter in the python container and run Adam:

$ docker exec -it $(docker ps -a -q  --filter ancestor=adam_qas_adam) bash
$ python -m qas.adam -vv "When was linux kernel version 4.0 released ?"

References

Find more in depth documentation about the system with its research paper and system architecture here

Requirements

Python Package dependencies listed in requirements.txt Upgrading Elasticsearch 6.X:

  • Rolling Update 6.2 to 6.8 > ref
  • Rolling Update 6.8 to 7.1 > ref

Features

  • Extract information from Wikipedia
  • Classify questions with regular expression (default)
  • Classify questions with a SVM (optional)
  • Vector space model used for answer extraction
  • Rank candidate answers
  • Merge top 5 answers into one response

Current Project State ?

GitHub Issue #36: Invalid Answers

TODO

  • [x] Replace Wikipedia APIs with custom scraper
  • [x] Storing extracted data in database (elasticsearch)
  • [x] SQLite test input data storage
  • [ ] Anaphora resolution in both questions and answers
  • [ ] Machine learning query constructor rather than rule-based
  • [ ] Improve vector space language model for answer extraction

Contributions

Please see our contributing documentation for some tips on getting started.

Maintainers

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].