
omarsar / clinical_nlp_elastic

Licence: other
Clinical NLP Analysis with Elasticsearch and Kibana

Programming Languages

Jupyter Notebook
Python

Projects that are alternatives of or similar to clinical nlp elastic

icingabeat
Elastic Beat fetching events & status from Icinga 2
Stars: ✭ 36 (+12.5%)
Mutual labels:  kibana, elastic
Elastic Charts
📊 Elastic Charts library
Stars: ✭ 191 (+496.88%)
Mutual labels:  kibana, elastic
Sigmaui
SIGMA UI is a free open-source application based on the Elastic stack and Sigma Converter (sigmac)
Stars: ✭ 123 (+284.38%)
Mutual labels:  kibana, elastic
kibana-keycloak
Secure kibana dashboards using keycloak
Stars: ✭ 40 (+25%)
Mutual labels:  kibana, elastic
ELK-Hunting
Threat Hunting with ELK Workshop (InfoSecWorld 2017)
Stars: ✭ 58 (+81.25%)
Mutual labels:  kibana, elastic
Elastic Docker
Example setups for Elasticsearch, Kibana, Logstash, and Beats with docker-compose
Stars: ✭ 118 (+268.75%)
Mutual labels:  kibana, elastic
Docker Elastic Stack
ELK Stack Dockerfile
Stars: ✭ 175 (+446.88%)
Mutual labels:  kibana, elastic
Sentinl
Kibana Alert & Report App for Elasticsearch
Stars: ✭ 1,233 (+3753.13%)
Mutual labels:  kibana, elastic
awesome-elastic-stack
Awesome Elastic Stack
Stars: ✭ 29 (-9.37%)
Mutual labels:  kibana, elastic
kbn circles vis
Kibana 4.4.1 D3 Circles Packing Visualization
Stars: ✭ 30 (-6.25%)
Mutual labels:  kibana, elastic
Redelk
Red Team's SIEM - tool for Red Teams used for tracking and alarming about Blue Team activities as well as better usability in long term operations.
Stars: ✭ 1,692 (+5187.5%)
Mutual labels:  kibana, elastic
next-eui-starter
Start building Kibana prototypes quickly with the Next.js EUI Starter
Stars: ✭ 74 (+131.25%)
Mutual labels:  kibana, elastic
Microservice Monitoring
Monitor your Spring Boot application with the Elastic Stack all around
Stars: ✭ 114 (+256.25%)
Mutual labels:  kibana, elastic
Elastic
Project for building dashboards with the Elastic Stack (6.2.4)
Stars: ✭ 121 (+278.13%)
Mutual labels:  kibana, elastic
Elasticambari
Elastic Service for Ambari
Stars: ✭ 108 (+237.5%)
Mutual labels:  kibana, elastic
Elastic Stack
Learn Elasticsearch, Logstash, Kibana, and Beats the easiest way ⭐️
Stars: ✭ 135 (+321.88%)
Mutual labels:  kibana, elastic
Pfelk
pfSense/OPNsense + ELK
Stars: ✭ 417 (+1203.13%)
Mutual labels:  kibana, elastic
Elasticsearch Cli
Command line interface for ElasticSearch
Stars: ✭ 70 (+118.75%)
Mutual labels:  kibana, elastic
jmx-monitoring-stacks
No description or website provided.
Stars: ✭ 170 (+431.25%)
Mutual labels:  kibana, elastic
area3d vis
3D Graph Visualization for Kibana 5
Stars: ✭ 28 (-12.5%)
Mutual labels:  kibana, elastic

Online Mental Disorder Analysis with Elasticsearch and Kibana



Project Summary

In this presentation (slides available here), I provide a basic analysis and feature engineering pipeline for discovering interesting and hidden linguistic phenomena and emotional behaviors of online users with mental disorders. The first phase of the analysis focuses on exploring basic linguistic and social cues such as emoticons, stopwords, hashtags, and social mentions. The goals of the presentation are the following:

  • To broadly show how to leverage Elasticsearch's ingest pipeline and custom analyzers for preprocessing and feature engineering
  • To introduce common best practices for dealing with natural language data
  • To discover insights that help improve feature engineering and ML models
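
As a rough illustration of the first goal, the sketch below builds an index definition with a custom analyzer and a small ingest pipeline as plain Python dictionaries. It is not the configuration used in the presentation: the field name `text`, the analyzer and tokenizer names, and the choice of processors are all assumptions made for the example. The `preview_hashtags` helper only approximates locally what the hashtag tokenizer would emit; the real tokenization happens inside Elasticsearch.

```python
import json
import re


def build_index_settings():
    """Index body with a custom analyzer that keeps only hashtags as tokens.

    All names here ("text", "hashtag_tokenizer", "hashtag_analyzer") are
    hypothetical and chosen for illustration.
    """
    return {
        "settings": {
            "analysis": {
                "tokenizer": {
                    # The pattern tokenizer with group 0 emits each regex match as a token.
                    "hashtag_tokenizer": {
                        "type": "pattern",
                        "pattern": "#\\w+",
                        "group": 0,
                    }
                },
                "analyzer": {
                    "hashtag_analyzer": {
                        "type": "custom",
                        "tokenizer": "hashtag_tokenizer",
                        "filter": ["lowercase"],
                    }
                },
            }
        },
        "mappings": {
            "properties": {
                "text": {
                    "type": "text",
                    # Sub-field analyzed with the custom analyzer defined above.
                    "fields": {
                        "hashtags": {"type": "text", "analyzer": "hashtag_analyzer"}
                    },
                }
            }
        },
    }


def build_pipeline():
    """Ingest pipeline that stores a cleaned copy of the text before indexing."""
    return {
        "description": "Illustrative preprocessing pipeline (assumed, not the project's)",
        "processors": [
            {"trim": {"field": "text"}},
            # Strip URLs into a separate field so the raw text is preserved.
            {
                "gsub": {
                    "field": "text",
                    "pattern": "https?://\\S+",
                    "replacement": "",
                    "target_field": "text_clean",
                }
            },
            {"lowercase": {"field": "text_clean"}},
        ],
    }


def preview_hashtags(text):
    """Local approximation of the tokens the hashtag analyzer would produce."""
    pattern = build_index_settings()["settings"]["analysis"]["tokenizer"][
        "hashtag_tokenizer"
    ]["pattern"]
    return [token.lower() for token in re.findall(pattern, text)]


if __name__ == "__main__":
    print(json.dumps(build_pipeline(), indent=2))
    print(preview_hashtags("Rough week #BPD #selfcare"))
```

Both dictionaries can be pasted as JSON into the Kibana developer tools console or passed to the Python client when creating the index and the pipeline.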

Prerequisites

This project assumes basic knowledge of Elasticsearch and Python. The Elastic learning page and official reference are great places to start learning about Elasticsearch and Kibana.

Requirements

Below are the requirements for this project:

Framework

In this project, I show how to insert the data described in the Data section into Elasticsearch in order to conduct feature engineering and analysis using Kibana and its developer tools. I make heavy use of the ingest pipeline to define processors through which the data is fed before being indexed (stored) in Elasticsearch via the bulk API, using the elasticsearch Python module. The code (notebook) used to perform the bulk insert can be found in this repository.
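
The bulk insert described above can be sketched roughly as follows. This is a minimal illustration rather than the notebook's actual code: the index name `tweets`, the pipeline id `clinical_nlp_pipeline`, and the record fields are hypothetical. The generator only prepares the actions; the client calls are left as comments because they need a running cluster, and their exact form may vary with the installed version of the elasticsearch module.

```python
def generate_actions(records, index_name="tweets"):
    """Yield one bulk action per record, in the dict format used by the bulk helper.

    `records` is any iterable of dicts; "id" is an assumed optional key that,
    when present, is used as the document id.
    """
    for position, record in enumerate(records):
        yield {
            "_index": index_name,
            "_id": record.get("id", position),
            "_source": {key: value for key, value in record.items() if key != "id"},
        }


sample_records = [
    {"id": "a1", "user": "user_1", "text": "Feeling better today #hope"},
    {"user": "user_2", "text": "Long night again :("},
]

actions = list(generate_actions(sample_records))

# With a running cluster, the actions could be sent through an ingest pipeline
# (assumed id "clinical_nlp_pipeline") roughly like this:
#
#   from elasticsearch import Elasticsearch, helpers
#   es = Elasticsearch("http://localhost:9200")
#   helpers.bulk(es, generate_actions(sample_records), pipeline="clinical_nlp_pipeline")
```

Using a generator keeps memory use flat for large collections, since the helper consumes actions in chunks instead of requiring the whole dataset in a list.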


Data

The participants used in this analysis were collected from Twitter based on their self-reported status. The self-reported mental disorder cases were collected via a filter, "I am diagnosed with X". I only focus on bipolar disorder and borderline personality disorder self-reported cases. A separate control group was randomly collected using the Twitter public API. All cases were manually verified and checked for data quality control. More information on how to collect such a dataset can be found in this paper.
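
One plausible way to express the "I am diagnosed with X" filter is a case-insensitive regular expression restricted to the two conditions of interest. The sketch below is an assumption about how such a filter could look, not the collection code used for this dataset, and it does not replace the manual verification step: a pattern match only flags candidate posts.

```python
import re

# The two conditions this analysis focuses on.
CONDITIONS = ("bipolar disorder", "borderline personality disorder")

# Hypothetical pattern: the exact phrasing rules used for collection are not documented here.
SELF_REPORT_PATTERN = re.compile(
    r"\bi am diagnosed with (?P<condition>"
    + "|".join(re.escape(condition) for condition in CONDITIONS)
    + r")\b",
    re.IGNORECASE,
)


def self_reported_condition(text):
    """Return the matched condition in lowercase, or None if the post does not match."""
    match = SELF_REPORT_PATTERN.search(text)
    return match.group("condition").lower() if match else None


if __name__ == "__main__":
    print(self_reported_condition("I am diagnosed with Bipolar Disorder, day 3 of meds"))
    print(self_reported_condition("Just had coffee"))
```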

There are some ethical concerns about publicly sharing the dataset, so if you are seriously interested in using it for research purposes, please feel free to reach out to me at [email protected]. A similar alternative dataset (collected from Reddit) can be obtained through a similar process. Ideally, the choice of dataset does not matter: the goal is to provide a framework that can perform the analysis on various kinds of datasets coming from different sources.

Dashboard

The visualization dashboard built using Kibana looks as follows:

[Image: Kibana visualization dashboard]

Other Resources

The full presentation slides accompanying this project can be found here.

Project Status

  • Phase 1 - Proposal of indexing, feature engineering, and analytics (read more)
  • Phase 2 (Coming soon!) - More details on project, models, data, research, etc.

Future Ideas

  • Create sample data to reproduce experiments and analyses
  • General NLP pipeline for mental disorder analysis and text analytics
  • Integrate logstash for gathering, processing, and indexing more cases
  • Build a machine learning or deep learning dedicated API to automatically perform prediction of new mental disorder cases

Contact Information
