All Projects → sherlockjjj → T-Watch

sherlockjjj / T-Watch

Licence: other
Real Time Twitter Sentiment Analysis Product

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
shell
77523 projects
CSS
56736 projects
HTML
75241 projects
javascript
184084 projects - #8 most used programming language

Projects that are alternatives of or similar to T-Watch

Udacity Data Engineering Projects
Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake development.
Stars: ✭ 458 (+2190%)
Mutual labels:  airflow, aws-s3, aws-ec2
cloud-cheat-sheets
My handmade cheat-sheets for different AWS services.
Stars: ✭ 63 (+215%)
Mutual labels:  aws-s3, aws-ec2
Spark Jupyter Aws
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
Stars: ✭ 259 (+1195%)
Mutual labels:  aws-s3, aws-ec2
awesome-aws-research
A curated list of awesome Amazon Web Services (AWS) libraries, open source repos, guides, blogs, and other resources for Academic Researchers new to AWS
Stars: ✭ 41 (+105%)
Mutual labels:  aws-s3, aws-ec2
trackit
Trackit helps you understand and improve your use of AWS
Stars: ✭ 91 (+355%)
Mutual labels:  aws-s3, aws-ec2
Security monkey
Security Monkey monitors AWS, GCP, OpenStack, and GitHub orgs for assets and their changes over time.
Stars: ✭ 4,244 (+21120%)
Mutual labels:  aws-s3, aws-ec2
Ansible Playbooks
Ansible playbook collection that have been written for Ubuntu. Some of the playbooks are Elasticsearch, Mesos, AWS, MySql, Sensu, Nginx etc..
Stars: ✭ 429 (+2045%)
Mutual labels:  aws-s3, aws-ec2
Aws Labs
step by step guide for aws mini labs. Currently maintained on : https://github.com/Cloud-Yeti/aws-labs Youtube playlist for labs:
Stars: ✭ 153 (+665%)
Mutual labels:  aws-s3, aws-ec2
Aws Deployment Guide
☁️ Deploy to Amazon aws on a virtual private cloud with elastic beanstalk
Stars: ✭ 89 (+345%)
Mutual labels:  aws-s3, aws-ec2
Curso Aws Com Terraform
🎦 🇧🇷 Arquivos do curso "DevOps: AWS com Terraform Automatizando sua infraestrutura" publicado na Udemy. Você pode me ajudar comprando o curso utilizando o link abaixo.
Stars: ✭ 62 (+210%)
Mutual labels:  aws-s3, aws-ec2
Aws Csa Notes 2018
My AWS Certified Solutions Architect Associate Study Notes!
Stars: ✭ 167 (+735%)
Mutual labels:  aws-s3, aws-ec2
Docker Airflow
Repo for building docker based airflow image. Containers support multiple features like writing logs to local or S3 folder and Initializing GCP while container booting. https://abhioncbr.github.io/docker-airflow/
Stars: ✭ 29 (+45%)
Mutual labels:  airflow, aws-s3
lamba-thumbnailer
AWS S3 Video Thumbnailer with Lambda
Stars: ✭ 21 (+5%)
Mutual labels:  aws-s3
incremental training
Repo that relates to the Medium blog 'Keeping your ML model in shape with Kafka, Airflow' and MLFlow'
Stars: ✭ 110 (+450%)
Mutual labels:  airflow
emacs-easy-jekyll
Emacs major mode for managing jekyll
Stars: ✭ 53 (+165%)
Mutual labels:  aws-s3
sigil
AWS SSM Session manager client
Stars: ✭ 67 (+235%)
Mutual labels:  aws-ec2
kafka-twitter-spark-streaming
Counting Tweets Per User in Real-Time
Stars: ✭ 38 (+90%)
Mutual labels:  spark-streaming
simple-file-uploader
A file uploader written using HTML5 and Node.js. It can upload both to a local directory on the server or to an AWS S3 server.
Stars: ✭ 85 (+325%)
Mutual labels:  aws-s3
Real-time-log-analysis-system
🐧基于spark streaming+flume+kafka+hbase的实时日志处理分析系统(分为控制台版本和基于springboot、Echarts等的Web UI可视化版本)
Stars: ✭ 31 (+55%)
Mutual labels:  spark-streaming
airflow-site
Apache Airflow Website
Stars: ✭ 95 (+375%)
Mutual labels:  airflow

Real Time Twitter Stream Analysis via Kafka and Spark Streaming

Motivation:

Build a data product that could process streaming data and has an end-to-end data pipeline that could be easily scaled upon request.

Model Training:

  1. Training tfidf and random forest model using pipeline on spark ML
  2. Saving models to S3

Real Time Analysis:

  1. Collecting real time twitter streams through Kafka
  2. Integrating Kafka with spark streaming
  3. Loading saved model to predict incoming streams in spark streaming
  4. Storing incoming streams to MongoDB in spark streaming
  5. Fetching data from MongoDB and publishing results on web application via flask

Data Flow

Real Time Data Flow

Tools:

AWS EC2 EMR S3, SES

Kafka

Spark (spark streaming, spark sql, spark ml)

Flask

MongoDB

Plotly

Twilio

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].