stormsinbrewing / Real_Time_Social_Media_Mining

Licence: MIT license
DevOps pipeline for Real Time Social/Web Mining

Programming Languages

HTML, Python, Java, Shell, Dockerfile

Workflow

  • Set up Apache Maven for the Java project - user interface and MapReduce functions

  • Set up the GitHub repository workflow

  • Set up GitHub Actions for automation

  • Create a web crawler in Python using the Tweepy library to fetch tweets matching a search parameter, such as a hashtag

  • Create an HDFS cluster for MapReduce and write the Hadoop MapReduce programs in Java

  • Set up Hadoop Core and create the JobTracker and TaskTrackers for the project

  • Implement MapReduce in Java over data in HDFS to count how often significant words from the data dictionary occur in tweet text

  • Configure Apache Maven with the MapReduce code and install the Apache Hadoop JAR dependency

  • Configure the MapReduce code in GitHub Actions for automation

  • Automate the Big Data pipeline up to the MapReduce stage using GitHub Actions

  • Write a Java program that runs MapReduce on the JSON file extracted by the crawler to find the frequency of significant words - Textual Analysis

  • Data Classification - create a multi-class data dictionary for sentiment analysis. It currently covers words; in future it might be extended to phrases and sentences for improved accuracy

  • Data Prediction - use the KNN algorithm in Python to find the relation between tweets and their sentiments

  • Data Visualization - use the Python matplotlib library to implement visualization
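The Data Prediction step above names KNN but does not show it. The following is a minimal, standard-library-only sketch of k-nearest-neighbour voting, not the project's actual code. The feature encoding (counts of positive and negative dictionary words per tweet), the training points, and the function name `knn_predict` are all assumptions made for illustration.

```python
from collections import Counter
import math


def knn_predict(train, query, k=3):
    """Return the majority label among the k training points nearest to query."""
    # Sort training items by Euclidean distance to the query and keep the k closest.
    nearest = sorted(train, key=lambda item: math.dist(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical features: (positive word count, negative word count) per tweet.
training_data = [
    ((3, 0), "positive"),
    ((2, 1), "positive"),
    ((4, 1), "positive"),
    ((0, 3), "negative"),
    ((1, 2), "negative"),
    ((0, 4), "negative"),
]

print(knn_predict(training_data, (3, 1)))
print(knn_predict(training_data, (0, 2)))
```

With an odd `k` and two classes the vote cannot tie. A real pipeline would more likely use a library implementation and features derived from the multi-class data dictionary.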

Important Source files and dependencies

  1. pom.xml - Apache Maven setup

  2. helloworld.java - Basic Java project setup

  3. maven.yml - GitHub Actions setup

  4. crawler.py - Web crawler in Python that extracts Twitter data based on specific hashtags

  5. info.csv - Data file produced by the crawler and sent to the HDFS core for processing

  6. MapReduce functionalities in Java

  7. Sentiment analysis in Python
  • Convolutional Neural Networks
  • Decision Tree
  • SVM
  • Pre-Processing
  • Random Forests
  • Naive Bayes
  • XGBoost

  8. matplotlib.py - Data visualization using matplotlib in Python

  9. Hadoop setup
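The crawler output feeds a MapReduce job that counts significant words. The project implements that job in Java on Hadoop. The sketch below only illustrates the same map and reduce steps in plain Python, on in-memory strings. The dictionary contents, the sample tweets, and the function names are hypothetical.

```python
from collections import Counter
from functools import reduce
import re

# Hypothetical data dictionary of significant words (not taken from the project).
DATA_DICTIONARY = {"good", "bad", "happy", "sad"}


def map_tweet(text):
    """Map step: count the dictionary words that occur in one tweet."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(word for word in words if word in DATA_DICTIONARY)


def reduce_counts(left, right):
    """Reduce step: merge two partial counts into one."""
    return left + right


def word_frequencies(tweets):
    """Apply the map step to every tweet, then fold the partial counts together."""
    return reduce(reduce_counts, map(map_tweet, tweets), Counter())


sample_tweets = [
    "Good morning, feeling happy today!",
    "Bad traffic, but still happy.",
    "Nothing significant here.",
]
frequencies = word_frequencies(sample_tweets)
print(dict(frequencies))
```

On Hadoop the map calls run in parallel across HDFS blocks and the framework groups keys before the reduce. Here a single `reduce` over `Counter` objects stands in for that shuffle.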

How to Contribute

This is an open source project, and contributions from everyone are welcome.

Please follow the contribution guidelines.

License

MIT License. Copyright (c) 2019-2020 Storms In Brewing.
