blend / social-modeling-starter

Licence: other
Facebook social data modeling with Scala, HBase, and HPaste

social-modeling-starter

Facebook social data modeling with Scala, HBase, and HPaste.

The project shows how to access the Facebook Graph API from Scala using Dispatch and Lift JSON, and how to use HPaste to read and write HBase and to define MapReduce jobs.

Slides at http://blendlabsinc.com/blend-labs-tech-talk-stanford-may-2012.pdf.

Setup

Visit https://developers.facebook.com/tools/explorer?method=GET&path=me and click "Get Access Token". Select the following permissions:

  • User Data Permissions
    • user_groups
    • user_likes
  • Friends Data Permissions
    • friends_groups
    • friends_likes

Click "Get Access Token", then "Allow". Copy the access token. In your console, run:

export FACEBOOK_ACCESS_TOKEN=<your access token>
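
Before going further, it can help to confirm that the variable is actually set in the current shell. The following is a minimal sketch; the helper name `require_token` is hypothetical and not part of this project.

```shell
# Hypothetical helper (not part of the project): fail early if the
# access token variable is unset or empty in the current shell.
require_token() {
  if [ -z "${FACEBOOK_ACCESS_TOKEN:-}" ]; then
    echo "FACEBOOK_ACCESS_TOKEN is not set" >&2
    return 1
  fi
  # Print only the length so the token itself never lands in logs.
  echo "FACEBOOK_ACCESS_TOKEN is set (${#FACEBOOK_ACCESS_TOKEN} characters)"
}
```

Calling `require_token` at the top of a wrapper script stops the run before any Graph API request is attempted with an empty token.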

Download Cloudera's HBase distribution (CDH4 Beta 2) at http://archive.cloudera.com/cdh4/cdh/4/hbase-0.92.1-cdh4.0.0b2.tar.gz.

Extract the archive and, from the extracted directory, run:

bin/start-hbase.sh
sleep 5
bin/hbase shell
hbase> create 'person', 'info', 'like'
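
The fixed `sleep 5` above may be too short on a slow machine. As a sketch of a more defensive approach, the helpers below retry a check a bounded number of times and look for a table name in the output of the HBase shell's `list` command. Both function names are hypothetical, and the parsing assumes that `list` prints each table name at the start of its own line when commands are piped to the shell on standard input.

```shell
# Hypothetical helper: run a command up to N times, sleeping between
# attempts, and succeed as soon as the command succeeds.
# Usage: retry ATTEMPTS DELAY_SECONDS COMMAND [ARGS...]
retry() {
  attempts=$1
  delay=$2
  shift 2
  i=1
  while [ "$i" -le "$attempts" ]; do
    if "$@"; then
      return 0
    fi
    i=$((i + 1))
    sleep "$delay"
  done
  return 1
}

# Hypothetical helper: read `list` output on stdin and succeed if the
# first field of any line equals the given table name.
has_table() {
  awk -v t="$1" '$1 == t { found = 1 } END { exit !found }'
}
```

For example, wrapping `echo "list" | bin/hbase shell 2>/dev/null | has_table person` in a small function and passing that function to `retry 10 1` would poll until the `person` table is visible or ten attempts have failed.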

To load your Facebook data, run the following from this project's directory:

./bin/load.sh

Two MapReduce calculations are included: "top likes" and "friend likes". To execute them, run:

./bin/topLikes.sh
./bin/friendLikes.sh
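
MapReduce jobs can take a while, so when running the load and both jobs unattended it is useful to stop at the first failure and see which step broke. A minimal sketch, with the hypothetical helper name `run_steps`:

```shell
# Hypothetical helper: run each given command in order, printing its
# name first, and stop at the first one that exits non-zero.
run_steps() {
  for step in "$@"; do
    echo "==> $step"
    if ! "$step"; then
      echo "step failed: $step" >&2
      return 1
    fi
  done
}
```

With this defined, `run_steps ./bin/load.sh ./bin/topLikes.sh ./bin/friendLikes.sh` runs the three scripts from this README in sequence.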