
oldratlee / Big Data Study

🐳 big data study

Projects that are alternatives of or similar to Big Data Study

Cortx
CORTX Community Object Storage is 100% open source object storage uniquely optimized for mass capacity storage devices.
Stars: ✭ 426 (+202.13%)
Mutual labels:  big-data, bigdata
Countly Sdk Cordova
Countly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-51.06%)
Mutual labels:  big-data, bigdata
Circosjs
d3 library to build circular graphs
Stars: ✭ 436 (+209.22%)
Mutual labels:  big-data, bigdata
leaflet heatmap
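A simple visualization of Huzhou call data. Assuming the data volume is too large for the browser to draw a heatmap directly, the heatmap rendering step is moved to offline computation and analysis. Apache Spark first processes the data in parallel and then renders the heatmap; leafletjs then loads the OpenStreetMap layer and the heatmap layer to provide good interactivity. Rendering is currently implemented with Apache Spark, but the parallel computation is slower than single-machine computation, either because Apache Spark is not well suited to this kind of work or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .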
Stars: ✭ 13 (-90.78%)
Mutual labels:  big-data, bigdata
Bigdata Notes
A beginner's guide to big data ⭐
Stars: ✭ 10,991 (+7695.04%)
Mutual labels:  big-data, bigdata
NiFi-Rule-engine-processor
Drools processor for Apache NiFi
Stars: ✭ 34 (-75.89%)
Mutual labels:  big-data, bigdata
Hadoop For Geoevent
ArcGIS GeoEvent Server sample Hadoop connector for storing GeoEvents in HDFS.
Stars: ✭ 5 (-96.45%)
Mutual labels:  big-data, bigdata
meetups-archivos
Slides, code, and videos from the meetups, data science days, video calls, and workshops. Data Science Research is a non-profit organization that aims to spread and decentralize knowledge of Data Science and Artificial Intelligence in Peru, giving opportunities to new talent through MeetUps, Workshops, and Semilleros …
Stars: ✭ 60 (-57.45%)
Mutual labels:  big-data, bigdata
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+848.94%)
Mutual labels:  big-data, bigdata
Uproot4
ROOT I/O in pure Python and NumPy.
Stars: ✭ 80 (-43.26%)
Mutual labels:  big-data, bigdata
big data
A collection of tutorials on Hadoop, MapReduce, Spark, Docker
Stars: ✭ 34 (-75.89%)
Mutual labels:  big-data, bigdata
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-22.7%)
Mutual labels:  big-data, bigdata
v6.dooring.public
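A large-screen data visualization solution that provides a visual editing engine, helping individuals and companies easily build their own large-screen visualization applications.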
Stars: ✭ 323 (+129.08%)
Mutual labels:  big-data, bigdata
Uproot3
ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+121.28%)
Mutual labels:  big-data, bigdata
SparkProgrammingInScala
Apache Spark Course Material
Stars: ✭ 57 (-59.57%)
Mutual labels:  big-data, bigdata
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (+428.37%)
Mutual labels:  big-data, bigdata
awesome-coder-resources
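A refueling station on the programming road! ------ [Continuously updated... stars welcome, come back often......] [Contents: programming/learning/reading resources, open source projects, interview questions, websites, books, blogs, tutorials, and more]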
Stars: ✭ 54 (-61.7%)
Mutual labels:  big-data, bigdata
gan deeplearning4j
Automatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-86.52%)
Mutual labels:  big-data, bigdata
Big Data Engineering Coursera Yandex
Big Data for Data Engineers Coursera Specialization from Yandex
Stars: ✭ 71 (-49.65%)
Mutual labels:  big-data, bigdata
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-24.11%)
Mutual labels:  big-data, bigdata

Big Data Study

A curated collection of materials for studying big data.

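Classic Articles

Articles that give either an overall understanding of big data (architecture / scenarios / solution descriptions) or a focused explanation (key components and their characteristics).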

  1. 100 open source Big Data architecture papers for data professionals
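    # Chinese translation: PayPal高级工程总监:读完这100篇论文就能成大数据高手 ("PayPal Senior Director of Engineering: read these 100 papers and become a big data expert")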
  2. The Log: What every software engineer should know about real-time data's unifying abstraction
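    # Chinese translation: 日志:每个软件工程师都应该知道的有关实时数据的统一抽象
    A blog post by Kreps of LinkedIn. Although very long, it has been called an epic must-read for programmers. Logs used to be something only operations staff had to master, but today developers must care about them too, which is in line with DevOps principles.
  3. Big data papers published by Google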
    1. Big Data beyond MapReduce: Google's Big Data papers
      # Chinese translation: 那些年Google公开的大数据领域论文 ("The big data papers Google published over the years")
    2. More Google Big Data papers: Megastore and Spanner

Existing Compilations

  1. Distributed System resources by @ty4z2008
  2. Big data applications and technology - an introductory resource compilation by @memect
  3. A detailed list by domain - Awesome Big Data
  4. The Hadoop Ecosystem Table

Books

A personally curated Douban list of big data books

Discussion & Popular Science

Typical Technologies

Data Scientist
