Sparta: Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (-40.14%)
hadoopoffice: HadoopOffice - Analyze Office documents using the Hadoop ecosystem (Spark/Flink/Hive)
Stars: ✭ 56 (-93.47%)
Data Algorithms Book: MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+10.74%)
aaocp: A big data project that analyzes user behavior logs
Stars: ✭ 53 (-93.82%)
py-hdfs-mount: Mount HDFS with FUSE, works with Kerberos!
Stars: ✭ 13 (-98.48%)
cloud: Environment setup and configuration files for cloud computing with hadoop, hive, hue, oozie, sqoop, hbase, and zookeeper
Stars: ✭ 48 (-94.4%)
Kylo: Kylo is a data lake management software platform and framework for enabling scalable enterprise-class data lakes on big data technologies such as Teradata, Apache Spark and/or Hadoop. Kylo is licensed under Apache 2.0. Contributed by Teradata Inc.
Stars: ✭ 916 (+6.88%)
iOS-Interview: 📚 Comprehensive list of questions and problems to pass an interview for the iOS Developer position
Stars: ✭ 127 (-85.18%)
Coding-Interview-Challenges: This is a repo where I upload code for important interview questions written in Python, C++, and Swift
Stars: ✭ 13 (-98.48%)
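litemall-dw: A big data project based on the open-source Litemall e-commerce project, including front-end event tracking (openresty+lua) and back-end event tracking, a five-layer data warehouse, real-time computing, and user profiling. The big data platform uses CDH6.3.2 (scripted with vagrant+ansible) and also includes Azkaban workflows.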
Stars: ✭ 36 (-95.8%)
Coder: Job-hunting information, group problem-solving practice, and experience sharing
Stars: ✭ 22 (-97.43%)
np-flink: Detailed Flink learning and practice
Stars: ✭ 26 (-96.97%)
React-Interview-Questions: Over the last three years I have had a lot of React interview questions, so I decided to collect them all in one place to give others an idea of the most frequently asked React questions. If you have more questions, feel free to make a pull request and add your question along with its answer.
Stars: ✭ 37 (-95.68%)
spark-util: Low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-98.13%)
Cv: 🙈 Front End Engineer Curriculum Vitae - "Interview Handbook for Front-End Page Slicers" (《切图仔面试宝典》). Urgently hiring; please send resumes to [email protected], thank you
Stars: ✭ 772 (-9.92%)
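Algo Basic: Focused on sharing algorithms and computer science fundamentals (including computer networks, operating systems, MySQL, and more). Whether you are preparing for interviews or strengthening your core skills, this can help.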
Stars: ✭ 768 (-10.39%)
knit: Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead
Stars: ✭ 53 (-93.82%)
cmux: A set of commands for managing CDH clusters using the Cloudera Manager REST API.
Stars: ✭ 34 (-96.03%)
dev-recruitment: 👨🏼‍💻 Test your developer skills. Questions and answers at various levels (from junior developer up to senior developer).
Stars: ✭ 19 (-97.78%)
aut: The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (-87.05%)
Awesome Ios Interview: 📲 The curated list of iOS Developer interview questions and answers, Swift & Objective-C
Stars: ✭ 753 (-12.14%)
Stream Reactor: Streaming reference architecture for ETL with Kafka and Kafka-Connect. You can find more on http://lenses.io on how we provide a unified solution to manage your connectors, the most advanced SQL engine for Kafka and Kafka Streams, cluster monitoring and alerting, and more.
Stars: ✭ 753 (-12.14%)
basin: Basin is a visual programming editor for building Spark and PySpark pipelines. Easily build, debug, and deploy complex ETL pipelines from your browser
Stars: ✭ 25 (-97.08%)
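Coding Now: Study notes, along with e-books, video resources, and a collection of blogs, websites, and tools the author considers worthwhile. Covers the major big data components, Python machine learning and data analysis, Linux, operating systems, algorithms, networking, and more.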
Stars: ✭ 750 (-12.49%)
interview-tips: A collection of awesome Interview Tips and Questions
Stars: ✭ 29 (-96.62%)
hadoop-docker-lite: Docker build project to set up a lightweight hadoop cluster containing hadoop, pig, zookeeper, hbase, phoenix, storm, kafka, kafka manager
Stars: ✭ 24 (-97.2%)
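interview-leetcode: 📚 High-frequency technical interview algorithms, real interview Q&A of all kinds, and study guides to help you review quickly and land a job, covering most of the core knowledge programmers need to master.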
Stars: ✭ 161 (-81.21%)
interview-process-survival: 🌈 🦄 This repository is an interview process guide for developers (web/frontend focused)
Stars: ✭ 191 (-77.71%)
Spark Movie Lens: An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-13.07%)
Mega Interview Guide: The MEGA interview guide, JavaScript, Front End, Comp Sci
Stars: ✭ 255 (-70.25%)
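Commondevknowledge: 🔥 🌟⭐⭐⭐ ⭐ The most complete collection of Android interview questions from the BAT tech giants, plus commonly used Android development skills, a summary of obscure knowledge points, a summary of pitfalls encountered in development, and other practical material.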
Stars: ✭ 2,831 (+230.34%)
bigkube: Minikube for big data with Scala and Spark
Stars: ✭ 16 (-98.13%)
Big Data Rosetta Code: Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
Stars: ✭ 254 (-70.36%)
Front End Interview: A list of interview questions for front-end developers
Stars: ✭ 2,754 (+221.35%)
Docker Spark Cluster: A simple Spark standalone cluster for your testing environment purposes
Stars: ✭ 261 (-69.54%)
Codinginterviews: This repository contains coding interviews that I have encountered in company interviews
Stars: ✭ 2,881 (+236.17%)
Hbase Rdd: Spark RDD to read, write and delete from HBase
Stars: ✭ 277 (-67.68%)
Kafka Storm Starter: Code examples that show how to integrate Apache Kafka 0.8+ with Apache Storm 0.9+ and Apache Spark Streaming 1.1+, while using Apache Avro as the data serialization format.
Stars: ✭ 728 (-15.05%)
Behemoth: Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.
Stars: ✭ 286 (-66.63%)
Spark Hbase Connector: Connect Spark to HBase for reading and writing data with ease
Stars: ✭ 299 (-65.11%)
Zat: Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-64.64%)
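Fe Interview: 🔥🔥🔥 Front-end interview prep with exclusive detailed explanations of front-end interview questions, essential for interview practice, with 1000+ real front-end interview questions covering HTML, CSS, JavaScript, Vue, React, Node, TypeScript, Webpack, algorithms, networking and security, and browsers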
Stars: ✭ 4,435 (+417.5%)
Cloudflow: Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-67.56%)
Elasticluster: Create clusters of VMs on the cloud and configure them with Ansible.
Stars: ✭ 298 (-65.23%)