All Projects → jacksu → Utils4s

jacksu / Utils4s

scala、spark使用过程中,各种测试用例以及相关资料整理

Programming Languages

scala
5932 projects

Projects that are alternatives of or similar to Utils4s

cassandra.realtime
Different ways to process data into Cassandra in realtime with technologies such as Kafka, Spark, Akka, Flink
Stars: ✭ 25 (-97.66%)
Mutual labels:  akka, spark-streaming
Play Spark Scala
Stars: ✭ 51 (-95.23%)
Mutual labels:  spark, akka
Cloudflow
Cloudflow enables users to quickly develop, orchestrate, and operate distributed streaming applications on Kubernetes.
Stars: ✭ 278 (-74.02%)
Mutual labels:  spark, akka
Data Accelerator
Data Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-76.92%)
Mutual labels:  spark, spark-streaming
Mobius
C# and F# language binding and extensions to Apache Spark
Stars: ✭ 929 (-13.18%)
Mutual labels:  spark, spark-streaming
Every Single Day I Tldr
A daily digest of the articles or videos I've found interesting, that I want to share with you.
Stars: ✭ 249 (-76.73%)
Mutual labels:  spark, akka
Learningspark
Scala examples for learning to use Spark
Stars: ✭ 421 (-60.65%)
Mutual labels:  spark, spark-streaming
Pyspark Learning
Updated repository
Stars: ✭ 147 (-86.26%)
Mutual labels:  spark, spark-streaming
Angel
A Flexible and Powerful Parameter Server for large-scale machine learning
Stars: ✭ 6,458 (+503.55%)
Mutual labels:  spark, spark-streaming
Sparta
Real Time Analytics and Data Pipelines based on Spark Streaming
Stars: ✭ 513 (-52.06%)
Mutual labels:  spark, spark-streaming
Gimel
Big Data Processing Framework - Unified Data API or SQL on Any Storage
Stars: ✭ 216 (-79.81%)
Mutual labels:  spark, spark-streaming
Real Time Stream Processing Engine
This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.
Stars: ✭ 37 (-96.54%)
Mutual labels:  spark, spark-streaming
Example Spark
Spark, Spark Streaming and Spark SQL unit testing strategies
Stars: ✭ 205 (-80.84%)
Mutual labels:  spark, spark-streaming
wasp
WASP is a framework to build complex real time big data applications. It relies on a kind of Kappa/Lambda architecture mainly leveraging Kafka and Spark. If you need to ingest huge amount of heterogeneous data and analyze them through complex pipelines, this is the framework for you.
Stars: ✭ 19 (-98.22%)
Mutual labels:  akka, spark-streaming
Spark Streaming With Kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
Stars: ✭ 180 (-83.18%)
Mutual labels:  spark, spark-streaming
Coolplayspark
酷玩 Spark: Spark 源代码解析、Spark 类库等
Stars: ✭ 3,318 (+210.09%)
Mutual labels:  spark, spark-streaming
Spark
.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+60.84%)
Mutual labels:  spark, spark-streaming
Azure Event Hubs Spark
Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs
Stars: ✭ 140 (-86.92%)
Mutual labels:  spark, spark-streaming
Cdap
An open source framework for building data analytic applications.
Stars: ✭ 509 (-52.43%)
Mutual labels:  spark, spark-streaming
Learning Spark
零基础学习spark,大数据学习
Stars: ✭ 37 (-96.54%)
Mutual labels:  spark, spark-streaming

utils4s

公众号: 公众号

Build StatusJoin the chat at https://gitter.im/jacksu/utils4s

Issues 中包含我们平时阅读的关于scala、spark好的文章,欢迎推荐

utils4s包含各种scala通用、好玩的工具库demo和使用文档,通过简单的代码演示和操作文档,各种库信手拈来。

同时欢迎大家贡献各种好玩的、经常使用的工具库。

开源中国地址

QQ交流群 432290475(已满),请加530066027 Scala Spark 或者点击上面gitter图标也可以参与讨论

作者博客专注大数据、分布式系统、机器学习,欢迎交流

微博:jacksu_

scala语法学习

说明:scala语法学习过程中,用例代码都放在scala-demo模块下。

利用IntelliJ IDEA与Maven开始你的Scala之旅

快学scala电子书(推荐入门级书)

scala理解的比较深

scala99问题

scala初学者指南(这可不是初学者可以理解的欧,还是写过一些程序后再看)

scala初学者指南英文版

scala学习用例

scala入门笔记

Databricks风格

scala/java 通过maven编译(Mixed Java/Scala Projects)

common库

日志操作log4s

单元测试scalatest

日期操作lama)(注:只支持日期操作,不支持时间操作)

日期时间操作nscala-time)(注:没有每月多少天,每月最后一天,以及每年多少天)

json解析json4s

resources下文件加载用例

文件操作better-files

单位换算squants

线性代数和向量计算(breeze)

分布式并行实现库akka(akka)

Twitter工具库twitter util

日常脚本工具

BigData库

Spark

Spark core

[spark远程调试源代码](http://hadoop1989.com/2016/02/01/Spark-Remote-Debug/)

spark介绍

一个不错的spark学习互动课程

spark 设计与实现

aliyun-spark-deploy-tool---Spark on ECS

Spark Streaming

Spark Streaming使用Kafka保证数据零丢失

spark streaming测试用例

spark streaming源码解析

基于spark streaming的聚合分析(Sparkta)

Spark SQL

spark DataFrame测试用例

Hive Json加载

SparkSQL架构设计和代码分析

Spark 机器学习

spark机器学习源码解析

KeyStoneML KeystoneML is a software framework, written in Scala, from the UC Berkeley AMPLab designed to simplify the construction of large scale, end-to-end, machine learning pipelines with Apache Spark.

spark TS

Spark zeppelin

Z-Manager--Simplify getting Zeppelin up and running

zeppelin--a web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interactive and collaborative documents with SQL, Scala and more.

helium--Brings Zeppelin to data analytics application platform

Spark 其它

spark专题在简书

databricks spark知识库

spark学习知识总结

Spark library for doing exploratory data analysis in a scalable way

图处理(cassovary)

基于spark进行地理位置分析(gagellan)

spark summit east 2016 ppt

ES

ES 非阻塞scala客户端

Beam

[Apache Beam:下一代的数据处理标准](http://geek.csdn.net/news/detail/134167)

贡献代码步骤

1. 首先 fork 我的项目 2. 把 fork 过去的项目也就是你的项目 clone 到你的本地 3. 运行 git remote add jacksu [email protected]:jacksu/utils4s.git 把我的库添加为远端库 4. 运行 git pull jacksu master 拉取并合并到本地 5. coding 6. commit后push到自己的库( git push origin master ) 7. 登陆Github在你首页可以看到一个 pull request 按钮,点击它,填写一些说明信息,然后提交即可。 1~3是初始化操作,执行一次即可。在coding前必须执行第4步同步我的库(这样避免冲突),然后执行5~7既可。

贡献者

[jjcipher](https://github.com/jjcipher)
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].