Clustering4EverC4E, a JVM friendly library written in Scala for both local and distributed (Spark) Clustering.
Stars: ✭ 126 (+77.46%)
ZeppelinWeb-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
Stars: ✭ 5,513 (+7664.79%)
pyspark-algorithmsPySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
Stars: ✭ 72 (+1.41%)
Machinelearning ng吴恩达机器学习coursera课程,学习代码(2017年秋) The Stanford Coursera course on MachineLearning with Andrew Ng
Stars: ✭ 181 (+154.93%)
meetups-archivosPpts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …
Stars: ✭ 60 (-15.49%)
gan deeplearning4jAutomatic feature engineering using Generative Adversarial Networks using Deeplearning4j and Apache Spark.
Stars: ✭ 19 (-73.24%)
fastdata-clusterFast Data Cluster (Apache Cassandra, Kafka, Spark, Flink, YARN and HDFS with Vagrant and VirtualBox)
Stars: ✭ 20 (-71.83%)
yuzhouwanCode Library for My Blog
Stars: ✭ 39 (-45.07%)
Andrew Ng NotesThis is Andrew NG Coursera Handwritten Notes.
Stars: ✭ 180 (+153.52%)
SuccinctEnabling queries on compressed data.
Stars: ✭ 257 (+261.97%)
autThe Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
Stars: ✭ 111 (+56.34%)
ZatZeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (+326.76%)
HelkThe Hunting ELK
Stars: ✭ 3,097 (+4261.97%)
Uproot3ROOT I/O in pure Python and NumPy.
Stars: ✭ 312 (+339.44%)
SidekickHigh Performance HTTP Sidecar Load Balancer
Stars: ✭ 366 (+415.49%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (+408.45%)
SparklerSpark-Crawler: Apache Nutch-like crawler that runs on Apache Spark.
Stars: ✭ 362 (+409.86%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+481.69%)
CourseraQuiz & Assignment of Coursera
Stars: ✭ 774 (+990.14%)
Bigdataguide大数据学习,从零开始学习大数据,包含大数据学习各阶段学习视频、面试资料
Stars: ✭ 817 (+1050.7%)
CdapAn open source framework for building data analytic applications.
Stars: ✭ 509 (+616.9%)
BigsliceA serverless cluster computing system for the Go programming language
Stars: ✭ 469 (+560.56%)
JustenoughscalaforsparkA tutorial on the most important features and idioms of Scala that you need to use Spark's Scala APIs.
Stars: ✭ 538 (+657.75%)
Bdp Dataplatform大数据生态解决方案数据平台:基于大数据、数据平台、微服务、机器学习、商城、自动化运维、DevOps、容器部署平台、数据平台采集、数据平台存储、数据平台计算、数据平台开发、数据平台应用搭建的大数据解决方案。
Stars: ✭ 456 (+542.25%)
Coding Now学习记录的一些笔记,以及所看得一些电子书eBooks、视频资源和平常收纳的一些自己认为比较好的博客、网站、工具。涉及大数据几大组件、Python机器学习和数据分析、Linux、操作系统、算法、网络等
Stars: ✭ 750 (+956.34%)
Pyspark Setup DemoDemo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-66.2%)
Awesome Google ColabGoogle Colaboratory Notebooks and Repositories (by @firmai)
Stars: ✭ 863 (+1115.49%)
SkymapHigh-throughput gene to knowledge mapping through massive integration of public sequencing data.
Stars: ✭ 29 (-59.15%)
SparkApache Spark - A unified analytics engine for large-scale data processing
Stars: ✭ 31,618 (+44432.39%)
TedsdsApache Spark - Turbofan Engine Degradation Simulation Data Set example in Apache Spark
Stars: ✭ 14 (-80.28%)
PucketBucketing and partitioning system for Parquet
Stars: ✭ 29 (-59.15%)
Data Algorithms Book MapReduce, Spark, Java, and Scala for Data Algorithms Book
Stars: ✭ 949 (+1236.62%)
SparkmagicJupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (+1243.66%)
Esper TvEsper instance for TV news analysis
Stars: ✭ 37 (-47.89%)
MareMaRe leverages the power of Docker and Spark to run and scale your serial tools in MapReduce fashion.
Stars: ✭ 11 (-84.51%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1305.63%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-2.82%)
Pyspark ExamplesCode examples on Apache Spark using python
Stars: ✭ 58 (-18.31%)