1094 open source projects that are alternatives to or similar to Javaorbigdata Interview

A robust and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]

Stars: ✭ 127 (-37.44%)

Mutual labels: spark

Tf Yarn

Train TensorFlow models on YARN in just a few lines of code!

Stars: ✭ 76 (-62.56%)

Mutual labels: hadoop

Bigdataclass

Two-day workshop that covers how to use R to interact with databases and Spark

Stars: ✭ 110 (-45.81%)

Mutual labels: spark

Storm Doc Zh

Chinese translation of the official Apache Storm documentation

Stars: ✭ 142 (-30.05%)

Mutual labels: storm

Job Model

Ant Financial - International Business Group - Front-end recruiting

Stars: ✭ 110 (-45.81%)

Mutual labels: interview

Lift

The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.

Stars: ✭ 127 (-37.44%)

Mutual labels: spark

Ds Cheatsheets

List of Data Science Cheatsheets to rule the world

Stars: ✭ 9,452 (+4556.16%)

Mutual labels: spark

Spark Tsne

Distributed t-SNE via Apache Spark

Stars: ✭ 151 (-25.62%)

Mutual labels: spark

Volcano

A Cloud Native Batch System (Project under CNCF)

Stars: ✭ 2,114 (+941.38%)

Mutual labels: bigdata

Books

Technical books and more

Stars: ✭ 110 (-45.81%)

Mutual labels: bigdata

Spark Practice

Apache Spark (PySpark) Practice on Real Data

Stars: ✭ 200 (-1.48%)

Mutual labels: spark

Lpa Detector

Optimize and improve the Label propagation algorithm

Stars: ✭ 75 (-63.05%)

Mutual labels: spark

Spark Authorizer

A Spark SQL extension which provides SQL Standard Authorization for Apache Spark

Stars: ✭ 141 (-30.54%)

Mutual labels: spark

Technical Interview Megarepo

Study materials for SE/CS technical interviews

Stars: ✭ 1,480 (+629.06%)

Mutual labels: interview

Labs

Research on distributed systems

Stars: ✭ 73 (-64.04%)

Mutual labels: spark

Benchm Ml

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).

Stars: ✭ 1,835 (+803.94%)

Mutual labels: spark

Hive Funnel Udf

Hive UDFs for funnel analysis

Stars: ✭ 72 (-64.53%)

Mutual labels: hadoop

Algorithms

📝 Introduction to Algorithms with JavaScript implementations

Stars: ✭ 126 (-37.93%)

Mutual labels: interview

Transmogrifai

TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machine learning workflows on Apache Spark with minimal hand-tuning

Stars: ✭ 2,084 (+926.6%)

Mutual labels: spark

Countly Sdk Cordova

Countly Product Analytics SDK for Cordova, Icenium and Phonegap

Stars: ✭ 69 (-66.01%)

Mutual labels: bigdata

Spark Bigquery Connector

BigQuery data source for Apache Spark: Read data from BigQuery into DataFrames, write DataFrames into BigQuery tables.

Stars: ✭ 126 (-37.93%)

Mutual labels: spark

Atsd

Axibase Time Series Database Documentation

Stars: ✭ 68 (-66.5%)

Mutual labels: hadoop

Aztk

AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure

Stars: ✭ 152 (-25.12%)

Mutual labels: spark

Fast Mrmr

An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).

Stars: ✭ 67 (-67%)

Mutual labels: spark

Scala Samples

Pieces of Scala code that explain Scala syntax and related topics, such as what you can do with it all

Stars: ✭ 125 (-38.42%)

Mutual labels: spark

Registry

Schema Registry

Stars: ✭ 184 (-9.36%)

Mutual labels: storm

Flinkstreamsql

Extends the real-time SQL of open source Flink; mainly implements joins between streams and dimension tables, and supports all native Flink SQL syntax

Stars: ✭ 1,682 (+728.57%)

Mutual labels: bigdata

Rsparkling

RSparkling: Use H2O Sparkling Water from R (Spark + R + Machine Learning)

Stars: ✭ 65 (-67.98%)

Mutual labels: spark

Android Interview Questions

A repository containing interview questions on DS, Java & Android based on my experiences.

Stars: ✭ 125 (-38.42%)

Mutual labels: interview

Php Interview Best Practices In China

📙 A summary of PHP interview knowledge points

Stars: ✭ 1,133 (+458.13%)

Mutual labels: interview

Athenacli

AthenaCLI is a CLI tool for AWS Athena service that can do auto-completion and syntax highlighting.

Stars: ✭ 151 (-25.62%)

Mutual labels: bigdata

Jumbune

Jumbune, an open source BigData APM & Data Quality Management Platform for Data Clouds. Enterprise feature offering is available at http://jumbune.com. More details of open source offering are at,

Stars: ✭ 64 (-68.47%)

Mutual labels: hadoop

Low Level Design Primer

Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.

Stars: ✭ 2,706 (+1233%)

Mutual labels: interview

Pyspark Twitter Stream Mining

Real-time Machine Learning with Apache Spark on Twitter Public Stream

Stars: ✭ 64 (-68.47%)

Mutual labels: spark

Interviewee Questions

Assorted questions to ask during the interview process

Stars: ✭ 169 (-16.75%)

Mutual labels: interview

Spark Doc Zh

Chinese translation of the official Apache Spark documentation

Stars: ✭ 1,126 (+454.68%)

Mutual labels: spark

Androidofferkiller

💪 Help you get a better offer.

Stars: ✭ 1,669 (+722.17%)

Mutual labels: interview

Javascript Interview Questions Developer

A list of JavaScript interview questions 📝

Stars: ✭ 62 (-69.46%)

Mutual labels: interview

Leetcode In Swift

My solutions to LeetCode problems written in Swift

Stars: ✭ 150 (-26.11%)

Mutual labels: interview

Spark Alchemy

Collection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive

Stars: ✭ 122 (-39.9%)

Mutual labels: spark

Fed Note

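I'm Mokou. 📘 This is where I write front-end blog posts and study notes. A Vue3 source code analysis series is in progress. Please star if you like it.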

Stars: ✭ 180 (-11.33%)

Mutual labels: interview

Whylogs Java

Profile and monitor your ML data pipeline end-to-end

Stars: ✭ 164 (-19.21%)

Mutual labels: spark

Rasterframes

Geospatial Raster support for Spark DataFrames

Stars: ✭ 142 (-30.05%)

Mutual labels: spark

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (-46.31%)

Mutual labels: bigdata

Interviewguide

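"Big Tech Interview Guide": covers Java fundamentals, JVM, databases, MySQL, Redis, computer networks, algorithms, data structures, operating systems, design patterns, system design, and framework internals. Best read at: http://notfound9.github.io/interviewGuide/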

Stars: ✭ 3,117 (+1435.47%)

Mutual labels: interview

Zemberek Nlp Server

REST Docker server built on the Zemberek Turkish NLP Java library

Stars: ✭ 60 (-70.44%)

Mutual labels: spark

Deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

Stars: ✭ 2,020 (+895.07%)

Mutual labels: spark

Pyspark Examples

Code examples on Apache Spark using Python

Stars: ✭ 58 (-71.43%)

Mutual labels: spark

Avro

Apache Avro is a data serialization system.

Stars: ✭ 2,005 (+887.68%)

Mutual labels: bigdata

Eat pyspark in 10 days

pyspark🍒🥭 is delicious, just eat it!😋😋

Stars: ✭ 116 (-42.86%)

Mutual labels: spark

Awesome Pulsar

A curated list of Pulsar tools, integrations and resources.

Stars: ✭ 57 (-71.92%)

Mutual labels: spark

Spark Iforest

Isolation Forest on Spark

Stars: ✭ 166 (-18.23%)

Mutual labels: spark

Parquet Index

Spark SQL index for Parquet tables

Stars: ✭ 109 (-46.31%)

Mutual labels: spark

Big Data Study

🐳 big data study

Stars: ✭ 141 (-30.54%)

Mutual labels: bigdata

Daudit

🌲 Configuration flaws detector for Hadoop, MongoDB, MySQL, and more!

Stars: ✭ 108 (-46.8%)

Mutual labels: bigdata

Distributed Dataset

A distributed data processing framework in Haskell.

Stars: ✭ 108 (-46.8%)

Mutual labels: spark

Interviews

A list of fancy questions I've been asked during the interviews I had. Some of them I ask when interviewing people.

Stars: ✭ 140 (-31.03%)

Mutual labels: interview

Haproxy Configs

80+ HAProxy Configs for Hadoop, Big Data, NoSQL, Docker, Elasticsearch, SolrCloud, HBase, MySQL, PostgreSQL, Apache Drill, Hive, Presto, Impala, Hue, ZooKeeper, SSH, RabbitMQ, Redis, Riak, Cloudera, OpenTSDB, InfluxDB, Prometheus, Kibana, Graphite, Rancher etc.

Stars: ✭ 106 (-47.78%)

Mutual labels: hadoop

Pyspark Cheatsheet

🐍 Quick reference guide to common patterns & functions in PySpark.

Stars: ✭ 108 (-46.8%)

Mutual labels: spark

Full Stack Interview

📝 Full Stack Questions for rocking your job interview 👍

Stars: ✭ 141 (-30.54%)

Mutual labels: interview

301-360 of 1094 similar projects