RbbjsonFlexible JSON traversal for rapid prototyping.
Stars: ✭ 155 (+1.97%)
Wukong AgentWeb scan foundation framework
Stars: ✭ 153 (+0.66%)
TestovoeHome assignments for data science positions
Stars: ✭ 149 (-1.97%)
StumpySTUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+1228.29%)
AzuredatalakeSamples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-15.79%)
Airflow PipelineAn Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-15.79%)
AlgocodeWelcome everyone!🌟 Here you can solve problems, build scrappers and much more💻
Stars: ✭ 113 (-25.66%)
HermioneML made simple
Stars: ✭ 135 (-11.18%)
PyexpoolPython Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architecture
Stars: ✭ 149 (-1.97%)
Spark AuthorizerA Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-7.24%)
Spring Boot Quick🌿 基于springboot的快速学习示例,整合自己遇到的开源框架,如:rabbitmq(延迟队列)、Kafka、jpa、redies、oauth2、swagger、jsp、docker、spring-batch、异常处理、日志输出、多模块开发、多环境打包、缓存cache、爬虫、jwt、GraphQL、dubbo、zookeeper和Async等等📌
Stars: ✭ 1,819 (+1096.71%)
GenieDistributed Big Data Orchestration Service
Stars: ✭ 1,544 (+915.79%)
PlasmaPlasma Programming Language
Stars: ✭ 133 (-12.5%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-11.84%)
Torchbear🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity
Stars: ✭ 128 (-15.79%)
Lambda ArchApplying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-26.97%)
Uncertainty MetricsAn easy-to-use interface for measuring uncertainty and robustness.
Stars: ✭ 145 (-4.61%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+1036.18%)
Doddle Model🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (-6.58%)
OpenubaA robust, and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-16.45%)
EmbbEmbedded Multicore Building Blocks (EMB²): Library for parallel programming of embedded systems. Star us on GitHub? +1
Stars: ✭ 153 (+0.66%)
Parquet IndexSpark SQL index for Parquet tables
Stars: ✭ 109 (-28.29%)
HydrographA visual ETL development and debugging tool for big data
Stars: ✭ 144 (-5.26%)
Automl alexState-of-the art Automated Machine Learning python library for Tabular Data
Stars: ✭ 132 (-13.16%)
LiftThe LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-16.45%)
HnswlibJava library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-28.95%)
Go Tsnet-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
Stars: ✭ 153 (+0.66%)
Machine Learning🌎 machine learning tutorials (mainly in Python3)
Stars: ✭ 1,924 (+1165.79%)
RasterframesGeospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-6.58%)
Dtale DesktopBuild a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (-15.79%)
Seq2seq tutorialCode For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-13.16%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+6793.42%)
Gcp Data Engineer ExamStudy materials for the Google Cloud Professional Data Engineering Exam
Stars: ✭ 144 (-5.26%)
Flink Learningflink learning blog. http://www.54tianzhisheng.cn/ 含 Flink 入门、概念、原理、实战、性能调优、源码解析等内容。涉及 Flink Connector、Metrics、Library、DataStream API、Table API & SQL 等内容的学习案例,还有 Flink 落地应用的大型项目案例(PVUV、日志存储、百亿数据实时去重、监控告警)分享。欢迎大家支持我的专栏《大数据实时计算引擎 Flink 实战与性能优化》
Stars: ✭ 11,378 (+7385.53%)
MetaprobAn embedded language for probabilistic programming and meta-programming.
Stars: ✭ 155 (+1.97%)
LogigskA Linux based software package to control led's on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-29.61%)
Nd4jFast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+1046.05%)
DaceDaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-30.26%)
ClustermqR package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Stars: ✭ 106 (-30.26%)
Cc PysparkProcess Common Crawl data with Python and Spark
Stars: ✭ 147 (-3.29%)
NeuroflowArtificial Neural Networks for Scala
Stars: ✭ 105 (-30.92%)
DizkJava library for distributed zero knowledge proof systems
Stars: ✭ 140 (-7.89%)
RichdemHigh-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-16.45%)
LifelinesSurvival analysis in Python
Stars: ✭ 1,766 (+1061.84%)
Project kojakTraining a Neural Network to Detect Gestures and Control Smart Home Devices with OpenCV in Python
Stars: ✭ 147 (-3.29%)
Local ClusterEasy local cluster creation for Elixir to aid in unit testing
Stars: ✭ 142 (-6.58%)
PandahousePandas interface for Clickhouse database
Stars: ✭ 126 (-17.11%)