2024 open source projects that are alternatives to or similar to Geni

Rbbjson
Flexible JSON traversal for rapid prototyping.
Stars: ✭ 155 (+1.97%)
Mutual labels:  data-science
Wukong Agent
Web scan foundation framework
Stars: ✭ 153 (+0.66%)
Mutual labels:  distributed-computing
Testovoe
Home assignments for data science positions
Stars: ✭ 149 (-1.97%)
Mutual labels:  data-science
Stumpy
STUMPY is a powerful and scalable Python library for modern time series analysis
Stars: ✭ 2,019 (+1228.29%)
Mutual labels:  data-science
Azuredatalake
Samples and Docs for Azure Data Lake Store and Analytics
Stars: ✭ 128 (-15.79%)
Mutual labels:  big-data
Storm Doc Zh
Chinese translation of the official Apache Storm documentation
Stars: ✭ 142 (-6.58%)
Mutual labels:  big-data
Airflow Pipeline
An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR
Stars: ✭ 128 (-15.79%)
Mutual labels:  spark
Algocode
Welcome everyone!🌟 Here you can solve problems, build scrapers and much more💻
Stars: ✭ 113 (-25.66%)
Mutual labels:  data-science
Hermione
ML made simple
Stars: ✭ 135 (-11.18%)
Mutual labels:  data-science
Jupyter
Stars: ✭ 145 (-4.61%)
Mutual labels:  data-science
Pyexpool
Python Multi-Process Execution Pool: concurrent asynchronous execution pool with custom resource constraints (memory, timeouts, affinity, CPU cores and caching), load balancing and profiling capabilities of the external apps on NUMA architecture
Stars: ✭ 149 (-1.97%)
Mutual labels:  parallel-computing
Spark Authorizer
A Spark SQL extension which provides SQL Standard Authorization for Apache Spark
Stars: ✭ 141 (-7.24%)
Mutual labels:  spark
Spring Boot Quick
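🌿 Quick-start learning examples based on Spring Boot, integrating the open source frameworks the author has encountered, such as rabbitmq (delayed queues), Kafka, jpa, redis, oauth2, swagger, jsp, docker, spring-batch, exception handling, log output, multi-module development, multi-environment packaging, caching, web crawlers, jwt, GraphQL, dubbo, zookeeper, Async, and more 📌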
Stars: ✭ 1,819 (+1096.71%)
Mutual labels:  spark
Genie
Distributed Big Data Orchestration Service
Stars: ✭ 1,544 (+915.79%)
Mutual labels:  big-data
Plasma
Plasma Programming Language
Stars: ✭ 133 (-12.5%)
Mutual labels:  parallel-computing
Blockchain2graph
Blockchain2graph extracts blockchain data (bitcoin) and inserts it into a graph database (neo4j).
Stars: ✭ 134 (-11.84%)
Mutual labels:  data-science
Torchbear
🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity
Stars: ✭ 128 (-15.79%)
Mutual labels:  data-science
Lambda Arch
Applying Lambda Architecture with Spark, Kafka, and Cassandra.
Stars: ✭ 111 (-26.97%)
Mutual labels:  spark
Uncertainty Metrics
An easy-to-use interface for measuring uncertainty and robustness.
Stars: ✭ 145 (-4.61%)
Mutual labels:  data-science
Datasciencer
a curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+1036.18%)
Mutual labels:  data-science
Doddle Model
🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (-6.58%)
Mutual labels:  data-science
Openuba
A robust and flexible open source User & Entity Behavior Analytics (UEBA) framework used for Security Analytics. Developed with luv by Data Scientists & Security Analysts from the Cyber Security Industry. [PRE-ALPHA]
Stars: ✭ 127 (-16.45%)
Mutual labels:  spark
Embb
Embedded Multicore Building Blocks (EMB²): Library for parallel programming of embedded systems. Star us on GitHub? +1
Stars: ✭ 153 (+0.66%)
Mutual labels:  parallel-computing
Parquet Index
Spark SQL index for Parquet tables
Stars: ✭ 109 (-28.29%)
Mutual labels:  spark
Aliyun Emapreduce Datasources
Extended datasource support for Spark/Hadoop on Aliyun E-MapReduce.
Stars: ✭ 132 (-13.16%)
Mutual labels:  spark
Hass Data Detective
Explore and analyse your Home Assistant data
Stars: ✭ 109 (-28.29%)
Mutual labels:  data-science
Hydrograph
A visual ETL development and debugging tool for big data
Stars: ✭ 144 (-5.26%)
Mutual labels:  big-data
Automl alex
State-of-the-art Automated Machine Learning Python library for Tabular Data
Stars: ✭ 132 (-13.16%)
Mutual labels:  data-science
Lift
The LinkedIn Fairness Toolkit (LiFT) is a Scala/Spark library that enables the measurement of fairness in large scale machine learning workflows.
Stars: ✭ 127 (-16.45%)
Mutual labels:  spark
Hnswlib
Java library for approximate nearest neighbors search using Hierarchical Navigable Small World graphs
Stars: ✭ 108 (-28.95%)
Mutual labels:  spark
The Python Workshop
A New, Interactive Approach to Learning Python
Stars: ✭ 150 (-1.32%)
Mutual labels:  data-science
Go Tsne
t-Distributed Stochastic Neighbor Embedding (t-SNE) in Go
Stars: ✭ 153 (+0.66%)
Mutual labels:  data-science
Machine Learning
🌎 machine learning tutorials (mainly in Python3)
Stars: ✭ 1,924 (+1165.79%)
Mutual labels:  data-science
Rasterframes
Geospatial Raster support for Spark DataFrames
Stars: ✭ 142 (-6.58%)
Mutual labels:  spark
Dtale Desktop
Build a data visualization dashboard with simple snippets of Python code
Stars: ✭ 128 (-15.79%)
Mutual labels:  data-science
Seq2seq tutorial
Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-13.16%)
Mutual labels:  data-science
Awesome Bigdata
A curated list of awesome big data frameworks, resources and other awesomeness.
Stars: ✭ 10,478 (+6793.42%)
Mutual labels:  data-science
Gcp Data Engineer Exam
Study materials for the Google Cloud Professional Data Engineering Exam
Stars: ✭ 144 (-5.26%)
Mutual labels:  data-engineering
Flink Learning
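Flink learning blog. http://www.54tianzhisheng.cn/ Covers Flink basics, concepts, principles, hands-on practice, performance tuning, and source code analysis. Includes learning examples for Flink Connector, Metrics, Library, DataStream API, and Table API & SQL, plus case studies of large-scale production Flink projects (PV/UV, log storage, real-time deduplication of tens of billions of records, monitoring and alerting). Support for my column "Big Data Real-Time Computing Engine: Flink in Practice and Performance Optimization" is welcome.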
Stars: ✭ 11,378 (+7385.53%)
Mutual labels:  spark
Awesome Datascience Colleges
A list of colleges and universities offering degrees in data science.
Stars: ✭ 131 (-13.82%)
Mutual labels:  data-science
Metaprob
An embedded language for probabilistic programming and meta-programming.
Stars: ✭ 155 (+1.97%)
Mutual labels:  data-science
Iot Traffic Monitor
Stars: ✭ 131 (-13.82%)
Mutual labels:  spark
Logigsk
A Linux-based software package to control LEDs on Logitech G910, G810, G610 and G410.
Stars: ✭ 107 (-29.61%)
Mutual labels:  spark
Nd4j
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
Stars: ✭ 1,742 (+1046.05%)
Mutual labels:  spark
Dace
DaCe - Data Centric Parallel Programming
Stars: ✭ 106 (-30.26%)
Open Source Handbook
⭐️ Open source projects for all skill levels
Stars: ✭ 131 (-13.82%)
Mutual labels:  big-data
Clustermq
R package to send function calls as jobs on LSF, SGE, Slurm, PBS/Torque, or each via SSH
Stars: ✭ 106 (-30.26%)
Cc Pyspark
Process Common Crawl data with Python and Spark
Stars: ✭ 147 (-3.29%)
Mutual labels:  spark
Ds Ai Tech Notes
📖 [Translation] Technical notes on data science and artificial intelligence
Stars: ✭ 131 (-13.82%)
Mutual labels:  data-science
Neuroflow
Artificial Neural Networks for Scala
Stars: ✭ 105 (-30.92%)
Mutual labels:  data-science
Machine learning for good
Machine learning fundamentals lesson in interactive notebooks
Stars: ✭ 142 (-6.58%)
Mutual labels:  data-science
Dizk
Java library for distributed zero knowledge proof systems
Stars: ✭ 140 (-7.89%)
Mutual labels:  distributed-computing
Richdem
High-performance Terrain and Hydrology Analysis
Stars: ✭ 127 (-16.45%)
Mutual labels:  big-data
Lifelines
Survival analysis in Python
Stars: ✭ 1,766 (+1061.84%)
Mutual labels:  data-science
Project kojak
Training a Neural Network to Detect Gestures and Control Smart Home Devices with OpenCV in Python
Stars: ✭ 147 (-3.29%)
Mutual labels:  data-science
Local Cluster
Easy local cluster creation for Elixir to aid in unit testing
Stars: ✭ 142 (-6.58%)
Mutual labels:  distributed-computing
Pandahouse
Pandas interface for Clickhouse database
Stars: ✭ 126 (-17.11%)
Mutual labels:  dataframe
Awesome Scientific Python
A curated list of awesome scientific Python resources
Stars: ✭ 127 (-16.45%)
Mutual labels:  data-science
Big Data Study
🐳 big data study
Stars: ✭ 141 (-7.24%)
Mutual labels:  big-data
Data Science For Marketing Analytics
Achieve your marketing goals with the data analytics power of Python
Stars: ✭ 127 (-16.45%)
Mutual labels:  data-science