MagellanGeo Spatial Data Analytics on Spark
Stars: ✭ 507 (-76.95%)
iisInformation Inference Service of the OpenAIRE system
Stars: ✭ 16 (-99.27%)
Mobydq🐳 Tool to automate data quality checks on data pipelines
Stars: ✭ 123 (-94.41%)
FIW KRTFamilies In the WIld: A Kinship Recogntion Toolbox.
Stars: ✭ 18 (-99.18%)
Stream FrameworkStream Framework is a Python library, which allows you to build news feed, activity streams and notification systems using Cassandra and/or Redis. The authors of Stream-Framework also provide a cloud service for feed technology:
Stars: ✭ 4,576 (+108%)
shiftingA privacy-focused list of alternatives to mainstream services to help the competition.
Stars: ✭ 31 (-98.59%)
HadoopDedup🍉基于Hadoop和HBase的大规模海量数据去重
Stars: ✭ 27 (-98.77%)
RedisliteRedis in a python module.
Stars: ✭ 464 (-78.91%)
Attic PredictionioPredictionIO, a machine learning server for developers and ML engineers.
Stars: ✭ 12,522 (+469.18%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-79.36%)
CookbookThe Data Engineering Cookbook
Stars: ✭ 9,829 (+346.77%)
merkle-dbHigh-scalability analytics database built on immutable merkle-trees
Stars: ✭ 44 (-98%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+902.18%)
metriqlThe metrics layer for your data. Join us at https://metriql.com/slack
Stars: ✭ 227 (-89.68%)
Report自动化配置报表平台。演示地址http://58.87.112.247/report 账号 visitor密码123456
Stars: ✭ 123 (-94.41%)
dislibThe Distributed Computing library for python implemented using PyCOMPSs programming model for HPC.
Stars: ✭ 39 (-98.23%)
Circosjsd3 library to build circular graphs
Stars: ✭ 436 (-80.18%)
BookkeeperApache Bookkeeper
Stars: ✭ 1,178 (-46.45%)
cdp-servicecdp数据平台,帮助企业充分了解客户,实现千人千面的精准营销。
Stars: ✭ 30 (-98.64%)
sgdAn R package for large scale estimation with stochastic gradient descent
Stars: ✭ 55 (-97.5%)
Belajarpython.comOpen Source Indonesian Python Programming Tutorial Site
Stars: ✭ 141 (-93.59%)
ytprivYT metadata exporter
Stars: ✭ 28 (-98.73%)
Opendata.cern.chSource code for the CERN Open Data portal
Stars: ✭ 411 (-81.32%)
scikit-learn-intelexIntel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
Stars: ✭ 887 (-59.68%)
MockneatMockNeat is a Java 8+ library that facilitates the generation of arbitrary data for your applications.
Stars: ✭ 410 (-81.36%)
SigmfThe Signal Metadata Format Specification
Stars: ✭ 120 (-94.55%)
bagriXML/Document DB on top of distributed cache
Stars: ✭ 40 (-98.18%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-96.86%)
IgniteApache Ignite
Stars: ✭ 4,027 (+83.05%)
mascMicrosoft's contributions for Spark with Apache Accumulo
Stars: ✭ 20 (-99.09%)
Spark.jlJulia binding for Apache Spark
Stars: ✭ 153 (-93.05%)
HiveApache Hive
Stars: ✭ 4,031 (+83.23%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+858.59%)
Vue Virtual Scroll List⚡️A vue component support big amount data list with high render performance and efficient.
Stars: ✭ 3,201 (+45.5%)
MetorikkuA simplified, lightweight ETL Framework based on Apache Spark
Stars: ✭ 361 (-83.59%)
Data AcceleratorData Accelerator for Apache Spark simplifies onboarding to Streaming of Big Data. It offers a rich, easy to use experience to help with creation, editing and management of Spark jobs on Azure HDInsights or Databricks while enabling the full power of the Spark engine.
Stars: ✭ 247 (-88.77%)
DrillApache Drill is a distributed MPP query layer for self describing data
Stars: ✭ 1,619 (-26.41%)
SylphStream computing platform for bigdata
Stars: ✭ 362 (-83.55%)
Bigdata PlaygroundA complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Stars: ✭ 177 (-91.95%)
KeyviKeyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and lookup performance.
Stars: ✭ 171 (-92.23%)
FluoApache Fluo
Stars: ✭ 159 (-92.77%)
100daysofmlcodeMy journey to learn and grow in the domain of Machine Learning and Artificial Intelligence by performing the #100DaysofMLCode Challenge.
Stars: ✭ 146 (-93.36%)
GafferA large-scale entity and relation database supporting aggregation of properties
Stars: ✭ 1,642 (-25.36%)
OrcAn ORC file format reader and writer for Go.
Stars: ✭ 97 (-95.59%)
Pyspark Setup DemoDemo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
Stars: ✭ 24 (-98.91%)