Ppts, códigos y videos de las meetups, data science days, videollamadas y workshops. Data Science Research es una organización sin fines de lucro que busca difundir, descentralizar y difundir los conocimientos en Ciencia de Datos e Inteligencia Artificial en el Perú, dando oportunidades a nuevos talentos mediante MeetUps, Workshops y Semilleros …

Stars: ✭ 60 (+57.89%)

Mutual labels: big-data

Hdfs Shell

HDFS Shell is a HDFS manipulation tool to work with functions integrated in Hadoop DFS

Stars: ✭ 117 (+207.89%)

Mutual labels: big-data

audio noise clustering

https://dodiku.github.io/audio_noise_clustering/results/ ==> An experiment with a variety of clustering (and clustering-like) techniques to reduce noise on an audio speech recording.

Stars: ✭ 24 (-36.84%)

Mutual labels: clustering

Cmak

CMAK is a tool for managing Apache Kafka clusters

Stars: ✭ 10,544 (+27647.37%)

Mutual labels: big-data

wrangler

Wrangler Transform: A DMD system for transforming Big Data

Stars: ✭ 63 (+65.79%)

Mutual labels: big-data

Asakusafw

Asakusa Framework

Stars: ✭ 114 (+200%)

Mutual labels: big-data

scikit-cmeans

Flexible, extensible fuzzy c-means clustering in python.

Stars: ✭ 18 (-52.63%)

Mutual labels: clustering

Pythondata

repo for code published on pythondata.com

Stars: ✭ 113 (+197.37%)

Mutual labels: big-data

bigquery-kafka-connect

☁️ nodejs kafka connect connector for Google BigQuery

Stars: ✭ 17 (-55.26%)

Mutual labels: big-data

Genie

Distributed Big Data Orchestration Service

Stars: ✭ 1,544 (+3963.16%)

Mutual labels: big-data

predictionio-sdk-java

PredictionIO Java SDK

Stars: ✭ 107 (+181.58%)

Mutual labels: big-data

Spark R Notebooks

R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks

Stars: ✭ 109 (+186.84%)

Mutual labels: big-data

tf-example-models

TensorFlow-based implementation of (Gaussian) Mixture Model and some other examples.

Stars: ✭ 42 (+10.53%)

Mutual labels: k-means

Tennis Crystal Ball

Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction

Stars: ✭ 107 (+181.58%)

Mutual labels: big-data

fuzzy-c-means

Fuzzy c-means Clustering

Stars: ✭ 34 (-10.53%)

Mutual labels: clustering

Maha

A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.

Stars: ✭ 101 (+165.79%)

Mutual labels: big-data

react-map-gl-cluster

Urbica React Cluster Component for Mapbox GL JS

Stars: ✭ 27 (-28.95%)

Mutual labels: clustering

Graph sampling

Graph Sampling is a python package containing various approaches which samples the original graph according to different sample sizes.

Stars: ✭ 99 (+160.53%)

Mutual labels: big-data

DP means

Dirichlet Process K-means

Stars: ✭ 36 (-5.26%)

Mutual labels: clustering

Bigdata Notes

大数据入门指南 ⭐

Stars: ✭ 10,991 (+28823.68%)

Mutual labels: big-data

impfuzzy

Fuzzy Hash calculated from import API of PE files

Stars: ✭ 67 (+76.32%)

Mutual labels: clustering

Orc

An ORC file format reader and writer for Go.

Stars: ✭ 97 (+155.26%)

Mutual labels: big-data

HadoopDedup

🍉基于Hadoop和HBase的大规模海量数据去重

Stars: ✭ 27 (-28.95%)

Mutual labels: big-data

Streamx

kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)

Stars: ✭ 96 (+152.63%)

Mutual labels: big-data

Deep-multimodal-subspace-clustering-networks

Tensorflow implementation of "Deep Multimodal Subspace Clustering Networks"

Stars: ✭ 62 (+63.16%)

Mutual labels: clustering

Treeviz

Tree diagrams with JavaScript 🌲 📈

Stars: ✭ 95 (+150%)

Mutual labels: big-data

ml-book

Codice sorgente ed Errata Corrige del mio libro "A tu per tu col Machine Learning"

Stars: ✭ 16 (-57.89%)

Mutual labels: clustering

M-NMF

An implementation of "Community Preserving Network Embedding" (AAAI 2017)

Stars: ✭ 119 (+213.16%)

Mutual labels: clustering

Machine-Learning-Algorithms

All Machine Learning Algorithms

Stars: ✭ 24 (-36.84%)

Mutual labels: clustering

big-data-lite

Samples to the Oracle Big Data Lite VM

Stars: ✭ 41 (+7.89%)

Mutual labels: big-data

Smart Array To Tree

Convert large amounts of data array to tree fastly

Stars: ✭ 91 (+139.47%)

Mutual labels: big-data

hierarchical-clustering

A Python implementation of divisive and hierarchical clustering algorithms. The algorithms were tested on the Human Gene DNA Sequence dataset and dendrograms were plotted.

Stars: ✭ 62 (+63.16%)

Mutual labels: clustering

Dataengineeringproject

Example end to end data engineering project.

Stars: ✭ 82 (+115.79%)

Mutual labels: big-data

pyclustertend

A python package to assess cluster tendency

Stars: ✭ 38 (+0%)

Mutual labels: clustering

Uproot4

ROOT I/O in pure Python and NumPy.