Datumbox FrameworkDatumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+68.73%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+35.56%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+112.38%)
DataconfsA list of conferences connected with data worldwide.
Stars: ✭ 36 (-94.29%)
Awesome R Learning ResourcesA curated collection of free resources to help deepen your understanding of the R programming language. Updated regularly. Contributions encouraged via pull request (see contributing.md).
Stars: ✭ 181 (-71.27%)
Stats Maths With PythonGeneral statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (-39.52%)
automile-netAutomile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 24 (-96.19%)
RedashMake Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
Stars: ✭ 20,147 (+3097.94%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-34.44%)
Awesome Datascience📝 An awesome Data Science repository to learn and apply for real world problems.
Stars: ✭ 17,520 (+2680.95%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+383.17%)
ClickhouseClickHouse® is a free analytics DBMS for big data
Stars: ✭ 21,089 (+3247.46%)
Dremio OssDremio - the missing link in modern data
Stars: ✭ 862 (+36.83%)
Rakam Api📈 Collect customer event data from your apps. (Note that this project only includes the API collector, not the visualization platform)
Stars: ✭ 772 (+22.54%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (-89.05%)
MetabaseThe simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+4154.44%)
Spark With PythonFundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-76.19%)
GrafanaThe open and composable observability and data visualization platform. Visualize metrics, logs, and traces from multiple sources like Prometheus, Loki, Elasticsearch, InfluxDB, Postgres and many more.
Stars: ✭ 45,930 (+7190.48%)
MproveOpen source Business Intelligence tool 🎉
Stars: ✭ 212 (-66.35%)
CboardAn easy to use, self-service open BI reporting and BI dashboard platform.
Stars: ✭ 2,795 (+343.65%)
Awesome StreamlitThe purpose of this project is to share knowledge on how awesome Streamlit is and can be
Stars: ✭ 769 (+22.06%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+3399.68%)
Climate Change Data🌍 A curated list of APIs, open data and ML/AI projects on climate change
Stars: ✭ 195 (-69.05%)
DataformDataform is a framework for managing SQL based data operations in BigQuery, Snowflake, and Redshift
Stars: ✭ 342 (-45.71%)
Datascience Ai Machinelearning ResourcesAlex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-34.29%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-27.94%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+791.59%)
Feature SelectionFeatures selector based on the self selected-algorithm, loss function and validation method
Stars: ✭ 534 (-15.24%)
Sigma coding youtubeThis is a collection of all the code that can be found on my YouTube channel Sigma Coding.
Stars: ✭ 611 (-3.02%)
Vehicle counting tensorflow🚘 "MORE THAN VEHICLE COUNTING!" This project provides prediction for speed, color and size of the vehicles with TensorFlow Object Counting API.
Stars: ✭ 582 (-7.62%)
Data Science Your WayWays of doing Data Science Engineering and Machine Learning in R and Python
Stars: ✭ 530 (-15.87%)
Awesome Technical Writing📚 A curated list of awesome resources : articles, books, videos, tools, podcasts about technical writing
Stars: ✭ 573 (-9.05%)
Interpretable machine learning with pythonExamples of techniques for training interpretable ML models, explaining ML models, and debugging ML models for accuracy, discrimination, and security.
Stars: ✭ 530 (-15.87%)
Lets PlotAn open-source plotting library for statistical data.
Stars: ✭ 531 (-15.71%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-1.27%)
Book sampleanother book on data science
Stars: ✭ 611 (-3.02%)
Data Science CompetitionsGoal of this repo is to provide the solutions of all Data Science Competitions(Kaggle, Data Hack, Machine Hack, Driven Data etc...).
Stars: ✭ 572 (-9.21%)
PetoolsPE Tools - Portable executable (PE) manipulation toolkit
Stars: ✭ 528 (-16.19%)
RumaleRumale is a machine learning library in Ruby
Stars: ✭ 526 (-16.51%)
PygmyAn open-source, feature rich & extensible url-shortener + analytics written in Python 🍪
Stars: ✭ 569 (-9.68%)
ThrillThrill - An EXPERIMENTAL Algorithmic Distributed Big Data Batch Processing Framework in C++
Stars: ✭ 528 (-16.19%)
Moderndive bookStatistical Inference via Data Science: A ModernDive into R and the Tidyverse
Stars: ✭ 527 (-16.35%)
MoviegeekA django website used in the book Practical Recommender Systems to illustrate how recommender algorithms can be implemented.
Stars: ✭ 608 (-3.49%)
Course V3The 3rd edition of course.fast.ai
Stars: ✭ 4,785 (+659.52%)
Countly ServerCountly helps you get insights from your application. Available self-hosted or on private cloud.
Stars: ✭ 4,857 (+670.95%)
BaikalA graph-based functional API for building complex scikit-learn pipelines.
Stars: ✭ 573 (-9.05%)
DapyEasy-to-use data analysis / manipulation framework for humans
Stars: ✭ 523 (-16.98%)
Disk.frameFast Disk-Based Parallelized Data Manipulation Framework for Larger-than-RAM Data
Stars: ✭ 517 (-17.94%)
LazydataLazydata: Scalable data dependencies for Python projects
Stars: ✭ 627 (-0.48%)
Matrixprofile TsA Python library for detecting patterns and anomalies in massive datasets using the Matrix Profile
Stars: ✭ 621 (-1.43%)
OozieMirror of Apache Oozie
Stars: ✭ 602 (-4.44%)
Pygam[HELP REQUESTED] Generalized Additive Models in Python
Stars: ✭ 569 (-9.68%)
Awesome Cloudrun👓 ⏩ A curated list of resources about all things Cloud Run
Stars: ✭ 521 (-17.3%)
Umbrella☂️ Analytics abstraction layer for Swift
Stars: ✭ 519 (-17.62%)
GiraphMirror of Apache Giraph
Stars: ✭ 569 (-9.68%)