All Projects → Dataflowjavasdk → Similar Projects or Alternatives

1712 Open source projects that are alternatives of or similar to Dataflowjavasdk

Collapse
Advanced and Fast Data Transformation in R
Stars: ✭ 184 (-78.45%)
Pyod
A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)
Stars: ✭ 5,083 (+495.2%)
Dex
Dex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (+44.96%)
Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+56.67%)
Mutual labels:  data-science, data-analysis, big-data
Pachyderm
Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+521.19%)
Mutual labels:  data-science, data-analysis, big-data
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-87.24%)
Mutual labels:  data-science, data-analysis, big-data
Pycm
Multi-class confusion matrix library in Python
Stars: ✭ 1,076 (+26%)
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+37.59%)
Mutual labels:  data-science, data-analysis, big-data
Model Describer
model-describer : Making machine learning interpretable to humans
Stars: ✭ 22 (-97.42%)
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-74.47%)
Tennis Crystal Ball
Ultimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-87.47%)
Mutual labels:  data-science, data-analysis, big-data
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-17.56%)
Setl
A simple Spark-powered ETL framework that just works 🍺
Stars: ✭ 79 (-90.75%)
Mutual labels:  data-science, data-analysis, big-data
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-82.08%)
Mutual labels:  data-science, data-analysis, big-data
Data Science Live Book
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-77.4%)
Mutual labels:  data-science, data-analysis, big-data
Dataproofer
A proofreader for your data
Stars: ✭ 628 (-26.46%)
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (+257.26%)
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-46.84%)
Mutual labels:  data-science, data-analysis, big-data
Data Science With Ruby
Practical Data Science with Ruby based tools.
Stars: ✭ 549 (-35.71%)
Accelerator
The Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-83.96%)
Mutual labels:  data-science, big-data, data-mining
Machine learning for good
Machine learning fundamentals lesson in interactive notebooks
Stars: ✭ 142 (-83.37%)
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-36.65%)
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-79.98%)
Nfstream
NFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-27.17%)
Knowage Server
Knowage is the professional open source suite for modern business analytics over traditional sources and big data systems.
Stars: ✭ 276 (-67.68%)
Mutual labels:  data-analysis, big-data, data-mining
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1.17%)
Ai Learn
人工智能学习路线图,整理近200个实战案例与项目,免费提供配套教材,零基础入门,就业实战!包括:Python,数学,机器学习,数据分析,深度学习,计算机视觉,自然语言处理,PyTorch tensorflow machine-learning,deep-learning data-analysis data-mining mathematics data-science artificial-intelligence python tensorflow tensorflow2 caffe keras pytorch algorithm numpy pandas matplotlib seaborn nlp cv等热门领域
Stars: ✭ 4,387 (+413.7%)
Elki
ELKI Data Mining Toolkit
Stars: ✭ 613 (-28.22%)
Tsrepr
TSrepr: R package for time series representations
Stars: ✭ 75 (-91.22%)
Urs
Universal Reddit Scraper - A comprehensive Reddit scraping command-line tool written in Python.
Stars: ✭ 275 (-67.8%)
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-86.77%)
Mutual labels:  data-science, data-analysis, big-data
Rightmove webscraper.py
Python class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-85.36%)
Vizuka
Explore high-dimensional datasets and how your algo handles specific regions.
Stars: ✭ 100 (-88.29%)
Mutual labels:  data-science, big-data, data-mining
Deepgraph
Analyze Data with Pandas-based Networks. Documentation:
Stars: ✭ 232 (-72.83%)
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-64.64%)
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-97.89%)
Biolitmap
Code for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-97.89%)
Mutual labels:  data-science, data-mining
Mlxtend
A library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+336.65%)
Mutual labels:  data-science, data-mining
Graph Fraud Detection Papers
A curated list of fraud detection papers using graph information or graph neural networks
Stars: ✭ 339 (-60.3%)
Mutual labels:  data-science, data-mining
Pretzel
Javascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-96.96%)
Mutual labels:  data-science, big-data
Kneed
Knee point detection in Python 📈
Stars: ✭ 328 (-61.59%)
Mutual labels:  data-science, data-analysis
Scikit Mobility
scikit-mobility: mobility analysis in Python
Stars: ✭ 339 (-60.3%)
Mutual labels:  data-science, data-analysis
Football Data
football (soccer) datasets
Stars: ✭ 18 (-97.89%)
Mutual labels:  data-science, data-analysis
Pandas Summary
An extension to pandas dataframes describe function.
Stars: ✭ 361 (-57.73%)
Mutual labels:  data-science, data-analysis
Quantitative Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-58.31%)
Mutual labels:  data-science, data-analysis
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-59.02%)
Mutual labels:  data-science, data-analysis
Skdata
Python tools for data analysis
Stars: ✭ 16 (-98.13%)
Mutual labels:  data-science, data-analysis
Akshare
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
Stars: ✭ 4,334 (+407.49%)
Mutual labels:  data-science, data-analysis
Artificial Adversary
🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (-59.25%)
Mutual labels:  data-science, data-mining
Dataexplorer
Automate Data Exploration and Treatment
Stars: ✭ 362 (-57.61%)
Mutual labels:  data-science, data-analysis
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (-63.11%)
Mutual labels:  data-science, data-analysis
Datacleaner
The premier open source Data Quality solution
Stars: ✭ 391 (-54.22%)
Mutual labels:  data-science, data-analysis
Sktime
A unified framework for machine learning with time series
Stars: ✭ 4,741 (+455.15%)
Mutual labels:  data-science, data-mining
The Elements Of Statistical Learning Python Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
Stars: ✭ 405 (-52.58%)
Mutual labels:  data-science, data-analysis
Datascience Ai Machinelearning Resources
Alex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-51.52%)
Mutual labels:  data-science, big-data
Ml From Scratch
Python implementations of some of the fundamental Machine Learning models and algorithms from scratch.
Stars: ✭ 20,624 (+2314.99%)
Mutual labels:  data-science, data-mining
Cogcomp Nlp
CogComp's Natural Language Processing libraries and Demos:
Stars: ✭ 410 (-51.99%)
Mutual labels:  big-data, data-mining
Dataframe
C++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (-3.04%)
Mutual labels:  data-science, data-analysis
Jupyter pivottablejs
Drag’n’drop Pivot Tables and Charts for Jupyter/IPython Notebook, care of PivotTable.js
Stars: ✭ 428 (-49.88%)
Mutual labels:  data-science, data-analysis
Data Science Ipython Notebooks
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+2481.73%)
Mutual labels:  data-science, big-data
1-60 of 1712 similar projects