7345 open source projects that are alternatives to or similar to Optimus

Spark Py Notebooks
Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+35.7%)
Pandas Videos
Jupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+74.04%)
optimus
🚚 Agile Data Preparation Workflows made easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
Stars: ✭ 1,351 (+37.02%)
W2v
Word2Vec models with Twitter data using Spark. Blog:
Stars: ✭ 64 (-93.51%)
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+53.75%)
My Journey In The Data Science World
📢 Ready to learn or review your knowledge!
Stars: ✭ 1,175 (+19.17%)
Spark R Notebooks
R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-88.95%)
Spark Movie Lens
An on-line movie recommender using Spark, Python Flask, and the MovieLens dataset
Stars: ✭ 745 (-24.44%)
Mutual labels:  jupyter-notebook, spark, bigdata
Spark With Python
Fundamentals of Spark with Python (using PySpark), code examples
Stars: ✭ 150 (-84.79%)
Mutual labels:  jupyter-notebook, spark, pyspark
Skdata
Python tools for data analysis
Stars: ✭ 16 (-98.38%)
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+190.47%)
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-88.44%)
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-87.53%)
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+137.02%)
Machine learning for good
Machine learning fundamentals lesson in interactive notebooks
Stars: ✭ 142 (-85.6%)
Spring2017 proffosterprovost
Introduction to Data Science
Stars: ✭ 18 (-98.17%)
Data Science Resources
👨🏽‍🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-82.66%)
Handyspark
HandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (-83.98%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Tdd Example
A simple Spark TDD example
Stars: ✭ 23 (-97.67%)
Mutual labels:  jupyter-notebook, spark, pyspark
Spark Practice
Apache Spark (PySpark) Practice on Real Data
Stars: ✭ 200 (-79.72%)
Mutual labels:  jupyter-notebook, spark, pyspark
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-75.66%)
Cookbook 2nd
IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (-28.6%)
Datascience course
Data Science course in Portuguese
Stars: ✭ 294 (-70.18%)
Pydataroad
Open source code for the WeChat official account (ID: PyDataLab)
Stars: ✭ 302 (-69.37%)
Zat
Zeek Analysis Tools (ZAT): Processing and analysis of Zeek network data with Pandas, scikit-learn, Kafka and Spark
Stars: ✭ 303 (-69.27%)
Mutual labels:  jupyter-notebook, spark, data-analysis
The Elements Of Statistical Learning Python Notebooks
A series of Python Jupyter notebooks that help you better understand "The Elements of Statistical Learning" book
Stars: ✭ 405 (-58.92%)
Quantitative Notebooks
Educational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (-63.89%)
Agile data code 2
Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (-58.11%)
Mutual labels:  jupyter-notebook, data-science, spark
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+744.73%)
Youtube Like Predictor
YouTube Like Count Predictions using Machine Learning
Stars: ✭ 137 (-86.11%)
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-88.54%)
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-84.89%)
Pyspark Learning
Updated repository
Stars: ✭ 147 (-85.09%)
Mutual labels:  jupyter-notebook, spark, pyspark
Datasciencevm
Tools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-84.48%)
Loandefault Prediction
Lending Club Loan data analysis
Stars: ✭ 113 (-88.54%)
Covid19 Severity Prediction
Extensive and accessible COVID-19 data + forecasting for counties and hospitals. 📈
Stars: ✭ 170 (-82.76%)
Azure Cosmosdb Spark
Apache Spark Connector for Azure Cosmos DB
Stars: ✭ 165 (-83.27%)
Mutual labels:  jupyter-notebook, spark, pyspark
Web Database Analytics
Web scraping and related analytics using Python tools
Stars: ✭ 175 (-82.25%)
Scalable Data Science Platform
Content for architecting a data science platform for products using Luigi, Spark & Flask.
Stars: ✭ 158 (-83.98%)
Mutual labels:  jupyter-notebook, data-science, spark
Mydatascienceportfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-76.98%)
Mutual labels:  jupyter-notebook, data-science, spark
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-77.89%)
Python Bigdata
Data science and Big Data with Python
Stars: ✭ 112 (-88.64%)
Mutual labels:  jupyter-notebook, data-science, spark
Cryptocurrency Analysis Python
Open-Source Tutorial For Analyzing and Visualizing Cryptocurrency Data
Stars: ✭ 278 (-71.81%)
Data Science Hacks
Data Science Hacks consists of tips and tricks to help you become a better data scientist. The hacks are for everyone, from beginner to advanced, and cover Python, Jupyter notebooks, pandas, and more.
Stars: ✭ 273 (-72.31%)
Data Science On Gcp
Source code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (-12.37%)
leaflet heatmap
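A simple visualization of Huzhou call data. Assuming the data volume is too large to draw the heatmap directly in the browser, the heatmap rendering step is moved offline for computation and analysis. After the data is computed in parallel with Apache Spark, Apache Spark is also used to render the heatmap, and leafletjs then loads the OpenStreetMap layer and the heatmap layer for good interactivity. With the current Apache Spark implementation of the rendering, parallel computation is slower than single-machine computation, possibly because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well. The Apache Spark heatmap rendering and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .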
Stars: ✭ 13 (-98.68%)
Mutual labels:  spark, bigdata, data-analysis
Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315 (-68.05%)
Articles
A repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (-64.5%)
Resources
PyMC3 educational resources
Stars: ✭ 930 (-5.68%)
data processing course
Some class materials for a data processing course using PySpark
Stars: ✭ 50 (-94.93%)
Mutual labels:  spark, bigdata, pyspark
Cookbook 2nd Code
Code of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (-45.13%)
Data Analysis And Machine Learning Projects
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Stars: ✭ 5,166 (+423.94%)
H2o 3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+473.63%)
Mutual labels:  jupyter-notebook, data-science, spark
Data Forge Ts
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (-1.93%)
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (-53.96%)
Sparkmagic
Jupyter magics and kernels for working with remote Spark clusters
Stars: ✭ 954 (-3.25%)
Mutual labels:  jupyter-notebook, spark, pyspark
Drugs Recommendation Using Reviews
Analyzing drug descriptions, conditions, and reviews, then recommending drugs for each patient health condition using deep learning models.
Stars: ✭ 35 (-96.45%)
Ml Da Coursera Yandex Mipt
Machine Learning and Data Analysis Coursera Specialization from Yandex and MIPT
Stars: ✭ 108 (-89.05%)
Cracking The Data Science Interview
A Collection of Cheatsheets, Books, Questions, and Portfolio For DS/ML Interview Prep
Stars: ✭ 672 (-31.85%)
Udacity-Data-Analyst-Nanodegree
Repository for the projects needed to complete the Data Analyst Nanodegree.
Stars: ✭ 31 (-96.86%)