All Projects → prosto → Similar Projects or Alternatives

1470 Open source projects that are alternatives of or similar to prosto

splink
Implementation of Fellegi-Sunter's canonical model of record linkage in Apache Spark, including EM algorithm to estimate parameters
Stars: ✭ 181 (+235.19%)
Mutual labels:  spark
Engezny
Engezny is a python package that quickly generates all possible charts from your dataframe and saves them for you, and engezny is only supporting now uni-parameter visualization using the pie, bar and barh visualizations.
Stars: ✭ 25 (-53.7%)
Mutual labels:  pandas
bigdata-fun
A complete (distributed) BigData stack, running in containers
Stars: ✭ 14 (-74.07%)
Mutual labels:  spark
visualize-data-with-python
A Jupyter notebook using some standard techniques for data science and data engineering to analyze data for the 2017 flooding in Houston, TX.
Stars: ✭ 60 (+11.11%)
Mutual labels:  spark
web-dashboard-demo
The following application contains the DevExpress Dashboard Component for Angular. The client side is hosted on the GitHub Pages and gets data from the server side that hosts on DevExpress.com.
Stars: ✭ 65 (+20.37%)
Mutual labels:  business-intelligence
tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+305.56%)
Mutual labels:  business-intelligence
alfred-gitignore
Create .gitignore files using Alfred
Stars: ✭ 15 (-72.22%)
Mutual labels:  workflow
pre-commit-dbt
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Stars: ✭ 149 (+175.93%)
Mutual labels:  business-intelligence
openverse-catalog
Identifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-50%)
Mutual labels:  spark
machine-learning-capstone-project
This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-48.15%)
Mutual labels:  pandas
sentry-spark
Apache Spark Sentry Integration
Stars: ✭ 14 (-74.07%)
Mutual labels:  spark
dominance-analysis
This package can be used for dominance analysis or Shapley Value Regression for finding relative importance of predictors on given dataset. This library can be used for key driver analysis or marginal resource allocation models.
Stars: ✭ 111 (+105.56%)
Mutual labels:  feature-engineering
alfred-packagist
Alfred workflow to search for PHP packages with Packagist
Stars: ✭ 21 (-61.11%)
Mutual labels:  workflow
tukio
Tukio is an event based workflow generator library
Stars: ✭ 27 (-50%)
Mutual labels:  workflow
mune
Simple stock price analytics
Stars: ✭ 14 (-74.07%)
Mutual labels:  pandas
carry
Python ETL(Extract-Transform-Load) tool / Data migration tool
Stars: ✭ 115 (+112.96%)
Mutual labels:  pandas
movingpandas-examples
Example notebooks illustrating MovingPandas use cases
Stars: ✭ 116 (+114.81%)
Mutual labels:  pandas
pandas-stubs
Pandas type stubs. Helps you type-check your code.
Stars: ✭ 84 (+55.56%)
Mutual labels:  pandas
ECG analysis
No description or website provided.
Stars: ✭ 32 (-40.74%)
Mutual labels:  data-processing
pypar
Efficient and scalable parallelism using the message passing interface (MPI) to handle big data and highly computational problems.
Stars: ✭ 66 (+22.22%)
Mutual labels:  map-reduce
kobe-every-shot-ever
A Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+22.22%)
Mutual labels:  pandas
support-tickets-classification
This case study shows how to create a model for text analysis and classification and deploy it as a web service in Azure cloud in order to automatically classify support tickets. This project is a proof of concept made by Microsoft (Commercial Software Engineering team) in collaboration with Endava http://endava.com/en
Stars: ✭ 142 (+162.96%)
Mutual labels:  pandas
es pandas
Read, write and update large scale pandas DataFrame with Elasticsearch
Stars: ✭ 34 (-37.04%)
Mutual labels:  pandas
git-commands-workflows
🚀 All the git commands and workflows you need to know
Stars: ✭ 50 (-7.41%)
Mutual labels:  workflow
flock
Flock: A Low-Cost Streaming Query Engine on FaaS Platforms
Stars: ✭ 232 (+329.63%)
Mutual labels:  olap
open-data-anonimizer
Python Data Anonymization & Masking Library For Data Science Tasks
Stars: ✭ 36 (-33.33%)
Mutual labels:  pandas
dashinator
Dashinator the daringly delightful dashboard. A replacement for dashing
Stars: ✭ 56 (+3.7%)
Mutual labels:  business-intelligence
xstate-viz
Visualizer for XState machines
Stars: ✭ 274 (+407.41%)
Mutual labels:  workflow
CaseManagement
CMMN engine implementation in dotnet core
Stars: ✭ 16 (-70.37%)
Mutual labels:  workflow
anesthetic
Nested Sampling post-processing and plotting
Stars: ✭ 34 (-37.04%)
Mutual labels:  pandas
streamlit-pandas-profiling
Pandas profiling component for Streamlit.
Stars: ✭ 135 (+150%)
Mutual labels:  pandas
parallel-corpora-tools
Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.
Stars: ✭ 35 (-35.19%)
Mutual labels:  data-processing
Dominando-Pandas
Este repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (-59.26%)
Mutual labels:  pandas
fer
Facial Expression Recognition
Stars: ✭ 32 (-40.74%)
Mutual labels:  pandas
gan tensorflow
Automatic feature engineering using Generative Adversarial Networks using TensorFlow.
Stars: ✭ 48 (-11.11%)
Mutual labels:  feature-engineering
rec-core
Data pipelining service
Stars: ✭ 19 (-64.81%)
Mutual labels:  data-processing
Python-Matematica
Explorando aspectos fundamentais da matemática com Python e Jupyter
Stars: ✭ 41 (-24.07%)
Mutual labels:  pandas
Chatistics
A WhatsApp Chat analyzer and statistics.
Stars: ✭ 32 (-40.74%)
Mutual labels:  pandas
tsioc
AOP, Ioc container, Boot framework, unit testing framework , activities workflow framework.
Stars: ✭ 15 (-72.22%)
Mutual labels:  workflow
skutil
NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for python). See more here: https://tgsmith61591.github.io/skutil
Stars: ✭ 29 (-46.3%)
Mutual labels:  pandas
Arch-Data-Science
Archlinux PKGBUILDs for Data Science, Machine Learning, Deep Learning, NLP and Computer Vision
Stars: ✭ 92 (+70.37%)
Mutual labels:  pandas
automile-php
Automile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 28 (-48.15%)
Mutual labels:  business-intelligence
featurewiz
Use advanced feature engineering strategies and select best features from your data set with a single line of code.
Stars: ✭ 229 (+324.07%)
Mutual labels:  feature-engineering
datart
Datart is a next generation Data Visualization Open Platform
Stars: ✭ 1,042 (+1829.63%)
Mutual labels:  business-intelligence
mindware
An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
Stars: ✭ 34 (-37.04%)
Mutual labels:  feature-engineering
Data-Analyst-Nanodegree
This repo consists of the projects that I completed as a part of the Udacity's Data Analyst Nanodegree's curriculum.
Stars: ✭ 13 (-75.93%)
Mutual labels:  data-wrangling
pulserl
Apache Pulsar client library for Erlang/Elixir
Stars: ✭ 15 (-72.22%)
Mutual labels:  data-processing
exemplary-ml-pipeline
Exemplary, annotated machine learning pipeline for any tabular data problem.
Stars: ✭ 23 (-57.41%)
Mutual labels:  feature-engineering
spark-acid
ACID Data Source for Apache Spark based on Hive ACID
Stars: ✭ 91 (+68.52%)
Mutual labels:  spark
automile-net
Automile offers a simple, smart, cutting-edge telematics solution for businesses to track and manage their business vehicles.
Stars: ✭ 24 (-55.56%)
Mutual labels:  business-intelligence
excel-to-python-course
Student materials and handouts for Excel to Python course
Stars: ✭ 73 (+35.19%)
Mutual labels:  pandas
pantab
Read/Write pandas DataFrames with Tableau Hyper Extracts
Stars: ✭ 64 (+18.52%)
Mutual labels:  pandas
iSkyLIMS
is an open-source LIMS (laboratory Information Management System) for Next Generation Sequencing sample management, statistics and reports, and bioinformatics analysis service management.
Stars: ✭ 33 (-38.89%)
Mutual labels:  workflow
five-minute-midas
Predicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-24.07%)
Mutual labels:  pandas
stargate
An Apache Pulsar client written in Elixir
Stars: ✭ 33 (-38.89%)
Mutual labels:  data-processing
release-notify-action
GitHub Action that triggers e-mails with release notes when these are created
Stars: ✭ 64 (+18.52%)
Mutual labels:  workflow
jekyll-deploy-action
🪂 A Github Action to deploy the Jekyll site conveniently for GitHub Pages.
Stars: ✭ 162 (+200%)
Mutual labels:  workflow
Google-DSC-Platform-Extension
Hello DSC Leads, Automate your process of adding attendees manually.
Stars: ✭ 16 (-70.37%)
Mutual labels:  pandas
ydata-quality
Data Quality assessment with one line of code
Stars: ✭ 311 (+475.93%)
Mutual labels:  pandas
klar-EDA
A python library for automated exploratory data analysis
Stars: ✭ 15 (-72.22%)
Mutual labels:  data-preprocessing
301-360 of 1470 similar projects