All Projects → prosto → Similar Projects or Alternatives

1470 Open source projects that are alternatives of or similar to prosto

incubator-linkis
Linkis helps easily connect to various back-end computation/storage engines(Spark, Python, TiDB...), exposes various interfaces(REST, JDBC, Java ...), with multi-tenancy, high performance, and resource control.
Stars: ✭ 2,459 (+4453.7%)
Mutual labels:  spark
benten
A language server for Common Workflow Language
Stars: ✭ 50 (-7.41%)
Mutual labels:  workflow
DataCon
🏆DataCon大数据安全分析大赛,2019年方向二(恶意代码检测)冠军源码、2020年方向五(恶意代码分析)季军源码
Stars: ✭ 69 (+27.78%)
Mutual labels:  feature-engineering
advanced-data-wrangling-in-R-legacy
Advanced-data-wrangling-in-R, Workshop
Stars: ✭ 14 (-74.07%)
Mutual labels:  data-wrangling
monthly-returns-heatmap
Python Monthly Returns Heatmap (DEPRECATED! Use QuantStats instead)
Stars: ✭ 23 (-57.41%)
Mutual labels:  pandas
Springboard-DataScienceTrack-Student
Springboard Program: Data Science Career Track - NLP
Stars: ✭ 92 (+70.37%)
Mutual labels:  data-wrangling
Information-Retrieval
Information Retrieval algorithms developed in python. To follow the blog posts, click on the link:
Stars: ✭ 103 (+90.74%)
Mutual labels:  pandas
obsplus
A Pandas-Centric ObsPy Expansion Pack
Stars: ✭ 28 (-48.15%)
Mutual labels:  pandas
bootstrap-gulp-starter-template
Bootstrap 4 + Gulp 4 + Panini for improve front-end development workflow
Stars: ✭ 67 (+24.07%)
Mutual labels:  workflow
wakatime-to-toggl
📩 Sync your WakaTime data in Toggl
Stars: ✭ 23 (-57.41%)
Mutual labels:  workflow
frovedis
Framework of vectorized and distributed data analytics
Stars: ✭ 59 (+9.26%)
Mutual labels:  spark
alfred-mailto
Send emails to recipients and groups from Alfred
Stars: ✭ 59 (+9.26%)
Mutual labels:  workflow
Python-for-data-analysis
No description or website provided.
Stars: ✭ 18 (-66.67%)
Mutual labels:  pandas
PandasVersusExcel
Python数据分析入门,数据分析师入门
Stars: ✭ 120 (+122.22%)
Mutual labels:  pandas
Data-Science-Tutorials
Python Tutorials for Data Science
Stars: ✭ 104 (+92.59%)
Mutual labels:  pandas
Chapter-2
Code examples for Chapter 2 of Data Wrangling with JavaScript
Stars: ✭ 16 (-70.37%)
Mutual labels:  data-wrangling
alfred-workflow
No description or website provided.
Stars: ✭ 26 (-51.85%)
Mutual labels:  workflow
pytd
Treasure Data Driver for Python
Stars: ✭ 15 (-72.22%)
Mutual labels:  pandas
elegant-git
Elegant Git is an assistant who carefully automates routine work with Git.
Stars: ✭ 38 (-29.63%)
Mutual labels:  workflow
tutorials
Short programming tutorials pertaining to data analysis.
Stars: ✭ 14 (-74.07%)
Mutual labels:  pandas
BigData-News
基于Spark2.2新闻网大数据实时系统项目
Stars: ✭ 36 (-33.33%)
Mutual labels:  spark
DataProfiler
What's in your data? Extract schema, statistics and entities from datasets
Stars: ✭ 843 (+1461.11%)
Mutual labels:  pandas
blog
blog entries
Stars: ✭ 39 (-27.78%)
Mutual labels:  spark
quickstep
Quickstep project
Stars: ✭ 22 (-59.26%)
Mutual labels:  olap
zenaton-ruby
💎 Ruby gem to run and orchestrate background jobs with Zenaton Workflow Engine
Stars: ✭ 32 (-40.74%)
Mutual labels:  workflow
onelinerhub
2.5k code solutions with clear explanation @ onelinerhub.com
Stars: ✭ 645 (+1094.44%)
Mutual labels:  pandas
leaflet heatmap
简单的可视化湖州通话数据 假设数据量很大,没法用浏览器直接绘制热力图,把绘制热力图这一步骤放到线下计算分析。使用Apache Spark并行计算数据之后,再使用Apache Spark绘制热力图,然后用leafletjs加载OpenStreetMap图层和热力图图层,以达到良好的交互效果。现在使用Apache Spark实现绘制,可能是Apache Spark不擅长这方面的计算或者是我没有设计好算法,并行计算的速度比不上单机计算。Apache Spark绘制热力图和计算代码在这 https://github.com/yuanzhaokang/ParallelizeHeatmap.git .
Stars: ✭ 13 (-75.93%)
Mutual labels:  spark
Bike-Sharing-Demand-Kaggle
Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand
Stars: ✭ 33 (-38.89%)
Mutual labels:  feature-engineering
spreadsheets-to-dataframes
Pycon 2021 Tutorial to help Spreadsheet (Excel) Users learn Python
Stars: ✭ 30 (-44.44%)
Mutual labels:  pandas
autoencoders tensorflow
Automatic feature engineering using deep learning and Bayesian inference using TensorFlow.
Stars: ✭ 66 (+22.22%)
Mutual labels:  feature-engineering
mimir
Data-ish exploration through SQL+Uncertainty
Stars: ✭ 26 (-51.85%)
Mutual labels:  data-wrangling
weaverbird
A visual data pipeline builder with various backends
Stars: ✭ 65 (+20.37%)
Mutual labels:  pandas
ACEseqWorkflow
Allele-specific copy number estimation with whole genome sequencing
Stars: ✭ 19 (-64.81%)
Mutual labels:  workflow
toucan-connectors
Connectors available to retrieve data in Toucan Toco small apps
Stars: ✭ 13 (-75.93%)
Mutual labels:  pandas
kafka-compose
🎼 Docker compose files for various kafka stacks
Stars: ✭ 32 (-40.74%)
Mutual labels:  spark
fal
do more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
Stars: ✭ 567 (+950%)
Mutual labels:  pandas
swordfish
Open-source distribute workflow schedule tools, also support streaming task.
Stars: ✭ 35 (-35.19%)
Mutual labels:  spark
my curd
超轻量 快速开发脚手架、流程平台。
Stars: ✭ 38 (-29.63%)
Mutual labels:  workflow
spark-util
low-level helpers for Apache Spark libraries and tests
Stars: ✭ 16 (-70.37%)
Mutual labels:  spark
traceml
Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.
Stars: ✭ 445 (+724.07%)
Mutual labels:  data-processing
sparkar-volts
An extensive non-reactive Typescript framework that eases the development experience in Spark AR
Stars: ✭ 15 (-72.22%)
Mutual labels:  spark
chatstats
💬📊 Fun data visualizations for Facebook Messenger chats
Stars: ✭ 18 (-66.67%)
Mutual labels:  pandas
Python-Data-Wrangling
D-Lab's 3 hour introduction to data wrangling in Python. Learn how to import and manipulate dataframes using pandas in Python.
Stars: ✭ 41 (-24.07%)
Mutual labels:  pandas
query2report
Query2Report is a simple open source business intelligence platform that allows users to build report/dashboard for business analytics or enterprise reporting
Stars: ✭ 43 (-20.37%)
Mutual labels:  business-intelligence
experiments
Code examples for my blog posts
Stars: ✭ 21 (-61.11%)
Mutual labels:  spark
web-dashboard-demo
The following application contains the DevExpress Dashboard Component for Angular. The client side is hosted on the GitHub Pages and gets data from the server side that hosts on DevExpress.com.
Stars: ✭ 65 (+20.37%)
Mutual labels:  business-intelligence
tellery
Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Stars: ✭ 219 (+305.56%)
Mutual labels:  business-intelligence
alfred-gitignore
Create .gitignore files using Alfred
Stars: ✭ 15 (-72.22%)
Mutual labels:  workflow
pre-commit-dbt
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Stars: ✭ 149 (+175.93%)
Mutual labels:  business-intelligence
openverse-catalog
Identifies and collects data on cc-licensed content across web crawl data and public apis.
Stars: ✭ 27 (-50%)
Mutual labels:  spark
machine-learning-capstone-project
This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning
Stars: ✭ 28 (-48.15%)
Mutual labels:  pandas
harlan
Harlan é o sistema modular que permite você automatizar toda sua governança cadastral da nuvem.
Stars: ✭ 25 (-53.7%)
Mutual labels:  business-intelligence
outside-collaborators
Automatically Manage Outside Collaborators Organization-wide
Stars: ✭ 45 (-16.67%)
Mutual labels:  workflow
Papers4DataAchitect
Collect papers for data engineering such as OLTP/OLAP/ETL/DistributedStorage.
Stars: ✭ 17 (-68.52%)
Mutual labels:  olap
bitnami-docker-airflow-scheduler
Bitnami Docker Image for Apache Airflow Scheduler
Stars: ✭ 19 (-64.81%)
Mutual labels:  workflow
pantab
Read/Write pandas DataFrames with Tableau Hyper Extracts
Stars: ✭ 64 (+18.52%)
Mutual labels:  pandas
iSkyLIMS
is an open-source LIMS (laboratory Information Management System) for Next Generation Sequencing sample management, statistics and reports, and bioinformatics analysis service management.
Stars: ✭ 33 (-38.89%)
Mutual labels:  workflow
five-minute-midas
Predicting Profitable Day Trading Positions using Decision Tree Classifiers. scikit-learn | Flask | SQLite3 | pandas | MLflow | Heroku | Streamlit
Stars: ✭ 41 (-24.07%)
Mutual labels:  pandas
stargate
An Apache Pulsar client written in Elixir
Stars: ✭ 33 (-38.89%)
Mutual labels:  data-processing
release-notify-action
GitHub Action that triggers e-mails with release notes when these are created
Stars: ✭ 64 (+18.52%)
Mutual labels:  workflow
241-300 of 1470 similar projects