
1470 open source projects that are alternatives to or similar to prosto

Linkis helps you connect easily to various back-end computation/storage engines (Spark, Python, TiDB...) and exposes various interfaces (REST, JDBC, Java...), with multi-tenancy, high performance, and resource control.

Stars: ✭ 2,459 (+4453.7%)

Mutual labels: spark

benten

A language server for Common Workflow Language

Stars: ✭ 50 (-7.41%)

Mutual labels: workflow

DataCon

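🏆DataCon Big Data Security Analysis Competition: champion source code for 2019 Track 2 (malicious code detection) and third-place source code for 2020 Track 5 (malicious code analysis)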

Stars: ✭ 69 (+27.78%)

Mutual labels: feature-engineering

advanced-data-wrangling-in-R-legacy

Advanced-data-wrangling-in-R, Workshop

Stars: ✭ 14 (-74.07%)

Mutual labels: data-wrangling

monthly-returns-heatmap

Python Monthly Returns Heatmap (DEPRECATED! Use QuantStats instead)

Stars: ✭ 23 (-57.41%)

Mutual labels: pandas

Springboard-DataScienceTrack-Student

Springboard Program: Data Science Career Track - NLP

Stars: ✭ 92 (+70.37%)

Mutual labels: data-wrangling

Information-Retrieval

Information Retrieval algorithms developed in python. To follow the blog posts, click on the link:

Stars: ✭ 103 (+90.74%)

Mutual labels: pandas

obsplus

A Pandas-Centric ObsPy Expansion Pack

Stars: ✭ 28 (-48.15%)

Mutual labels: pandas

bootstrap-gulp-starter-template

Bootstrap 4 + Gulp 4 + Panini to improve front-end development workflow

Stars: ✭ 67 (+24.07%)

Mutual labels: workflow

wakatime-to-toggl

📩 Sync your WakaTime data in Toggl

Stars: ✭ 23 (-57.41%)

Mutual labels: workflow

frovedis

Framework of vectorized and distributed data analytics

Stars: ✭ 59 (+9.26%)

Mutual labels: spark

alfred-mailto

Send emails to recipients and groups from Alfred

Stars: ✭ 59 (+9.26%)

Mutual labels: workflow

Python-for-data-analysis

No description or website provided.

Stars: ✭ 18 (-66.67%)

Mutual labels: pandas

PandasVersusExcel

Introduction to data analysis with Python; an introduction for data analysts

Stars: ✭ 120 (+122.22%)

Mutual labels: pandas

Data-Science-Tutorials

Python Tutorials for Data Science

Stars: ✭ 104 (+92.59%)

Mutual labels: pandas

Chapter-2

Code examples for Chapter 2 of Data Wrangling with JavaScript

Stars: ✭ 16 (-70.37%)

Mutual labels: data-wrangling

alfred-workflow

No description or website provided.

Stars: ✭ 26 (-51.85%)

Mutual labels: workflow

pytd

Treasure Data Driver for Python

Stars: ✭ 15 (-72.22%)

Mutual labels: pandas

elegant-git

Elegant Git is an assistant who carefully automates routine work with Git.

Stars: ✭ 38 (-29.63%)

Mutual labels: workflow

tutorials

Short programming tutorials pertaining to data analysis.

Stars: ✭ 14 (-74.07%)

Mutual labels: pandas

BigData-News

Real-time big data system project for a news site, based on Spark 2.2

Stars: ✭ 36 (-33.33%)

Mutual labels: spark

DataProfiler

What's in your data? Extract schema, statistics and entities from datasets

Stars: ✭ 843 (+1461.11%)

Mutual labels: pandas

blog

blog entries

Stars: ✭ 39 (-27.78%)

Mutual labels: spark

quickstep

Quickstep project

Stars: ✭ 22 (-59.26%)

Mutual labels: olap

zenaton-ruby

💎 Ruby gem to run and orchestrate background jobs with Zenaton Workflow Engine

Stars: ✭ 32 (-40.74%)

Mutual labels: workflow

onelinerhub

2.5k code solutions with clear explanation @ onelinerhub.com

Stars: ✭ 645 (+1094.44%)

Mutual labels: pandas

leaflet heatmap

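A simple visualization of Huzhou call data. Assuming the data volume is too large to draw a heatmap directly in the browser, the heatmap drawing step is moved offline for computation and analysis. After the data is computed in parallel with Apache Spark, the heatmap is also drawn with Apache Spark, and leafletjs then loads the OpenStreetMap layer and the heatmap layer to achieve good interactivity. With drawing currently implemented in Apache Spark, parallel computation is slower than single-machine computation, possibly because Apache Spark is not well suited to this kind of computation or because I did not design the algorithm well. The Apache Spark heatmap drawing and computation code is at https://github.com/yuanzhaokang/ParallelizeHeatmap.git .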

Stars: ✭ 13 (-75.93%)

Mutual labels: spark

Bike-Sharing-Demand-Kaggle

Top 5th percentile solution to the Kaggle knowledge problem - Bike Sharing Demand

Stars: ✭ 33 (-38.89%)

Mutual labels: feature-engineering

spreadsheets-to-dataframes

PyCon 2021 tutorial to help spreadsheet (Excel) users learn Python

Stars: ✭ 30 (-44.44%)

Mutual labels: pandas

autoencoders tensorflow

Automatic feature engineering using deep learning and Bayesian inference using TensorFlow.

Stars: ✭ 66 (+22.22%)

Mutual labels: feature-engineering

mimir

Data-ish exploration through SQL+Uncertainty

Stars: ✭ 26 (-51.85%)

Mutual labels: data-wrangling

weaverbird

A visual data pipeline builder with various backends

Stars: ✭ 65 (+20.37%)

Mutual labels: pandas

ACEseqWorkflow

Allele-specific copy number estimation with whole genome sequencing

Stars: ✭ 19 (-64.81%)

Mutual labels: workflow

toucan-connectors

Connectors available to retrieve data in Toucan Toco small apps

Stars: ✭ 13 (-75.93%)

Mutual labels: pandas

kafka-compose

🎼 Docker compose files for various kafka stacks

Stars: ✭ 32 (-40.74%)

Mutual labels: spark

fal

do more with dbt. fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.

Stars: ✭ 567 (+950%)

Mutual labels: pandas

swordfish

Open-source distributed workflow scheduling tool that also supports streaming tasks.

Stars: ✭ 35 (-35.19%)

Mutual labels: spark

my curd

Ultra-lightweight rapid-development scaffold and workflow platform.

Stars: ✭ 38 (-29.63%)

Mutual labels: workflow

spark-util

low-level helpers for Apache Spark libraries and tests

Stars: ✭ 16 (-70.37%)

Mutual labels: spark

traceml

Engine for ML/Data tracking, visualization, dashboards, and model UI for Polyaxon.

Stars: ✭ 445 (+724.07%)

Mutual labels: data-processing

sparkar-volts

An extensive non-reactive TypeScript framework that eases the development experience in Spark AR

Stars: ✭ 15 (-72.22%)

Mutual labels: spark

chatstats

💬📊 Fun data visualizations for Facebook Messenger chats

Stars: ✭ 18 (-66.67%)

Mutual labels: pandas

Python-Data-Wrangling

D-Lab's 3 hour introduction to data wrangling in Python. Learn how to import and manipulate dataframes using pandas in Python.

Stars: ✭ 41 (-24.07%)

Mutual labels: pandas

query2report

Query2Report is a simple open source business intelligence platform that allows users to build report/dashboard for business analytics or enterprise reporting

Stars: ✭ 43 (-20.37%)

Mutual labels: business-intelligence

experiments

Code examples for my blog posts

Stars: ✭ 21 (-61.11%)

Mutual labels: spark

web-dashboard-demo

The following application contains the DevExpress Dashboard Component for Angular. The client side is hosted on GitHub Pages and gets data from the server side, which is hosted on DevExpress.com.

Stars: ✭ 65 (+20.37%)

Mutual labels: business-intelligence

tellery

Tellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.

Stars: ✭ 219 (+305.56%)

Mutual labels: business-intelligence

alfred-gitignore

Create .gitignore files using Alfred

Stars: ✭ 15 (-72.22%)

Mutual labels: workflow

pre-commit-dbt

🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.

Stars: ✭ 149 (+175.93%)

Mutual labels: business-intelligence

openverse-catalog

Identifies and collects data on CC-licensed content across web crawl data and public APIs.

Stars: ✭ 27 (-50%)

Mutual labels: spark

machine-learning-capstone-project

This is the final project for the Udacity Machine Learning Nanodegree: Predicting article retweets and likes based on the title using Machine Learning

Stars: ✭ 28 (-48.15%)

Mutual labels: pandas

harlan

Harlan is the modular system that lets you automate all your registration-data governance from the cloud.

Stars: ✭ 25 (-53.7%)

Mutual labels: business-intelligence

outside-collaborators

Automatically Manage Outside Collaborators Organization-wide

Stars: ✭ 45 (-16.67%)

Mutual labels: workflow

Papers4DataAchitect

Collect papers for data engineering such as OLTP/OLAP/ETL/DistributedStorage.

Stars: ✭ 17 (-68.52%)

Mutual labels: olap

bitnami-docker-airflow-scheduler

Bitnami Docker Image for Apache Airflow Scheduler

Stars: ✭ 19 (-64.81%)

Mutual labels: workflow

pantab

Read/Write pandas DataFrames with Tableau Hyper Extracts

Stars: ✭ 64 (+18.52%)

Mutual labels: pandas

iSkyLIMS

iSkyLIMS is an open-source LIMS (Laboratory Information Management System) for Next Generation Sequencing sample management, statistics and reports, and bioinformatics analysis service management.

Stars: ✭ 33 (-38.89%)

Mutual labels: workflow

five-minute-midas

Stars: ✭ 41 (-24.07%)

Mutual labels: pandas

stargate

An Apache Pulsar client written in Elixir

Stars: ✭ 33 (-38.89%)

Mutual labels: data-processing

release-notify-action

GitHub Action that triggers e-mails with release notes when these are created

Stars: ✭ 64 (+18.52%)

Mutual labels: workflow
