SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+59.98%)
Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-81.16%)
AirbyteAirbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
Stars: ✭ 4,919 (+325.15%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+140.71%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-93.69%)
KlibEasy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (-83.41%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+295.94%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (-72.77%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-60.76%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+358.51%)
Qs ledgerQuantified Self Personal Data Aggregator and Data Analysis
Stars: ✭ 559 (-51.69%)
Ananas DesktopA hackable data integration & analysis tool to enable non technical users to edit data processing jobs and visualise data on demand.
Stars: ✭ 551 (-52.38%)
Yugabyte DbThe high-performance distributed SQL database for global, internet-scale apps.
Stars: ✭ 5,890 (+409.08%)
Tiny tdsTinyTDS - Simple and fast FreeTDS bindings for Ruby using DB-Library.
Stars: ✭ 575 (-50.3%)
Tsne CudaGPU Accelerated t-SNE for CUDA with Python bindings
Stars: ✭ 1,120 (-3.2%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+385.48%)
SiubaPython library for using dplyr like syntax with pandas and SQL
Stars: ✭ 605 (-47.71%)
Beekeeper StudioModern and easy to use SQL client for MySQL, Postgres, SQLite, SQL Server, and more. Linux, MacOS, and Windows.
Stars: ✭ 8,053 (+596.02%)
JailerDatabase Subsetting and Relational Data Browsing Tool.
Stars: ✭ 576 (-50.22%)
ElkiELKI Data Mining Toolkit
Stars: ✭ 613 (-47.02%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (-45.55%)
DataprooferA proofreader for your data
Stars: ✭ 628 (-45.72%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (-45.29%)
Node SqliteSQLite client for Node.js applications with SQL-based migrations API written in Typescript
Stars: ✭ 642 (-44.51%)
NfstreamNFStream: a Flexible Network Data Analysis Framework.
Stars: ✭ 622 (-46.24%)
GenjiDocument-oriented, embedded SQL database
Stars: ✭ 636 (-45.03%)
CovenantsqlA decentralized, trusted, high performance, SQL database with blockchain features
Stars: ✭ 1,148 (-0.78%)
RoughvizReusable JavaScript library for creating sketchy/hand-drawn styled charts in the browser.
Stars: ✭ 6,022 (+420.48%)
TidbTiDB is an open source distributed HTAP database compatible with the MySQL protocol
Stars: ✭ 29,871 (+2481.76%)
EngsoccerdataEnglish and European soccer results 1871-2020
Stars: ✭ 615 (-46.85%)
GodbA Go SQL query builder and struct mapper.
Stars: ✭ 651 (-43.73%)
BaikaldbBaikalDB, A Distributed HTAP Database.
Stars: ✭ 707 (-38.89%)
Directus DockerDirectus 6 Docker — Legacy Container [EOL]
Stars: ✭ 68 (-94.12%)
MigrateDatabase migrations. CLI and Golang library.
Stars: ✭ 7,712 (+566.55%)
Daru Viewdaru-view is for easy and interactive plotting in web application & IRuby notebook. daru-view is a plugin gem to the existing daru gem.
Stars: ✭ 65 (-94.38%)
Db DumperDump the contents of a database
Stars: ✭ 744 (-35.7%)
EralchemyEntity Relation Diagrams generation tool
Stars: ✭ 767 (-33.71%)
ImdbpyIMDbPY is a Python package useful to retrieve and manage the data of the IMDb movie database about movies, people, characters and companies
Stars: ✭ 792 (-31.55%)
Efcore.pgEntity Framework Core provider for PostgreSQL
Stars: ✭ 838 (-27.57%)
DataframeC++ DataFrame for statistical, Financial, and ML analysis -- in modern C++ using native types, continuous memory storage, and no pointers are involved
Stars: ✭ 828 (-28.44%)
Event Managementhelps to register an users for on events conducted in college fests with simple logic with secured way
Stars: ✭ 65 (-94.38%)
Getting StartedThis repository is a getting started guide to Singer.
Stars: ✭ 734 (-36.56%)
SkdataPython tools for data analysis
Stars: ✭ 16 (-98.62%)
Raio X📊 Análise de dados das mulheres do curso de Ciência da Computação na UFCG
Stars: ✭ 18 (-98.44%)
BiolitmapCode for the paper "BIOLITMAP: a web-based geolocated and temporal visualization of the evolution of bioinformatics publications" in Oxford Bioinformatics.
Stars: ✭ 18 (-98.44%)
VerticapyVerticaPy is a Python library that exposes sci-kit like functionality to conduct data science projects on data stored in Vertica, thus taking advantage Vertica’s speed and built-in analytics and machine learning capabilities.
Stars: ✭ 59 (-94.9%)
Go SqlbuilderA flexible and powerful SQL string builder library plus a zero-config ORM.
Stars: ✭ 539 (-53.41%)
Nano SqlUniversal database layer for the client, server & mobile devices. It's like Lego for databases.
Stars: ✭ 717 (-38.03%)
Sql StreamsPainless low level jdbc abstraction using the java 8 stream api.
Stars: ✭ 17 (-98.53%)
Reiner萊納 - A MySQL wrapper which might be better than the ORMs and written in Golang
Stars: ✭ 19 (-98.36%)
ResourcesPyMC3 educational resources
Stars: ✭ 930 (-19.62%)