Data Forge Ts: The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Stars: ✭ 967 (+2002.17%)
whyqd: Data wrangling with simplicity, complete audit transparency, and speed
Stars: ✭ 16 (-65.22%)
Skdata: Python tools for data analysis
Stars: ✭ 16 (-65.22%)
Pandas Profiling: Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+18006.52%)
Cookbook 2nd: IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+1430.43%)
Riceteacatpanda: Repo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-60.87%)
Wildfirepy: WildfirePy, a Python library for wildfire GIS data analysis.
Stars: ✭ 21 (-54.35%)
Drugs Recommendation Using Reviews: Analyzing drug descriptions, conditions, and reviews, then recommending drugs for each patient health condition using deep learning models.
Stars: ✭ 35 (-23.91%)
Nfstream: NFStream, a flexible network data analysis framework.
Stars: ✭ 622 (+1252.17%)
Vectorbt: Ultimate Python library for time series analysis and backtesting at scale
Stars: ✭ 855 (+1758.7%)
Imbalanced Learn: A Python package to tackle the curse of imbalanced datasets in machine learning
Stars: ✭ 5,617 (+12110.87%)
Model Describer: model-describer, making machine learning interpretable to humans
Stars: ✭ 22 (-52.17%)
Raio X: 📊 Data analysis of the women in the Computer Science program at UFCG
Stars: ✭ 18 (-60.87%)
Pyamplitude: A Python connector for Amplitude Analytics
Stars: ✭ 16 (-65.22%)
Kodiak: Enhance your feature engineering workflow with Kodiak
Stars: ✭ 20 (-56.52%)
Getting Started: This repository is a getting started guide to Singer.
Stars: ✭ 734 (+1495.65%)
Mdcs Py: MDCS stands for Mosaic Dataset Configuration Script. It is the entry point to a collection of Python classes and libraries that a Python client application can use to complete a workflow for creating a mosaic dataset, populating it with data, and setting all required or desired parameters.
Stars: ✭ 38 (-17.39%)
Terraformer: A geographic toolkit for dealing with geometry, geography, formats, and building geo databases
Stars: ✭ 643 (+1297.83%)
Pyda 2e Zh: 📖 [Chinese translation] Python for Data Analysis, 2nd Edition
Stars: ✭ 866 (+1782.61%)
Siuba: Python library for using dplyr-like syntax with pandas and SQL
Stars: ✭ 605 (+1215.22%)
Apogee: Tools for dealing with APOGEE data
Stars: ✭ 34 (-26.09%)
Dataflowjavasdk: Google Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (+1756.52%)
Qs ledger: Quantified Self Personal Data Aggregator and Data Analysis
Stars: ✭ 559 (+1115.22%)
Optimus: 🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+2043.48%)
Pandas: Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Stars: ✭ 32,029 (+69528.26%)
Nanny: A tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-63.04%)
Ether sql: A Python library to push Ethereum blockchain data into an SQL database.
Stars: ✭ 41 (-10.87%)
Crunchbase Ml: Mergers and acquisitions prediction based on M&A information from Crunchbase.
Stars: ✭ 20 (-56.52%)
Dataframe: C++ DataFrame for statistical, financial, and ML analysis, written in modern C++ using native types and continuous memory storage, with no pointers involved
Stars: ✭ 828 (+1700%)
Janitor: Simple tools for data cleaning in R
Stars: ✭ 981 (+2032.61%)
Statsmodels: Statistical modeling and econometrics in Python
Stars: ✭ 6,935 (+14976.09%)
Mathematicavsr: Example projects, code, and documents for comparing Mathematica with R.
Stars: ✭ 41 (-10.87%)
Metabase: The simplest, fastest way to get business intelligence and analytics to everyone in your company 😋
Stars: ✭ 26,803 (+58167.39%)
Eda miner: A Swiss army knife for visualization, analytics, and machine learning. View docs here: http://edaminer.com/docs/ and a demo (don't abuse) here: http://edaminer.com/
Stars: ✭ 13 (-71.74%)
Dataproofer: A proofreader for your data
Stars: ✭ 628 (+1265.22%)
Isacreator: ISAcreator is a Java desktop application for creating and editing ISA-Tab files. Originally developed by Eamonn Maguire, with further contributions by Alejandra Gonzalez-Beltran, David Johnson and Philippe Rocca-Serra (Uni. of Oxford).
Stars: ✭ 34 (-26.09%)
Elki: ELKI Data Mining Toolkit
Stars: ✭ 613 (+1232.61%)
Data Science On Gcp: Source code accompanying the book Data Science on the Google Cloud Platform, by Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1778.26%)
Geometry Api Java: The Esri Geometry API for Java enables developers to write custom applications for analysis of spatial data. This API is used in the Esri GIS Tools for Hadoop and other 3rd-party data processing solutions.
Stars: ✭ 585 (+1171.74%)
Musictaster: A practical implementation of song2vec and artist2vec
Stars: ✭ 38 (-17.39%)
Alluxio: Alluxio, data orchestration for analytics and machine learning in the cloud
Stars: ✭ 5,379 (+11593.48%)
Pachyderm: Reproducible Data Science at Scale!
Stars: ✭ 5,305 (+11432.61%)
Mlcourse.ai: Open Machine Learning Course
Stars: ✭ 7,963 (+17210.87%)
Socrat: A Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-43.48%)
Data Selfie: A browser extension to track yourself on Facebook and analyze your data.
Stars: ✭ 1,009 (+2093.48%)
Pytim: A Python package for the interfacial analysis of molecular simulations
Stars: ✭ 38 (-17.39%)
Rshrf: rsHRF, a toolbox for resting state HRF deconvolution and connectivity analysis (MATLAB)
Stars: ✭ 33 (-28.26%)