Data-ScienceUsing Kaggle Data and Real World Data for Data Science and prediction in Python, R, Excel, Power BI, and Tableau.
Stars: ✭ 15 (-66.67%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+18408.89%)
highdimStatistics for high-dimensional data (homogeneity, sphericity, independence, spherical uniformity)
Stars: ✭ 16 (-64.44%)
ImpyImpy is a Python3 library with features that help you in your computer vision tasks.
Stars: ✭ 109 (+142.22%)
KaggleKaggle Kernels (Python, R, Jupyter Notebooks)
Stars: ✭ 26 (-42.22%)
CorBinianCorBinian: A toolbox for modelling and simulating high-dimensional binary and count-data with correlations
Stars: ✭ 15 (-66.67%)
DataprepDataPrep — The easiest way to prepare data in Python
Stars: ✭ 639 (+1320%)
ppmlhdfePoisson pseudo-likelihood regression with multiple levels of fixed effects
Stars: ✭ 46 (+2.22%)
DatavisualizationTutorials on visualizing data using python packages like bokeh, plotly, seaborn and igraph
Stars: ✭ 234 (+420%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+6088.89%)
Scikit PosthocsMultiple Pairwise Comparisons (Post Hoc) Tests in Python
Stars: ✭ 186 (+313.33%)
ECharts.jlJulia package for the Apache ECharts v4 visualization library
Stars: ✭ 80 (+77.78%)
Ee OutliersOpen-source framework to detect outliers in Elasticsearch events
Stars: ✭ 172 (+282.22%)
VisdatPreliminary Exploratory Visualisation of Data
Stars: ✭ 377 (+737.78%)
Gitinspector📊 The statistical analysis tool for git repositories
Stars: ✭ 2,058 (+4473.33%)
Inspectdf🛠️ 📊 Tools for Exploring and Comparing Data Frames
Stars: ✭ 195 (+333.33%)
MethylkitR package for DNA methylation analysis
Stars: ✭ 116 (+157.78%)
Autoeda ResourcesA list of software and papers related to automatic and fast Exploratory Data Analysis
Stars: ✭ 268 (+495.56%)
PycmMulti-class confusion matrix library in Python
Stars: ✭ 1,076 (+2291.11%)
arulesVizVisualizing Association Rules and Frequent Itemsets with R
Stars: ✭ 49 (+8.89%)
DatadoubleconfirmSimple datasets and notebooks for data visualization, statistical analysis and modelling - with write-ups here: http://projectosyo.wix.com/datadoubleconfirm.
Stars: ✭ 24 (-46.67%)
Pymc3Probabilistic Programming in Python: Bayesian Modeling and Probabilistic Machine Learning with Aesara
Stars: ✭ 6,214 (+13708.89%)
100 Days Of Ml CodeA day to day plan for this challenge. Covers both theoritical and practical aspects
Stars: ✭ 172 (+282.22%)
Git Quick Stats▁▅▆▃▅ Git quick statistics is a simple and efficient way to access various statistics in git repository.
Stars: ✭ 5,139 (+11320%)
olliePyOlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Stars: ✭ 46 (+2.22%)
Expan Open-source Python library for statistical analysis of randomised control trials (A/B tests)
Stars: ✭ 275 (+511.11%)
tkTk interface module using tcltklib
Stars: ✭ 106 (+135.56%)
THE-SPARKS-FOUNDATION📌 This repo. Contains Basic - Advance level Machine learning / business analysis Projects. 👨💻
Stars: ✭ 87 (+93.33%)
HandysparkHandySpark - bringing pandas-like capabilities to Spark dataframes
Stars: ✭ 158 (+251.11%)
Data-Scientist-In-PythonThis repository contains notes and projects of Data scientist track from dataquest course work.
Stars: ✭ 23 (-48.89%)
MetaOmGraphMetaOmGraph: a workbench for interactive exploratory data analysis of large expression datasets
Stars: ✭ 30 (-33.33%)
taucharts📊 An R htmlwidget interface to the TauCharts javascript library
Stars: ✭ 66 (+46.67%)
continuous BernoulliThere are C language computer programs about the simulator, transformation, and test statistic of continuous Bernoulli distribution. More than that, the book contains continuous Binomial distribution and continuous Trinomial distribution.
Stars: ✭ 22 (-51.11%)
skimpyskimpy is a light weight tool that provides summary statistics about variables in data frames within the console.
Stars: ✭ 236 (+424.44%)
bobaSpecifying and executing multiverse analysis
Stars: ✭ 50 (+11.11%)
ScattertextBeautiful visualizations of how language differs among document types.
Stars: ✭ 1,722 (+3726.67%)
StatKitA collection of statistical analysis tools for your Swift programs.
Stars: ✭ 66 (+46.67%)
adenineADENINE: A Data ExploratioN PipelINE
Stars: ✭ 15 (-66.67%)
webmc3A web interface for exploring PyMC3 traces
Stars: ✭ 46 (+2.22%)
insightA Tcl/Tk Frontend for GDB. This is an AppImage(Portable Package) of insight for the sake of Jeff Duntemann's amazing book.
Stars: ✭ 31 (-31.11%)
trtlTk-powered Ruby turtle graphics
Stars: ✭ 65 (+44.44%)
XdaR package for exploratory data analysis
Stars: ✭ 112 (+148.89%)
glimmer-dsl-tkGlimmer DSL for Tk (Ruby Tk Desktop Development GUI Library)
Stars: ✭ 26 (-42.22%)
leilaLibrería para la evaluación de calidad de datos, e interacción con el portal de datos.gov.co
Stars: ✭ 56 (+24.44%)
tukaanA modern, cross platform Python toolkit for creating desktop GUI applications. Contributors are welcome!
Stars: ✭ 97 (+115.56%)
scikit-hubnessA Python package for hubness analysis and high-dimensional data mining
Stars: ✭ 41 (-8.89%)
student-grade-analyticsAnalyse academic and non-academic information of students and predict grades
Stars: ✭ 17 (-62.22%)
SparkoraPowerful rapid automatic EDA and feature engineering library with a very easy to use API 🌟
Stars: ✭ 51 (+13.33%)
bbbikeBBBike
Stars: ✭ 56 (+24.44%)
cpptclC++ library for interoperability between C++ and TCL
Stars: ✭ 33 (-26.67%)
scCODAA Bayesian model for compositional single-cell data analysis
Stars: ✭ 109 (+142.22%)
Hn so analysisIs there a relationship between popularity of a given technology on Stack Overflow (SO) and Hacker News (HN)? And a few words about causality
Stars: ✭ 94 (+108.89%)
Data-Science-SeriesFor all those who're struggling to find a good hands-on resource (with case studies) to master their Data Science skills, Here's all what you need!
Stars: ✭ 48 (+6.67%)