All Projects → jessecambon → Data-Science-Sandbox

jessecambon / Data-Science-Sandbox

Licence: MIT License
Code and resources to serve as a starting point for data science projects.

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to Data-Science-Sandbox

confstat-bot
Statistics for telegram groups
Stars: ✭ 26 (+85.71%)
Mutual labels:  statistics
ThunderStats
This addon adds awesome statistics to your beloved Thunderbird!
Stars: ✭ 34 (+142.86%)
Mutual labels:  statistics
mortAAR
R Package - Analysis of Archaeological Mortality Data
Stars: ✭ 13 (-7.14%)
Mutual labels:  statistics
mongodb-info
MongoDB Info - command line tool to get stats.
Stars: ✭ 13 (-7.14%)
Mutual labels:  statistics
AMIDD
Introduction to Applied Mathematics and Informatics in Drug Discovery (AMIDD)
Stars: ✭ 13 (-7.14%)
Mutual labels:  statistics
d3-boxplot
d3js box plot plugin
Stars: ✭ 21 (+50%)
Mutual labels:  statistics
parcours-r
Valise pédagogique pour la formation à R
Stars: ✭ 25 (+78.57%)
Mutual labels:  statistics
Probability Theory
A quick introduction to all most important concepts of Probability Theory, only freshman level of mathematics needed as prerequisite.
Stars: ✭ 25 (+78.57%)
Mutual labels:  statistics
Euler
The open-source computational framework for the Swift language
Stars: ✭ 37 (+164.29%)
Mutual labels:  statistics
growthbook
Open Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+16628.57%)
Mutual labels:  statistics
karan36k.github.io
These are all the articles and pages I have in my data science website. I try to transcribe all I learn and post regularly. Please visit and feel free to email me for suggestions.
Stars: ✭ 22 (+57.14%)
Mutual labels:  statistics
quarantine-bot
WhatsApp bot powered by Twilio API to get through the quarantine. Latest COVID19 statistics, world news, inspirational quotes and cat photos.
Stars: ✭ 24 (+71.43%)
Mutual labels:  statistics
laravel-model-stats
Model statistics dashboard for your Laravel Application
Stars: ✭ 186 (+1228.57%)
Mutual labels:  statistics
actinia core
Actinia Core is an open source REST API for scalable, distributed, high performance processing of geographical data that uses mainly GRASS GIS for computational tasks (DOI: https://doi.org/10.5281/zenodo.5879231)
Stars: ✭ 41 (+192.86%)
Mutual labels:  geospatial-analysis
php-statistics
Statistics library for PHP
Stars: ✭ 32 (+128.57%)
Mutual labels:  statistics
interactive-simple-linear-regression
A PureScript, browser-based implementation of simple linear regression.
Stars: ✭ 15 (+7.14%)
Mutual labels:  statistics
BayesHMM
Full Bayesian Inference for Hidden Markov Models
Stars: ✭ 35 (+150%)
Mutual labels:  statistics
wp-analytify
Google Analytics Dashboard Plugin For WordPress By Analytify
Stars: ✭ 20 (+42.86%)
Mutual labels:  statistics
teach-r-online
Materials for the Teaching statistics and data science online workshops in July 2020
Stars: ✭ 52 (+271.43%)
Mutual labels:  statistics
goodreads-toolbox
9 tools for Goodreads.com, for finding people based on the books they’ve read, finding books popular among the people you follow, following new book reviews, etc
Stars: ✭ 56 (+300%)
Mutual labels:  statistics

The Data Science Codex

A collection of code and resources to serve as a starting point for data science projects. For more explanation and material on R visit my blog.

Notes

Data Visualization

Statistical Modeling and Machine Learning

  • Modeling Fundamentals (R) - A primer on logistic and linear regression modeling with the classic Titanic dataset.
  • Survival Analysis (R) - Survival analysis methods such as cox proportion hazard models and Kaplan-Meier curves.
  • Modeling Workflows (R) - Streamlined Tidyverse modeling workflows with the gapminder dataset.
  • Multilevel Models (R) - Multi-level aka. mixed effects models
  • Time Series Modeling (R) - Experimenting with time series modeling (tsibble, forecast libraries, prophet, etc.)
  • Ordinal Regression (R) - Experimenting with ordinal (ranked categorical outcome) regression
  • Presenting Regression Models (R) - Code for cleaning the outputs of regression models for presentations.
  • Sklearn Modeling Workflows (Python) - Modeling workflows with sklearn (cross-validation, randomized search for optimizing hyperparameters, lift curves).
  • Sklearn - Skopt Workflow (Python) - Modeling workflow with sklearn and scikit-optimize (bayesian hyperparameter optimization.
  • Machine Learning with Caret (R) - Using the Caret library for machine learning.
  • Parsnip (R) - fitting models with the parsnip package (from tidymodels)

Bayesian Models

Clustering

  • k-means clustering (R) - Using the k-means algorithm to cluster data.
  • Clustering (Python) - Agglomerative (Hierarchical) clustering, k-means clustering, and Gaussian mixture models

Stats Analysis

NLP

  • Document Embeddings (Python) - Using word embeddings to compare the similarity of State of the Union addresses.
  • State of the Union Analysis (Python) - An exploration of state of the union addresses with topic modeling and sentiment analysis.
  • Sentiment Analysis (R) - Exploring sentiment analysis in R.
  • LSTM Demo (Python) - An LSTM network for predicting if a company review from glassdoor is positive

Miscellaneous

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].