NannyA tidyverse suite for (pre-) machine-learning: cluster, PCA, permute, impute, rotate, redundancy, triangular, smart-subset, abundant and variable features.
Stars: ✭ 17 (-10.53%)
GreyNSightsPrivacy-Preserving Data Analysis using Pandas
Stars: ✭ 18 (-5.26%)
Morpheus CoreThe foundational library of the Morpheus data science framework
Stars: ✭ 203 (+968.42%)
SupersetApache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+224289.47%)
tidyweekRepo dedicated to #tidyweek & Mentorship pilot
Stars: ✭ 25 (+31.58%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (+800%)
Riceteacatpandarepo with challenge material for riceteacatpanda (2020)
Stars: ✭ 18 (-5.26%)
rfordatasciencewikiResources for the R4DS Online Learning Community, including answer keys to the text
Stars: ✭ 40 (+110.53%)
Fraud-Detection-in-Online-TransactionsDetecting Frauds in Online Transactions using Anamoly Detection Techniques Such as Over Sampling and Under-Sampling as the ratio of Frauds is less than 0.00005 thus, simply applying Classification Algorithm may result in Overfitting
Stars: ✭ 41 (+115.79%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (+5063.16%)
Dataanalysisinaction(已完结)《极客时间数据分析实战45讲-详细笔记》包含markdown、图片、思维导图、代码 、数据。 可直接阅读代码、测试!
Stars: ✭ 482 (+2436.84%)
Countly Sdk CordovaCountly Product Analytics SDK for Cordova, Icenium and Phonegap
Stars: ✭ 69 (+263.16%)
Countly Sdk WebCountly Product Analytics SDK for websites and web applications
Stars: ✭ 165 (+768.42%)
R4EconR Code Examples Multi-dimensional/Panel Data
Stars: ✭ 16 (-15.79%)
8-Week-SQL-ChallengeCase study solutions for #8WeekSQLChallenge at https://8weeksqlchallenge.com
Stars: ✭ 43 (+126.32%)
gocellsEvent Based Applications [DEPRECATED]
Stars: ✭ 69 (+263.16%)
hpdbscanHighly parallel DBSCAN (HPDBSCAN)
Stars: ✭ 19 (+0%)
PSelectPowerShell DSL for aggregating data
Stars: ✭ 27 (+42.11%)
casewhenCreate reusable dplyr::case_when() functions
Stars: ✭ 64 (+236.84%)
GuitarA Simple and Efficient Distributed Multidimensional BI Analysis Engine.
Stars: ✭ 86 (+352.63%)
tidytree🚿A Tidy Tool for Phylogenetic Tree Data Manipulation
Stars: ✭ 34 (+78.95%)
hotmapWebGL Heatmap Viewer for Big Data and Bioinformatics
Stars: ✭ 13 (-31.58%)
olliePyOlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
Stars: ✭ 46 (+142.11%)
visionsType System for Data Analysis in Python
Stars: ✭ 136 (+615.79%)
re-datare_data - fix data issues before your users & CEO would discover them 😊
Stars: ✭ 955 (+4926.32%)
demeterProcess and analyze X-ray Absorption Spectroscopy data using Feff and either Larch or Ifeffit.
Stars: ✭ 50 (+163.16%)
covid-19COVID-19 World is yet another Project to build a Dashboard like app to showcase the data related to the COVID-19(Corona Virus).
Stars: ✭ 28 (+47.37%)
social-dataCode and data for eviction and housing analysis in the US
Stars: ✭ 17 (-10.53%)
Dominando-PandasEste repositório está destinado ao processo de aprendizagem da biblioteca Pandas.
Stars: ✭ 22 (+15.79%)
genieGenie: A Fast and Robust Hierarchical Clustering Algorithm (this R package has now been superseded by genieclust)
Stars: ✭ 21 (+10.53%)
dataViz CADiMaterials for the "Data Visualization" CADi workshop @ "Tecnológico de Monterrey"
Stars: ✭ 14 (-26.32%)
parcours-rValise pédagogique pour la formation à R
Stars: ✭ 25 (+31.58%)
noamross.netRepository for my personal website
Stars: ✭ 17 (-10.53%)
resamplrR package cross-validation, bootstrap, permutation, and rolling window resampling techniques for the tidyverse.
Stars: ✭ 35 (+84.21%)
Naive-Resume-MatchingText Similarity Applied to resume, to compare Resumes with Job Descriptions and create a score to rank them. Similar to an ATS.
Stars: ✭ 27 (+42.11%)
lightdashAn open source alternative to Looker built using dbt. Made for analysts ❤️
Stars: ✭ 1,082 (+5594.74%)
growthbookOpen Source Feature Flagging and A/B Testing Platform
Stars: ✭ 2,342 (+12226.32%)
data vis statistics geosciencesThis repository contains the laboratory portion of an upper level undergraduate class in Python on data visualization and statistics for geo & space scientists. Labs are updated when the course is in session through the most recent branch. See master version for current class.
Stars: ✭ 32 (+68.42%)
CoreMSCoreMS is a comprehensive mass spectrometry software framework
Stars: ✭ 20 (+5.26%)
EEGEduInteractive Brain Playground - Browser based tutorials on EEG with webbluetooth and muse
Stars: ✭ 91 (+378.95%)
collateralMap, find and isolate captured side effects
Stars: ✭ 39 (+105.26%)
AlphaPlot📈 Application for statistical analysis and data visualization which can generate different types of publication quality 2D and 3D plots with extensive visual customization.
Stars: ✭ 140 (+636.84%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (+57.89%)
ipython-notebooksA collection of Jupyter notebooks exploring different datasets.
Stars: ✭ 43 (+126.32%)
sherlock🔎 Find usernames across social networks.
Stars: ✭ 47 (+147.37%)
genieclustGenie++ Fast and Robust Hierarchical Clustering with Noise Point Detection - for Python and R
Stars: ✭ 34 (+78.95%)
pyglotaranA Python library for Global and Target Analysis of time-resolved spectroscopy data
Stars: ✭ 33 (+73.68%)
PythonTipsDSPython Tips for Data Scientist
Stars: ✭ 23 (+21.05%)
vinumVinum is a SQL processor for Python, designed for data analysis workflows and in-memory analytics.
Stars: ✭ 57 (+200%)
desctableAn R package to produce descriptive and comparative tables
Stars: ✭ 49 (+157.89%)