Data Science Live BookAn open source book to learn data science, data analysis and machine learning, suitable for all ages!
Stars: ✭ 193 (-81.84%)
Datascience Ai Machinelearning ResourcesAlex Castrounis' curated set of resources for artificial intelligence (AI), machine learning, data science, internet of things (IoT), and more.
Stars: ✭ 414 (-61.05%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-89.93%)
Oie ResourcesA curated list of Open Information Extraction (OIE) resources: papers, code, data, etc.
Stars: ✭ 283 (-73.38%)
KoalasKoalas: pandas API on Apache Spark
Stars: ✭ 3,044 (+186.36%)
Data Science LearningRepository of code and resources related to different data science and machine learning topics. For learning, practice and teaching purposes.
Stars: ✭ 273 (-74.32%)
Stats Maths With PythonGeneral statistics, mathematical programming, and numerical/scientific computing scripts and notebooks in Python
Stars: ✭ 381 (-64.16%)
FivethirtyeightR package of data and code behind the stories and interactives at FiveThirtyEight
Stars: ✭ 422 (-60.3%)
Data Science Ipython NotebooksData science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
Stars: ✭ 22,048 (+1974.13%)
PachydermReproducible Data Science at Scale!
Stars: ✭ 5,305 (+399.06%)
TablesawJava dataframe and visualization library
Stars: ✭ 2,785 (+161.99%)
Data Science FreeFree Resources For Data Science created by Shubham Kumar
Stars: ✭ 232 (-78.17%)
XlearnHigh performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.
Stars: ✭ 2,968 (+179.21%)
bigstatsrR package for statistical tools with big matrices stored on disk.
Stars: ✭ 139 (-86.92%)
Scikit Mobilityscikit-mobility: mobility analysis in Python
Stars: ✭ 339 (-68.11%)
Csinva.github.ioSlides, paper notes, class notes, blog posts, and research on ML 📉, statistics 📊, and AI 🤖.
Stars: ✭ 342 (-67.83%)
Edward2A simple probabilistic programming language.
Stars: ✭ 419 (-60.58%)
Uncertainty BaselinesHigh-quality implementations of standard and SOTA methods on a variety of tasks.
Stars: ✭ 278 (-73.85%)
Mlj.jlA Julia machine learning framework
Stars: ✭ 982 (-7.62%)
Imbalanced LearnA Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Stars: ✭ 5,617 (+428.41%)
Boltons🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library. Nothing like Michael Bolton.
Stars: ✭ 5,671 (+433.49%)
Data Science CareerCareer Resources for Data Science, Machine Learning, Big Data and Business Analytics Career Repository
Stars: ✭ 630 (-40.73%)
BlogrScripts + data to recreate analyses published on http://benjaminlmoore.wordpress.com and http://blm.io
Stars: ✭ 23 (-97.84%)
DatascienceprojectsThe code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.
Stars: ✭ 223 (-79.02%)
ImodelsInterpretable ML package 🔍 for concise, transparent, and accurate predictive modeling (sklearn-compatible).
Stars: ✭ 194 (-81.75%)
DatascienceCurated list of Python resources for data science.
Stars: ✭ 3,051 (+187.02%)
PretzelJavascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-97.55%)
AttacaRobust, distributed version control for large files.
Stars: ✭ 41 (-96.14%)
FacetHuman-explainable AI.
Stars: ✭ 269 (-74.69%)
TrinoOfficial repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Stars: ✭ 4,581 (+330.95%)
Uci Ml ApiSimple API for UCI Machine Learning Dataset Repository (search, download, analyze)
Stars: ✭ 190 (-82.13%)
ProbabilityProbabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+233.96%)
Openintro Statistics📚 An open-source textbook written at the college level. OpenIntro also offers a second college-level intro stat textbook and also a high school variant.
Stars: ✭ 283 (-73.38%)
Mlinterview A curated awesome list of AI Startups in India & Machine Learning Interview Guide. Feel free to contribute!
Stars: ✭ 410 (-61.43%)
TeachingTeaching Materials for Dr. Waleed A. Yousef
Stars: ✭ 435 (-59.08%)
ObservationsTools for loading standard data sets in machine learning
Stars: ✭ 190 (-82.13%)
Facebook data analyzerAnalyze facebook copy of your data with ruby language. Download zip file from facebook and get info about friends ranking by message, vocabulary, contacts, friends added statistics and more
Stars: ✭ 515 (-51.55%)
Onlinestats.jlSingle-pass algorithms for statistics
Stars: ✭ 507 (-52.3%)
NipypeWorkflows and interfaces for neuroimaging packages
Stars: ✭ 557 (-47.6%)
EdwardA probabilistic programming language in TensorFlow. Deep generative models, variational inference.
Stars: ✭ 4,674 (+339.7%)
H2o 3H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
Stars: ✭ 5,656 (+432.08%)
SmileStatistical Machine Intelligence & Learning Engine
Stars: ✭ 5,412 (+409.13%)
Dataframe GoDataFrames for Go: For statistics, machine-learning, and data manipulation/exploration
Stars: ✭ 487 (-54.19%)
LooperA resource list for causality in statistics, data science and physics
Stars: ✭ 23 (-97.84%)
SocratA Dynamic Web Toolbox for Interactive Data Processing, Analysis, and Visualization
Stars: ✭ 26 (-97.55%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-19.66%)
CollapseAdvanced and Fast Data Transformation in R
Stars: ✭ 184 (-82.69%)
VirgilioVirgilio is developed and maintained by these awesome people.
You can email us virgilio.datascience (at) gmail.com or join the Discord chat.
Stars: ✭ 13,200 (+1141.77%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (-57.29%)
AutodlAutomated Deep Learning without ANY human intervention. 1'st Solution for AutoDL [email protected]
Stars: ✭ 854 (-19.66%)