Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-60.14%)
Zarr.jsJavascript implementation of Zarr
Stars: ✭ 54 (-60.87%)
LifelinesSurvival analysis in Python
Stars: ✭ 1,766 (+1179.71%)
OpenDiffusionKinetics open-source monorepo
Stars: ✭ 116 (-15.94%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+869.57%)
TiledbThe Universal Storage Engine
Stars: ✭ 1,072 (+676.81%)
TpotA Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
Stars: ✭ 8,378 (+5971.01%)
Node Druid QuerySimple querying library for Druid (http://druid.io)
Stars: ✭ 93 (-32.61%)
Numerical Linear AlgebraFree online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Stars: ✭ 8,263 (+5887.68%)
Keras ContribKeras community contributions
Stars: ✭ 1,532 (+1010.14%)
PresentationsTalks & Workshops by the CODAIT team
Stars: ✭ 50 (-63.77%)
Ml PyxisTool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-32.61%)
Mckinsey Smartcities Traffic PredictionAdventure into using multi attention recurrent neural networks for time-series (city traffic) for the 2017-11-18 McKinsey IronMan (24h non-stop) prediction challenge
Stars: ✭ 49 (-64.49%)
Ds With PysimpleguiData science and Machine Learning GUI programs/ desktop apps with PySimpleGUI package
Stars: ✭ 93 (-32.61%)
CausalnexA Python library that helps data scientists to infer causation rather than observing correlation.
Stars: ✭ 1,036 (+650.72%)
TruvisoryThis project is meant to provide resources to users who want to access good LinkedIn posts which contains resources to learn any Technology, Design, Self-Branding, Motivation etc. You can visit project by:
Stars: ✭ 116 (-15.94%)
Tageditor🏖TagEditor - Annotation tool for spaCy
Stars: ✭ 92 (-33.33%)
ZenmlZenML 🙏: MLOps framework to create reproducible ML pipelines for production machine learning.
Stars: ✭ 1,019 (+638.41%)
Automl alexState-of-the art Automated Machine Learning python library for Tabular Data
Stars: ✭ 132 (-4.35%)
DiffgramData Annotation, Data Labeling, Annotation Tooling, Training Data for Machine Learning
Stars: ✭ 43 (-68.84%)
Pytorch WrapperProvides a systematic and extensible way to build, train, evaluate, and tune deep learning models using PyTorch.
Stars: ✭ 92 (-33.33%)
MachinelearningA repo with tutorials for algorithms from scratch
Stars: ✭ 96 (-30.43%)
FasttextUnofficial implementation of the paper "Bag of Tricks for Efficient Text Classification" by Joulin et al.
Stars: ✭ 53 (-61.59%)
RcongressoPacote R para acessar dados do congresso nacional.
Stars: ✭ 42 (-69.57%)
Lda Topic ModelingA PureScript, browser-based implementation of LDA topic modeling.
Stars: ✭ 91 (-34.06%)
SusiSuSi: Python package for unsupervised, supervised and semi-supervised self-organizing maps (SOM)
Stars: ✭ 42 (-69.57%)
PipelinexPipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-7.97%)
SoccergraphrSoccer Analytics in R using OPTA data
Stars: ✭ 42 (-69.57%)
H2o TutorialsTutorials and training material for the H2O Machine Learning Platform
Stars: ✭ 1,305 (+845.65%)
Ds Take HomeMy solution to the book A Collection of Data Science Take-Home Challenges
Stars: ✭ 1,004 (+627.54%)
MlrMachine Learning in R
Stars: ✭ 1,542 (+1017.39%)
Qriyou're invited to a data party!
Stars: ✭ 1,003 (+626.81%)
TexeraBig Data Analytics Using Interactive Workflows
Stars: ✭ 90 (-34.78%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+623.19%)
PandasschemaA validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-2.17%)
DrakeAn R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (+842.75%)
UgfraudAn Unsupervised Graph-based Toolbox for Fraud Detection
Stars: ✭ 38 (-72.46%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-18.12%)
Optimus🚚 Agile Data Preparation Workflows made easy with dask, cudf, dask_cudf and pyspark
Stars: ✭ 986 (+614.49%)
Pymc Example ProjectExample PyMC3 project for performing Bayesian data analysis using a probabilistic programming approach to machine learning.
Stars: ✭ 90 (-34.78%)
ParalleldistR Package: Parallel Distance Matrix Computation using Multiple Threads
Stars: ✭ 37 (-73.19%)
Cape PythonCollaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-9.42%)
BubblyA python package for plotting animated and interactive bubble charts using Plotly
Stars: ✭ 37 (-73.19%)
Daily Coding ProblemSeries of the problem 💯 and solution ✅ asked by Daily Coding problem👨🎓 website.
Stars: ✭ 90 (-34.78%)
Janitorsimple tools for data cleaning in R
Stars: ✭ 981 (+610.87%)
Kaggle HousepricesKaggle Kernel for House Prices competition https://www.kaggle.com/massquantity/all-you-need-is-pca-lb-0-11421-top-4
Stars: ✭ 113 (-18.12%)
ProbflowA Python package for building Bayesian models with TensorFlow or PyTorch
Stars: ✭ 95 (-31.16%)
25daysinmachinelearningI will update this repository to learn Machine learning with python with statistics content and materials
Stars: ✭ 53 (-61.59%)
Ml Template AzureTemplate for getting started with automated ML Ops on Azure Machine Learning
Stars: ✭ 52 (-62.32%)
AethosAutomated Data Science and Machine Learning library to optimize workflow.
Stars: ✭ 94 (-31.88%)
Ppd599USC urban data science course series with Python and Jupyter
Stars: ✭ 1,062 (+669.57%)