ReflowA language and runtime for distributed, incremental data processing in the cloud
Stars: ✭ 706 (+1207.41%)
HydrogenRun code interactively, inspect data, and plot. All the power of Jupyter kernels, inside your favorite text editor.
Stars: ✭ 3,763 (+6868.52%)
Qriyou're invited to a data party!
Stars: ✭ 1,003 (+1757.41%)
OrchestA new kind of IDE for Data Science.
Stars: ✭ 694 (+1185.19%)
WptoolsWikipedia tools (for Humans): easily extract data from Wikipedia, Wikidata, and other MediaWikis
Stars: ✭ 371 (+587.04%)
Intro PythonPython pour Statistique et Science des Données -- Syntaxe, Trafic de Données, Graphes, Programmation, Apprentissage
Stars: ✭ 21 (-61.11%)
LoopyA code generator for array-based code on CPUs and GPUs
Stars: ✭ 367 (+579.63%)
DecimalA high-performance, arbitrary-precision, floating-point decimal library.
Stars: ✭ 363 (+572.22%)
HashmapHashMap JavaScript class for Node.js and the browser. The keys can be anything and won't be stringified
Stars: ✭ 363 (+572.22%)
FeatexpFeature exploration for supervised learning
Stars: ✭ 688 (+1174.07%)
Learn Something Every Day📝 A compilation of everything that I learn; Computer Science, Software Development, Engineering, Math, and Coding in General. Read the rendered results here ->
Stars: ✭ 362 (+570.37%)
Quantitative NotebooksEducational notebooks on quantitative finance, algorithmic trading, financial modelling and investment strategy
Stars: ✭ 356 (+559.26%)
PixiedustPython Helper library for Jupyter Notebooks
Stars: ✭ 998 (+1748.15%)
GirderA data management platform for the web, developed by Kitware
Stars: ✭ 350 (+548.15%)
Artificial Adversary🗣️ Tool to generate adversarial text examples and test machine learning models against them
Stars: ✭ 348 (+544.44%)
ClevercsvCleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved dialect detection, and comes with a handy command line application for working with CSV files.
Stars: ✭ 887 (+1542.59%)
ThesemicolonThis repository contains Ipython notebooks and datasets for the data analytics youtube tutorials on The Semicolon.
Stars: ✭ 345 (+538.89%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+537.04%)
Scikit Mobilityscikit-mobility: mobility analysis in Python
Stars: ✭ 339 (+527.78%)
Ipython DashboardA stand alone, light-weight web server for building, sharing graphs created in ipython. Build for data science, data analysis guys. Aiming at building an interactive visualization, collaborated dashboard, and real-time streaming graph.
Stars: ✭ 664 (+1129.63%)
Eseur Code DataCode and data used to create the examples in "Evidence-based Software Engineering based on the publicly available data"
Stars: ✭ 340 (+529.63%)
Pandas ProfilingCreate HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+15324.07%)
P IterationUtilities that make array iteration easy when using async/await or Promises
Stars: ✭ 337 (+524.07%)
MlxtendA library of extension and helper modules for Python's data analysis and machine learning libraries.
Stars: ✭ 3,729 (+6805.56%)
Morphism⚡ Type-safe data transformer for JavaScript, TypeScript & Node.js.
Stars: ✭ 336 (+522.22%)
Hyperparameter hunterEasy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries
Stars: ✭ 648 (+1100%)
DatmoOpen source production model management tool for data scientists
Stars: ✭ 334 (+518.52%)
Pydata.krPyData Korea 공식 홈페이지입니다. (준비중)
Stars: ✭ 13 (-75.93%)
ArtificioDeep Learning Computer Vision Algorithms for Real-World Use
Stars: ✭ 326 (+503.7%)
FeaturetoolsAn open source python library for automated feature engineering
Stars: ✭ 5,891 (+10809.26%)
MlibLibrary of generic and type safe containers in pure C language (C99 or C11) for a wide collection of container (comparable to the C++ STL).
Stars: ✭ 321 (+494.44%)
Datumbox FrameworkDatumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
Stars: ✭ 1,063 (+1868.52%)
ProbabilityProbabilistic reasoning and statistical analysis in TensorFlow
Stars: ✭ 3,550 (+6474.07%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+1500%)
PandasvaultAdvanced Pandas Vault — Utilities, Functions and Snippets (by @firmai).
Stars: ✭ 316 (+485.19%)
Zero To Mastery MlAll course materials for the Zero to Mastery Machine Learning and Data Science course.
Stars: ✭ 631 (+1068.52%)
Scikit RebateA scikit-learn-compatible Python implementation of ReBATE, a suite of Relief-based feature selection algorithms for Machine Learning.
Stars: ✭ 314 (+481.48%)
UgfraudAn Unsupervised Graph-based Toolbox for Fraud Detection
Stars: ✭ 38 (-29.63%)
Pyspark Example ProjectExample project implementing best practices for PySpark ETL jobs and applications.
Stars: ✭ 633 (+1072.22%)
FasttextUnofficial implementation of the paper "Bag of Tricks for Efficient Text Classification" by Joulin et al.
Stars: ✭ 53 (-1.85%)
Ml Template AzureTemplate for getting started with automated ML Ops on Azure Machine Learning
Stars: ✭ 52 (-3.7%)
Php MlPHP-ML - Machine Learning library for PHP
Stars: ✭ 7,900 (+14529.63%)
Computervision RecipesBest Practices, code samples, and documentation for Computer Vision.
Stars: ✭ 8,214 (+15111.11%)
Python TrainingPython training for business analysts and traders
Stars: ✭ 972 (+1700%)
Array FirstGet the first element or first n elements of an array.
Stars: ✭ 6 (-88.89%)
PbaEfficient Learning of Augmentation Policy Schedules
Stars: ✭ 461 (+753.7%)