DeveeldbDeveelDB is a complete SQL database system, primarly developed for .NET/Mono frameworks
Stars: ✭ 80 (-98.37%)
ApogeeTools for dealing with APOGEE data
Stars: ✭ 34 (-99.31%)
Mlcourse.aiOpen Machine Learning Course
Stars: ✭ 7,963 (+61.88%)
DataflowjavasdkGoogle Cloud Dataflow provides a simple, powerful model for building both batch and streaming parallel data processing pipelines.
Stars: ✭ 854 (-82.64%)
Hyperlearn50% faster, 50% less RAM Machine Learning. Numba rewritten Sklearn. SVD, NNMF, PCA, LinearReg, RidgeReg, Randomized, Truncated SVD/PCA, CSR Matrices all 50+% faster
Stars: ✭ 1,204 (-75.52%)
DexDex : The Data Explorer -- A data visualization tool written in Java/Groovy/JavaFX capable of powerful ETL and publishing web visualizations.
Stars: ✭ 1,238 (-74.83%)
QuiltQuilt is a self-organizing data hub for S3
Stars: ✭ 1,007 (-79.53%)
Ether sqlA python library to push ethereum blockchain data into an sql database.
Stars: ✭ 41 (-99.17%)
Digeds catThis research seeks to examine best practice in the field of digital editions by collating relevant evidence in a detailed catalogue of extant digital projects.
Stars: ✭ 40 (-99.19%)
ElectrophysiologysoftwareA list of openly available software tools for (mostly human) electrophysiology.
Stars: ✭ 54 (-98.9%)
PresentationsTalks & Workshops by the CODAIT team
Stars: ✭ 50 (-98.98%)
Applied Ml📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production.
Stars: ✭ 17,824 (+262.35%)
Etl with pythonETL with Python - Taught at DWH course 2017 (TAU)
Stars: ✭ 68 (-98.62%)
MlboxMLBox is a powerful Automated Machine Learning python library.
Stars: ✭ 1,199 (-75.63%)
Ml PyxisTool for reading and writing datasets of tensors in a Lightning Memory-Mapped Database (LMDB). Designed to manage machine learning datasets with fast reading speeds.
Stars: ✭ 93 (-98.11%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (-72.8%)
TdpThe Darkest Pipeline - Multithreaded pipelines for modern C++
Stars: ✭ 67 (-98.64%)
Etl unicorn数据可视化, 数据挖掘, 数据处理 ETL
Stars: ✭ 156 (-96.83%)
Locopylocopy: Loading/Unloading to Redshift and Snowflake using Python.
Stars: ✭ 73 (-98.52%)
Tsrepr TSrepr: R package for time series representations
Stars: ✭ 75 (-98.48%)
StetlStetl, Streaming ETL, is a lightweight geospatial processing and ETL framework written in Python.
Stars: ✭ 64 (-98.7%)
Ai Expert RoadmapRoadmap to becoming an Artificial Intelligence Expert in 2021
Stars: ✭ 15,441 (+213.91%)
Fklearnfklearn: Functional Machine Learning
Stars: ✭ 1,305 (-73.47%)
DrakeAn R-focused pipeline toolkit for reproducibility and high-performance computing
Stars: ✭ 1,301 (-73.55%)
PretzelJavascript full-stack framework for Big Data visualisation and analysis
Stars: ✭ 26 (-99.47%)
Awesome BigdataA curated list of awesome big data frameworks, ressources and other awesomeness.
Stars: ✭ 10,478 (+113.01%)
Scikit Learnscikit-learn: machine learning in Python
Stars: ✭ 48,322 (+882.35%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-97.82%)
CubesLight-weight Python OLAP framework for multi-dimensional data analysis
Stars: ✭ 1,393 (-71.68%)
Metlmito ETL tool
Stars: ✭ 153 (-96.89%)
XdaR package for exploratory data analysis
Stars: ✭ 112 (-97.72%)
CodesearchnetDatasets, tools, and benchmarks for representation learning of code.
Stars: ✭ 1,378 (-71.99%)
Pyspark Cheatsheet🐍 Quick reference guide to common patterns & functions in PySpark.
Stars: ✭ 108 (-97.8%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-97.78%)
AlgocodeWelcome everyone!🌟 Here you can solve problems, build scrappers and much more💻
Stars: ✭ 113 (-97.7%)
DatasciencevmTools and Docs on the Azure Data Science Virtual Machine (http://aka.ms/dsvm)
Stars: ✭ 153 (-96.89%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-97.7%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-97.68%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (-69.18%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (-65.11%)
Spark AlchemyCollection of open-source Spark tools & frameworks that have made the data engineering and data science teams at Swoop highly productive
Stars: ✭ 122 (-97.52%)
Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-97.5%)
Covid19 scenariosModels of COVID-19 outbreak trajectories and hospital demand
Stars: ✭ 1,355 (-72.45%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (-62.37%)
D6t PythonAccelerate data science
Stars: ✭ 118 (-97.6%)
KibaData processing & ETL framework for Ruby
Stars: ✭ 1,618 (-67.11%)
RikoA Python stream processing engine modeled after Yahoo! Pipes
Stars: ✭ 1,571 (-68.06%)
Deeplearning NotesNotes for Deep Learning Specialization Courses led by Andrew Ng.
Stars: ✭ 126 (-97.44%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-97.46%)
Reddit DetectivePlay detective on Reddit: Discover political disinformation campaigns, secret influencers and more
Stars: ✭ 129 (-97.38%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-97.28%)