Spark NotebookInteractive and Reactive Data Science using Scala and Spark.
Stars: ✭ 3,081 (+2069.72%)
Dist KerasDistributed Deep Learning, with a focus on distributed training, using Keras and Apache Spark.
Stars: ✭ 613 (+331.69%)
Griffon VmGriffon Data Science Virtual Machine
Stars: ✭ 128 (-9.86%)
PysparklingA pure Python implementation of Apache Spark's RDD and DStream interfaces.
Stars: ✭ 231 (+62.68%)
Agile data code 2Code for Agile Data Science 2.0, O'Reilly 2017, Second Edition
Stars: ✭ 413 (+190.85%)
Pulsar SparkWhen Apache Pulsar meets Apache Spark
Stars: ✭ 55 (-61.27%)
Seq2seq tutorialCode For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"
Stars: ✭ 132 (-7.04%)
Spark.NET for Apache® Spark™ makes Apache Spark™ easily accessible to .NET developers.
Stars: ✭ 1,721 (+1111.97%)
2016 Ml ContestMachine learning contest - October 2016 TLE
Stars: ✭ 135 (-4.93%)
Automl alexState-of-the art Automated Machine Learning python library for Tabular Data
Stars: ✭ 132 (-7.04%)
ScilabFree and Open Source software for numerical computation providing a powerful computing environment for engineering and scientific applications.
Stars: ✭ 138 (-2.82%)
AcceleratorThe Accelerator is a tool for fast and reproducible processing of large amounts of data.
Stars: ✭ 137 (-3.52%)
Dtale DesktopBuild a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (-9.86%)
Book This book serves as an introduction to a whole new way of thinking systematically about geographic data, using geographical analysis and computation to unlock new insights hidden within data.
Stars: ✭ 141 (-0.7%)
Qlik Py ToolsData Science algorithms for Qlik implemented as a Python Server Side Extension (SSE).
Stars: ✭ 135 (-4.93%)
ButterfreeA tool for building feature stores.
Stars: ✭ 126 (-11.27%)
Stock PredictionSmart Algorithms to predict buying and selling of stocks on the basis of Mutual Funds Analysis, Stock Trends Analysis and Prediction, Portfolio Risk Factor, Stock and Finance Market News Sentiment Analysis and Selling profit ratio. Project developed as a part of NSE-FutureTech-Hackathon 2018, Mumbai. Team : Semicolon
Stars: ✭ 125 (-11.97%)
HermioneML made simple
Stars: ✭ 135 (-4.93%)
Dive Into Machine LearningDive into Machine Learning with Python Jupyter notebook and scikit-learn! First posted in 2016, maintained as of 2021. Pull requests welcome.
Stars: ✭ 10,810 (+7512.68%)
TntorchTensor Network Learning with PyTorch
Stars: ✭ 133 (-6.34%)
TrafficA toolbox for processing and analysing air traffic data
Stars: ✭ 138 (-2.82%)
PecanThe Predictive Ecosystem Analyzer (PEcAn) is an integrated ecological bioinformatics toolbox.
Stars: ✭ 132 (-7.04%)
NlpaugData augmentation for NLP
Stars: ✭ 2,761 (+1844.37%)
Rpy2Interface to use R from Python
Stars: ✭ 132 (-7.04%)
Machine Learning And Data ScienceThis is a repository which contains all my work related Machine Learning, AI and Data Science. This includes my graduate projects, machine learning competition codes, algorithm implementations and reading material.
Stars: ✭ 137 (-3.52%)
DatascicompA collection of popular Data Science Challenges/Competitions || Countdown timers to keep track of the entry deadlines.
Stars: ✭ 1,636 (+1052.11%)
Stats337Readings in applied data science
Stars: ✭ 1,625 (+1044.37%)
Ripser.pyA Lean Persistent Homology Library for Python
Stars: ✭ 139 (-2.11%)
Torchbear🔥🐻 The Speakeasy Scripting Engine Which Combines Speed, Safety, and Simplicity
Stars: ✭ 128 (-9.86%)
Data Science WgSF Brigade's Data Science Working Group.
Stars: ✭ 135 (-4.93%)
LifelinesSurvival analysis in Python
Stars: ✭ 1,766 (+1143.66%)
Doddle Model🍰 doddle-model: machine learning in Scala.
Stars: ✭ 142 (+0%)
PandasschemaA validation library for Pandas data frames using user-friendly schemas
Stars: ✭ 135 (-4.93%)
PipelinexPipelineX: Python package to build ML pipelines for experimentation with Kedro, MLflow, and more
Stars: ✭ 127 (-10.56%)
TomaHelps you write algorithms in PyTorch that adapt to the available (CUDA) memory
Stars: ✭ 139 (-2.11%)
Cape PythonCollaborate on privacy-preserving policy for data science projects in Pandas and Apache Spark
Stars: ✭ 125 (-11.97%)
Beyond Jupyter🐍💻📊 All material from the PyCon.DE 2018 Talk "Beyond Jupyter Notebooks - Building your own data science platform with Python & Docker" (incl. Slides, Video, Udemy MOOC & other References)
Stars: ✭ 135 (-4.93%)
Deeplearning NotesNotes for Deep Learning Specialization Courses led by Andrew Ng.
Stars: ✭ 126 (-11.27%)
MatrixprofileA Python 3 library making time series data mining tasks, utilizing matrix profile algorithms, accessible to everyone.
Stars: ✭ 141 (-0.7%)
Aws Data WranglerPandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
Stars: ✭ 2,385 (+1579.58%)
Blockchain2graphBlockchain2graph extracts blockchain data (bitcoin) and insert them into a graph database (neo4j).
Stars: ✭ 134 (-5.63%)
Rightmove webscraper.pyPython class to scrape data from rightmove.co.uk and return listings in a pandas DataFrame object
Stars: ✭ 125 (-11.97%)
Dbg PdsDeutsche Boerse's Financial Trading Public Data Set
Stars: ✭ 124 (-12.68%)
DatasciencecourseraData Science Repo and blog for John Hopkins Coursera Courses. Please let me know if you have any questions.
Stars: ✭ 1,928 (+1257.75%)
AcceleratorsData science and AI solution accelerator suite that provides templates for prototyping, reporting, and presenting data science analytics of specific domains
Stars: ✭ 134 (-5.63%)
Datasciencera curated list of R tutorials for Data Science, NLP and Machine Learning
Stars: ✭ 1,727 (+1116.2%)