Amazing Feature EngineeringFeature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-9.17%)
Datasist A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-48.75%)
Pythondatarepo for code published on pythondata.com
Stars: ✭ 113 (-52.92%)
Pydataroadopen source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (+25.83%)
Data Science HacksData Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (+13.75%)
Data Science On GcpSource code accompanying book: Data Science on the Google Cloud Platform, Valliappa Lakshmanan, O'Reilly 2017
Stars: ✭ 864 (+260%)
Dat8General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+531.67%)
Ml Workspace🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+873.75%)
Seaborn TutorialThis repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-52.5%)
Drugs Recommendation Using ReviewsAnalyzing the Drugs Descriptions, conditions, reviews and then recommending it using Deep Learning Models, for each Health Condition of a Patient.
Stars: ✭ 35 (-85.42%)
ArticlesA repository for the source code, notebooks, data, files, and other assets used in the data science and machine learning articles on LearnDataSci
Stars: ✭ 350 (+45.83%)
CoursesQuiz & Assignment of Coursera
Stars: ✭ 454 (+89.17%)
Machine Learning Workflow With PythonThis is a comprehensive ML techniques with python: Define the Problem- Specify Inputs & Outputs- Data Collection- Exploratory data analysis -Data Preprocessing- Model Design- Training- Evaluation
Stars: ✭ 157 (-34.58%)
DeltapyDeltaPy - Tabular Data Augmentation (by @firmai)
Stars: ✭ 344 (+43.33%)
Cookbook 2ndIPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018
Stars: ✭ 704 (+193.33%)
Data ScienceCollection of useful data science topics along with code and articles
Stars: ✭ 315 (+31.25%)
DtaleVisualizer for pandas data structures
Stars: ✭ 2,864 (+1093.33%)
Cookbook 2nd CodeCode of the IPython Cookbook, Second Edition, by Cyrille Rossant, Packt Publishing 2018 [read-only repository]
Stars: ✭ 541 (+125.42%)
Kaggle CompetitionsThere are plenty of courses and tutorials that can help you learn machine learning from scratch but here in GitHub, I want to solve some Kaggle competitions as a comprehensive workflow with python packages. After reading, you can use this workflow to solve other real problems and use it as a template.
Stars: ✭ 86 (-64.17%)
Data Science Resources👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-28.75%)
YaboxYet another black-box optimization library for Python
Stars: ✭ 103 (-57.08%)
NniAn open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
Stars: ✭ 10,698 (+4357.5%)
BlurrData transformations for the ML era
Stars: ✭ 96 (-60%)
Spark Py NotebooksApache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 1,338 (+457.5%)
SupersetApache Superset is a Data Visualization and Data Exploration Platform
Stars: ✭ 42,634 (+17664.17%)
Tennis Crystal BallUltimate Tennis Statistics and Tennis Crystal Ball - Tennis Big Data Analysis and Prediction
Stars: ✭ 107 (-55.42%)
KlibEasy to use Python library of customized functions for cleaning and analyzing data.
Stars: ✭ 192 (-20%)
Stock Market Analysis And PredictionStock Market Analysis and Prediction is the project on technical analysis, visualization and prediction using data provided by Google Finance.
Stars: ✭ 112 (-53.33%)
Spark R Notebooks R on Apache Spark (SparkR) tutorials for Big Data analysis and Machine Learning as IPython / Jupyter notebooks
Stars: ✭ 109 (-54.58%)
KriskStatistical Interactive Visualization with pandas+Jupyter integration on top of Echarts.
Stars: ✭ 111 (-53.75%)
SweetvizVisualize and compare datasets, target values and associations, with one line of code.
Stars: ✭ 1,851 (+671.25%)
Pandas VideosJupyter notebook and datasets from the pandas Q&A video series
Stars: ✭ 1,716 (+615%)
TsfelAn intuitive library to extract features from time series
Stars: ✭ 202 (-15.83%)
PbpythonCode, Notebooks and Examples from Practical Business Python
Stars: ✭ 1,724 (+618.33%)
Deeplearning NotesNotes for Deep Learning Specialization Courses led by Andrew Ng.
Stars: ✭ 126 (-47.5%)
Stock PredictionSmart Algorithms to predict buying and selling of stocks on the basis of Mutual Funds Analysis, Stock Trends Analysis and Prediction, Portfolio Risk Factor, Stock and Finance Market News Sentiment Analysis and Selling profit ratio. Project developed as a part of NSE-FutureTech-Hackathon 2018, Mumbai. Team : Semicolon
Stars: ✭ 125 (-47.92%)
Bitcoin Value Predictor[NOT MAINTAINED] Predicting Bit coin price using Time series analysis and sentiment analysis of tweets on bitcoin
Stars: ✭ 91 (-62.08%)
SumzerotradingA Java API for Developing Automated Trading Applications for the Equity, Futures, and Currency Markets
Stars: ✭ 128 (-46.67%)
Dtale DesktopBuild a data visualization dashboard with simple snippets of python code
Stars: ✭ 128 (-46.67%)
GradioCreate UIs for your machine learning model in Python in 3 minutes
Stars: ✭ 4,358 (+1715.83%)
Gwu data miningMaterials for GWU DNSC 6279 and DNSC 6290.
Stars: ✭ 217 (-9.58%)
Edavizedaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab
Stars: ✭ 220 (-8.33%)
DatascienceprojectsThe code repository for projects and tutorials in R and Python that covers a variety of topics in data visualization, statistics sports analytics and general application of probability theory.
Stars: ✭ 223 (-7.08%)
StreamlitStreamlit — The fastest way to build data apps in Python
Stars: ✭ 16,906 (+6944.17%)
MydatascienceportfolioApplying Data Science and Machine Learning to Solve Real World Business Problems
Stars: ✭ 227 (-5.42%)