khuyentran1401 / Data Science
Collection of useful data science topics along with code and articles
Stars: ✭ 315
Programming Languages
python
139335 projects - #7 most used programming language
Labels
Projects that are alternatives of or similar to Data Science
Dat8
General Assembly's 2015 Data Science course in Washington, DC
Stars: ✭ 1,516 (+381.27%)
Mutual labels: jupyter-notebook, data-science, data-analysis, natural-language-processing, data-visualization
Data Science Resources
👨🏽🏫You can learn about what data science is and why it's important in today's modern world. Are you interested in data science?🔋
Stars: ✭ 171 (-45.71%)
Mutual labels: artificial-intelligence, jupyter-notebook, data-science, data-analysis, data-visualization
Courses
Quiz & Assignment of Coursera
Stars: ✭ 454 (+44.13%)
Mutual labels: jupyter-notebook, data-science, data-analysis, natural-language-processing, data-visualization
Amazing Feature Engineering
Feature engineering is the process of using domain knowledge to extract features from raw data via data mining techniques. These features can be used to improve the performance of machine learning algorithms. Feature engineering can be considered as applied machine learning itself.
Stars: ✭ 218 (-30.79%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Datascience
Curated list of Python resources for data science.
Stars: ✭ 3,051 (+868.57%)
Mutual labels: artificial-intelligence, data-science, data-analysis, data-visualization
Ml Workspace
🛠 All-in-one web-based IDE specialized for machine learning and data science.
Stars: ✭ 2,337 (+641.9%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Data Science Hacks
Data Science Hacks consists of tips, tricks to help you become a better data scientist. Data science hacks are for all - beginner to advanced. Data science hacks consist of python, jupyter notebook, pandas hacks and so on.
Stars: ✭ 273 (-13.33%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Deep Learning Machine Learning Stock
Stock for Deep Learning and Machine Learning
Stars: ✭ 240 (-23.81%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Pandas Profiling
Create HTML profiling reports from pandas DataFrame objects
Stars: ✭ 8,329 (+2544.13%)
Mutual labels: artificial-intelligence, jupyter-notebook, data-science, data-analysis
Datascience course
Curso de Data Science em Português
Stars: ✭ 294 (-6.67%)
Mutual labels: artificial-intelligence, jupyter-notebook, data-science, data-analysis
Fixy
Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların eksiklerini gideren open source bir yazım destekleyicisi/denetleyicisi oluşturmak. Kullanıcıların yazdıkları metinlerdeki yazım yanlışlarını derin öğrenme yaklaşımıyla çözüp aynı zamanda metinlerde anlamsal analizi de gerçekleştirerek bu bağlamda ortaya çıkan yanlışları da fark edip düzeltebilmek.
Stars: ✭ 165 (-47.62%)
Mutual labels: artificial-intelligence, jupyter-notebook, data-science, natural-language-processing
Pydataroad
open source for wechat-official-account (ID: PyDataLab)
Stars: ✭ 302 (-4.13%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Data Science Portfolio
A Portfolio of my Data Science Projects
Stars: ✭ 149 (-52.7%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Fantasy Basketball
Scraping statistics, predicting NBA player performance with neural networks and boosting algorithms, and optimising lineups for Draft Kings with genetic algorithm. Capstone Project for Machine Learning Engineer Nanodegree by Udacity.
Stars: ✭ 146 (-53.65%)
Mutual labels: jupyter-notebook, data-science, scraping, data-visualization
Dtale
Visualizer for pandas data structures
Stars: ✭ 2,864 (+809.21%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Datasist
A Python library for easy data analysis, visualization, exploration and modeling
Stars: ✭ 123 (-60.95%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Awesome Ai Ml Dl
Awesome Artificial Intelligence, Machine Learning and Deep Learning as we learn it. Study notes and a curated list of awesome resources of such topics.
Stars: ✭ 831 (+163.81%)
Mutual labels: artificial-intelligence, jupyter-notebook, time-series, natural-language-processing
Pythondata
repo for code published on pythondata.com
Stars: ✭ 113 (-64.13%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Seaborn Tutorial
This repository is my attempt to help Data Science aspirants gain necessary Data Visualization skills required to progress in their career. It includes all the types of plot offered by Seaborn, applied on random datasets.
Stars: ✭ 114 (-63.81%)
Mutual labels: jupyter-notebook, data-science, data-analysis, data-visualization
Nlpaug
Data augmentation for NLP
Stars: ✭ 2,761 (+776.51%)
Mutual labels: artificial-intelligence, jupyter-notebook, data-science, natural-language-processing
Data Science Topics
Collection of useful data science topics along with code and articles in my data science blog. If you are using an RSS feeds reader, add this link to your reader to get the notifications on my latest articles.
If you prefer shorter versions of these blogs, subscribe to Data Science Simplified to receive bite-sized data science tips in your mailbox daily.
Contents
- Data Science Tools
- Machine Learning
- Natural Language Processing
- Computer Vision
- Time Series
- Visualization
- Mathematical Programming
- Scraping
- Python
- Terminal
- Linear Algebra
- Data Structure
- Statistics
- Applications
- Learning Tips
- Productive Tips
- Book Review
- Data Science Portfolio
Data Science Tools
Title | Medium article | Repository |
---|---|---|
How to Create Fake Data with Faker | link | link |
How to Embed Interactive Charts on your Medium Articles and Personal Website | link | link |
How to Monitor And Log your Machine Learning Experiment Remotely with HyperDash | link | link |
How to Leverage Visual Studio Code for your Data Science Projects | link | |
Introduction to Datapane: A Python Library to Build Interactive Reports | link | |
Datapane’s New Features: Create a Beautiful Dashboard in Python in a Few Lines of Code | link | link |
How to Share your Python Objects Across Different Environments in One Line of Code | link | link |
Top 4 Code Viewers for Data Scientist in VSCode | link | |
How to Create and View Interactive Cheatsheets on the Command-line | link | |
Introduction to Hydra.cc: A Powerful Framework to Configure your Data Science Projects | link | |
How to Share your Jupyter Notebook in 3 Lines of Code with Ngrok | link | |
Introduction to Schema: A Python Libary to Validate your Data | link | link |
Pytest for Data Scientists | link | link |
Introduction to DVC: Data Version Control Tool for Machine Learning Projects | link | link |
Incorporate the Best Practices for Python with These Top 4 VSCode Extensions | link | |
4 Lessor-Known Yet Awesome Tips for Pytest | link | link |
How to Effortlessly Publish your Python Package to PyPI Using Poetry | link | link |
Machine Learning
Title | Medium article | Repository |
---|---|---|
How to Efficiently Fine-Tune your Machine Learning Models | link | link |
How to Learn Non-linear Dataset with Support Vector Machines | link | link |
Introduction to IBM Federated Learning: A Collaborative Approach to Train ML Models on Private Data | link | link |
3 Steps to Improve your Efficiency when Hypertuning ML Models | link | |
Train your Machine Learning Model 150x Faster with cuML | link | link |
Introduction to Weight & Biases: Track and Visualize your Machine Learning Experiments in 3 Lines of Code | link | link |
Natural Language Processing
Title | Medium article | Repository |
---|---|---|
Sentiment Analysis of LinkedIn Messages | link | link |
Find Common Words in Article with Python Module Newspaper and NLTK | link | link |
How to Tokenize Tweets with Python | link | link |
How to Solve Analogies with Word2Vec | link | link |
What is PyTorch | link | link |
Convolutional Neural Network in Natural Language Processing | link | link |
Supercharge your Python String with TextBlob | link | link |
pyLDAvis: Topic Modelling Exploration Tool That Every NLP Data Scientist Should Know | link | link |
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge | link | link |
Computer Vision
Title | Medium article | Repository |
---|---|---|
How to Create an App to Classify Dogs Using fastai and Streamlit | link | link |
Time Series
Title | Medium article | Repository |
---|---|---|
3 Ways to Extract Features from Dates with Python | link | link |
Visualization
Title | Medium article | Repository |
---|---|---|
What I Learned from Scraping 15k Data Science Articles on Medium | link | link |
How to Create Interactive Plots with Altair | link | link |
How to Create a Drop-Down Menu and a Slide Bar for your Favorite Visualization Tool | link | link |
I Scraped more than 1k Top Machine Learning Github Profiles and this is what I Found | link | link |
Top 6 Python Libraries for Visualization: Which one to Use? | link | link |
Introduction to Yellowbrick: A Python Library to Visualize the Prediction of your Machine Learning Model | link | link |
Visualize Gender-Specific Tweets with Scattertext | link | link |
Visualize Your Team’s Projects Using Python Gantt Chart | link | link |
Mathematical Programming
Title | Medium article | Repository |
---|---|---|
How to choose stocks to invest in with Python | link | link |
Maximize your Productivity with Python | link | link |
How to Find a Good Match with Python | link | link |
Scraping
Title | Medium article | Repository |
---|---|---|
Web Scrape Movie Database with Beautiful Soup | link | link |
top-github-scraper: Scrape Top Github Users and Repositories Based On a Keyword in One Line of Code | link | link |
Python
Title | Medium article | Repository |
---|---|---|
Numpy Tricks for your Data Science Projects | link | link |
Cython-A Speed-Up Tool for your Python Function | link | link |
Timing for Efficient Python Code | link | link |
How to Use Lambda for Efficient Python Code | link | link |
Python Tricks for Keeping Track of Your Data | link | link |
Boost Your Efficiency With Specialized Dictionary Implementations in Python | link | link |
Dictionary as an Alternative to If-Else | link | link |
How to Use Zip to Manipulate a List of Tuples | link | link |
Get the Most out of Your Array With These Four Numpy Methods | link | link |
3 Python Tricks to Read, Create, and Run Multiple Files Automatically | link | link |
How to Exclude the Outliers in Pandas DataFrame | link | link |
Python Clean Code: 6 Best Practices to Make Your Python Functions More Readable | link | link |
3 Techniques to Effortlessly Import and Execute Python Modules | link | link |
Terminal
Title | Medium article | Repository |
---|---|---|
Understand CSV Files from your Terminal with XSV | link | |
Prettify your Terminal Text With Termcolor and Pyfiglet | link | link |
Stop Using Print to Debug in Python. Use Icecream Instead | link | |
Rich: Generate Rich and Beautiful Text in the Terminal with Python | link | link |
Linear Algebra
Title | Medium article | Repository |
---|---|---|
How to Build a Matrix Module from Scratch | link | link |
Linear Algebra for Machine Learning: Solve a System of Linear Equations | link | link |
Data Structure
Title | Medium article | Repository |
---|---|---|
Convex Hull: An Innovative Approach to Gift-Wrap your Data | link | link |
How to Visualize Social Network With Graph Theory | link | link |
How to Search Data with KDTree | link | link |
How to Find the Nearest Hospital with a Voronoi Diagram | link | link |
Statistics
Title | Medium article | Repository |
---|---|---|
Can Datasets of a Dinosaur and a Circle have Identical Statistics? | link | |
Introduction to One-Way ANOVA: A Test to Compare the Means between More than Two Groups | link | link |
Applications
Title | Medium article | Repository |
---|---|---|
How to Create an Interactive Startup Growth Calculator with Python | link | link |
Streamlit and spaCy: Create an App to Predict Sentiment and Word Similarities with Minimal Domain Knowledge | link | link |
Learning Tips
Title | Medium article | Repository |
---|---|---|
How to Learn Data Science when Life does not Give You a Break | link | |
How to Accelerate your Data Science Career by Putting yourself in the Right Environment | link | |
To become a Better Data Scientist, you need to Think like a Programmer | link | |
How not to be Overwhelmed with Data Science | link |
Productive Tips
Title | Medium article | Repository |
---|---|---|
Boost Your Efficiency with Customized Code Snippets on VSCode | link | |
How to Organize your Data Science Articles with Github | link | link |
How to Create Reusable Command-Line | link | |
5 Reasons why you should Switch from Jupyter Notebook to Scripts | link | |
3 Ways to Get Notified with Python | link | link |
Top 9 Keyboard Shortcuts in VSCode for Data Scientists | link | |
7 Reasons Why you Should Start Documenting your Code | link |
Book Review
Title | Medium article | Repository |
---|---|---|
Python Machine Learning: A Comprehensive Handbook for Machine Learning | link |
Data Science Portfolio
Title | Medium article | Repository |
---|---|---|
How to Create an Elegant Website for your Data Science Portfolio in 10 minutes | link | |
Build an Impressive Github Profile in 3 Steps | link |
Note that the project description data, including the texts, logos, images, and/or trademarks,
for each open source project belongs to its rightful owner.
If you wish to add or remove any projects, please contact us at [email protected].