geometric-smoteImplementation of the Geometric SMOTE over-sampling algorithm.
SyntheticSunSyntheticSun is a defense-in-depth security automation and monitoring framework which utilizes threat intelligence, machine learning, managed AWS security services and, serverless technologies to continuously prevent, detect and respond to threats.
prostoProsto is a data processing toolkit radically changing how data is processed by heavily relying on functions and operations with functions - an alternative to map-reduce and join-groupby
jazzThe Scripting Engine that Combines Speed, Safety, and Simplicity
BCGThe BCG Open-Access Data Science & Advanced Analytics Virtual Experience Program
fastMLA Python package built on sklearn for running a series of classification Algorithms in a faster and easier way.
diabetes use caseSample use case for Xavier AI in Healthcare conference: https://www.xavierhealth.org/ai-summit-day2/
mindwareAn efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.
cortana-intelligence-customer360This repository contains instructions and code to deploy a customer 360 profile solution on Azure stack using the Cortana Intelligence Suite.
notebookWeb based Clojure notebook application/-library.
telleryTellery lets you build metrics using SQL and bring them to your team. As easy as using a document. As powerful as a data modeling tool.
Open-Data-Laban initiative to provide infrastructure for reproducible workflows around open data
HARRecognize one of six human activities such as standing, sitting, and walking using a Softmax Classifier trained on mobile phone sensor data.
pixiedust-facebook-analysisA Jupyter notebook that uses the Watson Visual Recognition and Natural Language Understanding services to enrich Facebook Analytics and uses Cognos Dashboard Embedded to explore and visualize the results in Watson Studio
teach-r-onlineMaterials for the Teaching statistics and data science online workshops in July 2020
kwxBERT, LDA, and TFIDF based keyword extraction in Python
r4dswebsitePublic repository for the R4DS community website.
ZS-Data-Science-ChallengeA Data science challenge - "Mekktronix Sales Forecasting" organised by ZS through Hackerearth platform. Rank: 223 out of 4743.
policy-data-analyzerBuilding a model to recognize incentives for landscape restoration in environmental policies from Latin America, the US and India. Bringing NLP to the world of policy analysis through an extensible framework that includes scraping, preprocessing, active learning and text analysis pipelines.
bc-population-indicatorR scripts for an indicator on trends in B.C.'s population size & distribution published on Environmental Reporting BC
beneathBeneath is a serverless real-time data platform ⚡️
zen-do-rUm livro sobre programação para não-programadores.
growthbookOpen Source Feature Flagging and A/B Testing Platform
ISLR-PythonNotes and implementations in Python for ISLR.
tutorialsGit Repo for Articles on Ergo Sum blog and the youtube channel https://www.youtube.com/channel/UCiie9CN--dazA7iT2sry5FA
data vis statistics geosciencesThis repository contains the laboratory portion of an upper level undergraduate class in Python on data visualization and statistics for geo & space scientists. Labs are updated when the course is in session through the most recent branch. See master version for current class.
labs-fa17Lab notebooks for the Fall 2017 offering of Georgia Tech's CSE 6040
visionsType System for Data Analysis in Python
skip-thought-ganGenerating Text through Adversarial Training(GAN) using Skip-Thought Vectors
opendatasetsA Python library for downloading datasets from Kaggle, Google Drive, and other online sources.
ntds 2016Material for the EPFL master course "A Network Tour of Data Science", edition 2016.
aduanaFrontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even when making big crawls (one billion pages).
coronavirus-statsAutomatically scrape data and statistics on Coronavirus to make them easily accessible in CSV format
Open-SentencingTo help public defenders better serve their clients, Open Sentencing shows racial bias in data such as demographics providing insights for each case
dcbenchA benchmark of data-centric tasks from across the machine learning lifecycle.
Pythonthis resporatory have ml,ai,nlp,data science etc.python language related material from many websites eg. datacamp,geeksforgeeks,linkedin,youtube,udemy etc. also it include programming challange/competion solutions
infoAll the general information you'll ever need about pursuing AI in Pakistan!
karan36k.github.ioThese are all the articles and pages I have in my data science website. I try to transcribe all I learn and post regularly. Please visit and feel free to email me for suggestions.
olliePyOlliePy is a python package which can help data scientists in exploring their data and evaluating and analysing their machine learning experiments by utilising the power and structure of modern web applications. The data scientist only needs to provide the data and any required information and OlliePy will generate the rest.
pythonPython codes from tutorials on the Data Professor YouTube channel