Stocksmachine learning web app game where the user competes against the AI in picking stocks
Stars: ✭ 108 (+332%)
Product-Categorization-NLPMulti-Class Text Classification for products based on their description with Machine Learning algorithms and Neural Networks (MLP, CNN, Distilbert).
Stars: ✭ 30 (+20%)
datahubDataHub - Synthetic data library
Stars: ✭ 66 (+164%)
Ml CheatsheetA constantly updated python machine learning cheatsheet
Stars: ✭ 136 (+444%)
ml-workflow-automationPython Machine Learning (ML) project that demonstrates the archetypal ML workflow within a Jupyter notebook, with automated model deployment as a RESTful service on Kubernetes.
Stars: ✭ 44 (+76%)
Lambda PacksPrecompiled packages for AWS Lambda
Stars: ✭ 997 (+3888%)
skippaSciKIt-learn Pipeline in PAndas
Stars: ✭ 33 (+32%)
Data Analysis主要是爬虫与数据分析项目总结,外加建模与机器学习,模型的评估。
Stars: ✭ 142 (+468%)
skutilNOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-learn and h2o extension classes (as well as caret classes for python). See more here: https://tgsmith61591.github.io/skutil
Stars: ✭ 29 (+16%)
resolutions-2019A list of data mining and machine learning papers that I implemented in 2019.
Stars: ✭ 19 (-24%)
Text-SummarizationAbstractive and Extractive Text summarization using Transformers.
Stars: ✭ 38 (+52%)
pybacenThis library was developed for economic analysis in the Brazilian scenario (Investments, micro and macroeconomic indicators)
Stars: ✭ 40 (+60%)
long-short-transformerImplementation of Long-Short Transformer, combining local and global inductive biases for attention over long sequences, in Pytorch
Stars: ✭ 103 (+312%)
muneSimple stock price analytics
Stars: ✭ 14 (-44%)
datascienvdatascienv is package that helps you to setup your environment in single line of code with all dependency and it is also include pyforest that provide single line of import all required ml libraries
Stars: ✭ 53 (+112%)
whyqddata wrangling simplicity, complete audit transparency, and at speed
Stars: ✭ 16 (-36%)
pandas twitterAnalyzing Trump's tweets using Python (Pandas + Twitter workshop)
Stars: ✭ 81 (+224%)
deepconsensusDeepConsensus uses gap-aware sequence transformers to correct errors in Pacific Biosciences (PacBio) Circular Consensus Sequencing (CCS) data.
Stars: ✭ 124 (+396%)
Active-Explainable-ClassificationA set of tools for leveraging pre-trained embeddings, active learning and model explainability for effecient document classification
Stars: ✭ 28 (+12%)
PracticalMachineLearningA collection of ML related stuff including notebooks, codes and a curated list of various useful resources such as books and softwares. Almost everything mentioned here is free (as speech not free food) or open-source.
Stars: ✭ 60 (+140%)
Data-Science-101Notes and tutorials on how to use python, pandas, seaborn, numpy, matplotlib, scipy for data science.
Stars: ✭ 19 (-24%)
anonymisationAnonymization of legal cases (Fr) based on Flair embeddings
Stars: ✭ 85 (+240%)
weaverbirdA visual data pipeline builder with various backends
Stars: ✭ 65 (+160%)
Transformers-TutorialsThis repository contains demos I made with the Transformers library by HuggingFace.
Stars: ✭ 2,828 (+11212%)
pyjanitorClean APIs for data cleaning. Python implementation of R package Janitor
Stars: ✭ 970 (+3780%)
espandasReading and writing pandas DataFrames in Elasticsearch
Stars: ✭ 24 (-4%)
dstoolboxTools that make working with scikit-learn and pandas easier.
Stars: ✭ 43 (+72%)
kobe-every-shot-everA Los Angeles Times analysis of Every shot in Kobe Bryant's NBA career
Stars: ✭ 66 (+164%)
bert-squeeze🛠️ Tools for Transformers compression using PyTorch Lightning ⚡
Stars: ✭ 56 (+124%)
toucan-connectorsConnectors available to retrieve data in Toucan Toco small apps
Stars: ✭ 13 (-48%)
pandas-workshopAn introductory workshop on pandas with notebooks and exercises for following along.
Stars: ✭ 161 (+544%)
DatscanDatScan is an initiative to build an open-source CMS that will have the capability to solve any problem using data Analysis just with the help of various modules and a vast standardized module library
Stars: ✭ 13 (-48%)
es pandasRead, write and update large scale pandas DataFrame with Elasticsearch
Stars: ✭ 34 (+36%)
KMeans elbowCode for determining optimal number of clusters for K-means algorithm using the 'elbow criterion'
Stars: ✭ 35 (+40%)
grailerweb scraping tool for grailed.com
Stars: ✭ 30 (+20%)
wechselCode for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.
Stars: ✭ 39 (+56%)
python-eodhistoricaldataDownload data from EOD historical data https://eodhistoricaldata.com/ using Python, Requests and Pandas.
Stars: ✭ 67 (+168%)
bagging puSimple sklearn based python implementation of Positive-Unlabeled (PU) classification using bagging based ensembles
Stars: ✭ 73 (+192%)
read-protobufSmall library to read serialized protobuf(s) directly into Pandas Dataframe
Stars: ✭ 28 (+12%)
hh researchАвтоматизация поиска и исследования вакансий с сайта hh.ru (Headhunter) с помощью методов Python. Классификация данных, поиск статистических параметров.
Stars: ✭ 36 (+44%)
trackanimationTrack Animation is a Python 2 and 3 library that provides an easy and user-adjustable way of creating visualizations from GPS data.
Stars: ✭ 74 (+196%)
anestheticNested Sampling post-processing and plotting
Stars: ✭ 34 (+36%)
ferFacial Expression Recognition
Stars: ✭ 32 (+28%)
hamiltonA scalable general purpose micro-framework for defining dataflows. You can use it to create dataframes, numpy matrices, python objects, ML models, etc.
Stars: ✭ 612 (+2348%)