All Projects → stanleyjzheng → PyData-Pseudolabelling-Keynote

stanleyjzheng / PyData-Pseudolabelling-Keynote

Licence: other
Accompanying notebook and sources to "A Guide to Pseudolabelling: How to get a Kaggle medal with only one model" (Dec. 2020 PyData Boston-Cambridge Keynote)

Programming Languages

Jupyter Notebook
11667 projects

Projects that are alternatives of or similar to PyData-Pseudolabelling-Keynote

awesome-kaggle-kernels
Compilation of good Kaggle Kernels.
Stars: ✭ 51 (+121.74%)
Mutual labels:  kaggle
kdsb17
Gaussian Mixture Convolutional AutoEncoder applied to CT lung scans from the Kaggle Data Science Bowl 2017
Stars: ✭ 18 (-21.74%)
Mutual labels:  kaggle
kaggledatasets
Collection of Kaggle Datasets ready to use for Everyone (Looking for contributors)
Stars: ✭ 44 (+91.3%)
Mutual labels:  kaggle
Kaggle-Cdiscount-Image-Classification-Challenge
No description or website provided.
Stars: ✭ 15 (-34.78%)
Mutual labels:  kaggle
Data-Science-Articles
A collection of my blogs on Data Science and Machine learning.
Stars: ✭ 66 (+186.96%)
Mutual labels:  kaggle
kaggle-malware-classification
Kaggle "Microsoft Malware Classification Challenge". 6th place solution
Stars: ✭ 29 (+26.09%)
Mutual labels:  kaggle
kaggle-berlin
Material of the Kaggle Berlin meetup group!
Stars: ✭ 36 (+56.52%)
Mutual labels:  kaggle
Data-Science-Projects
Data Science projects on various problem statements and datasets using Data Analysis, Machine Learning Algorithms, Deep Learning Algorithms, Natural Language Processing, Business Intelligence concepts by Python
Stars: ✭ 28 (+21.74%)
Mutual labels:  kaggle
tales-science-data
Companion repo to the GitBook, notes on Data Science topics
Stars: ✭ 41 (+78.26%)
Mutual labels:  pydata
Dog-Breed-Identification-Gluon
Kaggle 120种狗分类,Gluon实现
Stars: ✭ 45 (+95.65%)
Mutual labels:  kaggle
digit recognizer
CNN digit recognizer implemented in Keras Notebook, Kaggle/MNIST (0.995).
Stars: ✭ 27 (+17.39%)
Mutual labels:  kaggle
kaggle-airbnb
🌍 Where will a new guest book their first travel experience?
Stars: ✭ 53 (+130.43%)
Mutual labels:  kaggle
open-solution-cdiscount-starter
Open solution to the Cdiscount’s Image Classification Challenge
Stars: ✭ 20 (-13.04%)
Mutual labels:  kaggle
Kaggle-Quora-Question-Pairs
This is our team's solution report, which achieves top 10% (305/3307) in this competition.
Stars: ✭ 58 (+152.17%)
Mutual labels:  kaggle
fer
Facial Expression Recognition
Stars: ✭ 32 (+39.13%)
Mutual labels:  kaggle
Recruit-Restaurant-Visitor-Forecasting
6th place solution for Recruit-Restaurant-Visitor-Forecasting
Stars: ✭ 16 (-30.43%)
Mutual labels:  kaggle
histopathologic cancer detector
CNN histopathologic tumor identifier.
Stars: ✭ 26 (+13.04%)
Mutual labels:  kaggle
gender-unbiased BERT-based pronoun resolution
Source code for the ACL workshop paper and Kaggle competition by Google AI team
Stars: ✭ 42 (+82.61%)
Mutual labels:  kaggle
open-solution-ship-detection
Open solution to the Airbus Ship Detection Challenge
Stars: ✭ 54 (+134.78%)
Mutual labels:  kaggle
MSDS696-Masters-Final-Project
Earthquake Prediction Challenge with LightGBM and XGBoost
Stars: ✭ 58 (+152.17%)
Mutual labels:  kaggle

A Guide to Pseudolabelling: How to get a Kaggle medal with only one model

PyData Boston-Cambridge Dec. 2020

Talk now on YouTube: https://youtu.be/c8uWUOSGYUI

Notebook on Kaggle: https://www.kaggle.com/stanleyjzheng/exploring-pseudolabelling-schemes-pydata

Me

References

[1] Babakhin, Y., Sanakoyeu, A., & Kitamura, H. (2019). Semi-supervised segmentation of salt bodies in seismic images using an ensemble of convolutional neural networks. ArXiv:1904.04445. http://arxiv.org/abs/1904.04445

[2] Internet.org and Facebook (2013). A Focus on Efficiency. [White paper]. Archived on webarchive.org

[3] Li, Z., Ko, B., & Choi, H.-J. (2019). Naive semi-supervised deep learning using pseudo-label. Peer-to-Peer Networking and Applications, 12(5), 1358–1368. https://doi.org/10.1007/s12083-018-0702-9

  • Paper describing pretrain method for pseudolabelling as well as results from LSTM, CIFAR, MNIST.

[4] Xie, Q., Luong, M.-T., Hovy, E., & Le, Q. V. (2020). Self-training with Noisy Student improves ImageNet classification. ArXiv:1911.04252 [Cs, Stat]. http://arxiv.org/abs/1911.04252

[5] Global Wheat competition: https://www.kaggle.com/c/global-wheat-detection

[6] The following are solutions from the mentioned competitions: OpenVaccine 1st, OpenVaccine 2nd, OpenVaccine 3rd, Tweet sentiment extraction 1st, TReNDS Neuroimaging 1st, Global Wheat 1st, LISH-MOA 2nd public, LISH-MOA 5th, TGS Salt Identification 1st

[7] My notebook on pseudolabelling MNIST https://www.kaggle.com/stanleyjzheng/exploring-pseudolabelling-schemes-pydata

Further reading

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].