Magic-Bubble / Zhihu

Second-place solution for the Zhihu Kanshan Cup (知乎看山杯)

Projects that are alternatives of or similar to Zhihu

Megnet
Graph Networks as a Universal Machine Learning Framework for Molecules and Crystals
Stars: ✭ 242 (-1.22%)
Mutual labels:  jupyter-notebook
Abu ml
Machine learning technology research lab, by the Abu Quant team
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Delf Pytorch
PyTorch Implementation of "Large-Scale Image Retrieval with Attentive Deep Local Features"
Stars: ✭ 245 (+0%)
Mutual labels:  jupyter-notebook
Bertviz
Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert, XLNet, RoBERTa, CTRL, etc.)
Stars: ✭ 3,443 (+1305.31%)
Mutual labels:  jupyter-notebook
Hackergame2018 Writeups
Write-ups for hackergame 2018
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Smpybandits
🔬 Research Framework for Single and Multi-Players 🎰 Multi-Arms Bandits (MAB) Algorithms, implementing all the state-of-the-art algorithms for single-player (UCB, KL-UCB, Thompson...) and multi-player (MusicalChair, MEGA, rhoRand, MCTop/RandTopM etc).. Available on PyPI: https://pypi.org/project/SMPyBandits/ and documentation on
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Cellpose
a generalist algorithm for cellular segmentation
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Yolo Series
A series of notebooks describing how to use YOLO (darkflow) in python
Stars: ✭ 245 (+0%)
Mutual labels:  jupyter-notebook
Kdepy
Kernel Density Estimation in Python
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Human body prior
VPoser: Variational Human Pose Prior
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Aind2 Cnn
AIND Term 2 -- Lesson on Convolutional Neural Networks
Stars: ✭ 243 (-0.82%)
Mutual labels:  jupyter-notebook
Deeplearningcoursecodes
Stars: ✭ 243 (-0.82%)
Mutual labels:  jupyter-notebook
2016 01 Tennis Betting Analysis
Methodology and code supporting the BuzzFeed News/BBC article, "The Tennis Racket," published Jan. 17, 2016.
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Normalizing Flows Tutorial
Tutorial on normalizing flows.
Stars: ✭ 243 (-0.82%)
Mutual labels:  jupyter-notebook
Recmetrics
A library of metrics for evaluating recommender systems
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook
Taco
🌮 Trash Annotations in Context Dataset Toolkit
Stars: ✭ 243 (-0.82%)
Mutual labels:  jupyter-notebook
Data Cleaning 101
Data Cleaning Libraries with Python
Stars: ✭ 243 (-0.82%)
Mutual labels:  jupyter-notebook
Link Prediction
Representation learning for link prediction within social networks
Stars: ✭ 245 (+0%)
Mutual labels:  jupyter-notebook
Guided Evolutionary Strategies
Guided Evolutionary Strategies
Stars: ✭ 245 (+0%)
Mutual labels:  jupyter-notebook
Fouriertalkoscon
Presentation Materials for my "Sound Analysis with the Fourier Transform and Python" OSCON Talk.
Stars: ✭ 244 (-0.41%)
Mutual labels:  jupyter-notebook

2017 Zhihu Kanshan Cup Machine Learning Challenge

Solution by team Koala

Environment

Built on Python 2 and PyTorch; the following packages need to be installed:

  1. PyTorch
  2. Numpy
  3. visdom
  4. fire

To run:

cd src; pip2 install -r requirements.txt
python2 -m visdom.server
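
The visdom server started above is used to monitor training. As a minimal sketch (not the repo's actual utils code; the environment name and loss values are illustrative assumptions), a training loop could push its loss curve to that server like this:

import numpy as np
from visdom import Visdom

vis = Visdom(env='zhihu')                                 # hypothetical environment name
win = vis.line(X=np.array([0]), Y=np.array([0.0]),
               opts=dict(title='train loss'))             # create the plot window once
for step, loss in enumerate([0.9, 0.7, 0.6], start=1):    # dummy loss values
    vis.line(X=np.array([step]), Y=np.array([loss]),
             win=win, update='append')                    # append one point per step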

Data analysis

The code is in the data_analysis folder.

Data preprocessing

The code is in the data_preprocess folder: question_preprocess, label_preprocess and topic_preprocess, each available as both a notebook and a .py script.

Single-model training

The code is in the src folder; create a snapshots folder inside it to store model checkpoints.

  • dataset: data loading code
  • models: all model definition files; FastText.py, TextCNN.py and RNN.py are the ones mainly used
  • utils: utility code such as model loading/saving, logging, visualization and score-matrix handling
  • config.py: configuration file; its fields can be overridden from the command line at run time
  • main.py: the entry point for all programs

Run inside src:

python2 main.py train --model=RNN --use_word=True --batch_size=256

The flags after the command above are all configurable parameters:

  • model: the model to use (matching a file name under models; results are saved under snapshots/<model name>/)
  • use_word: train on word-level input; to train on char-level input, pass --use_char=True instead
  • batch_size: the training batch size; reduce it if GPU memory is insufficient
  • other parameters are listed in config.py (see the sketch after this list for how these flags are wired up)
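
A minimal sketch, assuming a fire-based dispatch (fire is listed as a dependency; the defaults below are illustrative, not the repo's actual config.py values), of how main.py could expose the train command and let command-line flags override config.py:

import fire

class DefaultConfig(object):
    # illustrative defaults; the real values live in src/config.py
    model = 'TextCNN'
    use_word = True
    use_char = False
    batch_size = 256
    boost = False
    base_layer = 0

def train(**kwargs):
    cfg = DefaultConfig()
    for k, v in kwargs.items():          # flags like --model=RNN override the defaults
        setattr(cfg, k, v)
    print('training %s with batch_size=%d' % (cfg.model, cfg.batch_size))
    # ... build the model from models/, load data via dataset/, run the training loop ...

if __name__ == '__main__':
    fire.Fire({'train': train})          # enables: python2 main.py train --model=RNN --batch_size=256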

Boosting model training

A single model can only achieve so much on its own. Analyzing the data, we found that a model is biased across classes: it may get every sample of one class wrong while getting every sample of another class right, and this inter-class variance hurts prediction performance considerably. To address this bias we borrowed the idea of boosting and adopted a novel scheme: train several corrective layers on top of the previous results and accumulate (sum) their outputs.

Run inside src:

python2 main.py train --model=RNN --use_word=True --batch_size=256 --boost=True --base_layer=0

Change base_layer to 1, 2, 3, ... in turn to train layer by layer; the accumulated training results are saved in the same directory as the model.
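
A rough sketch of the accumulation step as I read it (not the repo's code): each boosting layer yields a score matrix over all topics, the layers' scores are summed, and the top 5 topics per question are kept (5 topics per question being the competition's output format):

import numpy as np

def accumulate_layers(layer_scores):
    # layer_scores: list of (num_questions, num_topics) score matrices,
    # one per boosting layer, e.g. loaded from snapshots/<model>/
    total = np.zeros_like(layer_scores[0])
    for scores in layer_scores:
        total += scores                  # corrective layers are simply summed
    return total

def top5(total_scores):
    # indices of the 5 highest-scoring topics for each question
    return np.argsort(-total_scores, axis=1)[:, :5]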

Results per model

Offline (validation) scores; online scores are roughly 0.002 or more higher.

Word-level results

Single models:

  • FastText: 0.4097
  • TextCNN: 0.4111
  • RNN: 0.4116

Boosting models:

  • FastText, 10 layers: 0.41892
  • RNN, 10 layers: 0.42642
  • TextCNN, 10 layers: 0.42654

Char-level results are about one percentage point (0.01) lower than the word-level results, but fusing char and word models gains roughly 0.003.

Testing

  • To load a trained model and run it on the test set, see gen_test_res.py
  • To directly fuse the result files produced by each model's test run, see utils/resmat.py (a fusion sketch follows this list)
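
A minimal fusion sketch, assuming each model's test output is saved as a (num_questions, num_topics) score matrix in .npy format; the file names and weights below are hypothetical, and utils/resmat.py holds the actual fusion code:

import numpy as np

def fuse(score_files, weights=None):
    mats = [np.load(f) for f in score_files]
    if weights is None:
        weights = [1.0 / len(mats)] * len(mats)      # default: plain average
    fused = sum(w * m for w, m in zip(weights, mats))
    return np.argsort(-fused, axis=1)[:, :5]         # top-5 topic indices per question

# e.g. fuse(['word_rnn_res.npy', 'char_rnn_res.npy'], weights=[0.6, 0.4])  # hypothetical file names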

Details
