All Projects → CyberZHG → Keras Multi Head

CyberZHG / Keras Multi Head

Licence: mit
A wrapper layer for stacking layers horizontally

Programming Languages

python
139335 projects - #7 most used programming language

Projects that are alternatives of or similar to Keras Multi Head

Heed
A fully typed LMDB/MDBX wrapper with minimum overhead
Stars: ✭ 142 (-17.44%)
Mutual labels:  wrapper
Vue Ui For Pc
基于Vue2.x的一套PC端UI组件,包括了Carousel 跑马灯、Cascader 级联、Checkbox 多选框、Collapse 折叠面板、DatePicker 日期选择、Dialog 对话框、Form 表单、Input 输入框、InputNumber 数字输入框、Layer 弹窗层、Loading 加载、Menu 菜单、Page 分页、Progress 进度条、Radio 单选框、SelectDropDown 仿select、Switch 开关、Table 表格、Tabs 标签页、Textarea 文本框、Tooltip 文字提示、BackTop 返回顶部、steps 步骤条、Transfer 穿梭框、Tree 树形、Upload 文件上传、Lazy 图片懒加载、Loading 加载、Pagination 分页等等
Stars: ✭ 156 (-9.3%)
Mutual labels:  layer
Spaces Api
An API wrapper for DigitalOcean's Spaces object storage designed for easy use.
Stars: ✭ 166 (-3.49%)
Mutual labels:  wrapper
Gl
Header-only C++17 wrapper for OpenGL 4.6 Core Profile.
Stars: ✭ 144 (-16.28%)
Mutual labels:  wrapper
Luwra
Minimal-overhead C++ wrapper for Lua
Stars: ✭ 152 (-11.63%)
Mutual labels:  wrapper
Zstd Rs
A rust binding for the zstd compression library.
Stars: ✭ 159 (-7.56%)
Mutual labels:  wrapper
Ytdlrc
☁️ Downloads videos and metadata with youtube-dl and moves each file on completion to an rclone remote
Stars: ✭ 140 (-18.6%)
Mutual labels:  wrapper
Config4k
A Kotlin wrapper for Typesafe Config
Stars: ✭ 168 (-2.33%)
Mutual labels:  wrapper
Py Kaldi Asr
Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.
Stars: ✭ 156 (-9.3%)
Mutual labels:  wrapper
Mailchimp Api
Super-simple, minimum abstraction MailChimp API v3 wrapper, in PHP
Stars: ✭ 1,977 (+1049.42%)
Mutual labels:  wrapper
Vue Axios
A small wrapper for integrating axios to Vuejs
Stars: ✭ 1,887 (+997.09%)
Mutual labels:  wrapper
React Layer Stack
Layering system for React. Useful for popover/modals/tooltip/dnd application
Stars: ✭ 152 (-11.63%)
Mutual labels:  layer
T14m4t
Automated brute-forcing attack tool.
Stars: ✭ 160 (-6.98%)
Mutual labels:  wrapper
Coinbasepro Csharp
The unofficial .NET/C# client library for the Coinbase Pro/GDAX API
Stars: ✭ 143 (-16.86%)
Mutual labels:  wrapper
Imageflow
A simple wrapper of TensorFlow for Converting, Importing (and Soon, Training) Images in tensorflow.
Stars: ✭ 166 (-3.49%)
Mutual labels:  wrapper
Youtube Music
🎵 A Mac app wrapper for music.youtube.com
Stars: ✭ 2,097 (+1119.19%)
Mutual labels:  wrapper
Desktop Google Keep Osx
A Super Simple Desktop Client for Mac OSX Built in Javascript and MacGap
Stars: ✭ 159 (-7.56%)
Mutual labels:  wrapper
Libuvsharp
.NET bindings for libuv
Stars: ✭ 170 (-1.16%)
Mutual labels:  wrapper
React Openlayers
OpenLayer React Components
Stars: ✭ 169 (-1.74%)
Mutual labels:  wrapper
Apriltag ros
A ROS wrapper of the AprilTag 3 visual fiducial detector
Stars: ✭ 160 (-6.98%)
Mutual labels:  wrapper

Keras Multi-Head

Travis Coverage Version Downloads License

A wrapper layer for stacking layers horizontally.

Install

pip install keras-multi-head

Usage

Duplicate Layers

The layer will be duplicated if only a single layer is provided. The layer_num argument controls how many layers will be duplicated eventually.

import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=100, output_dim=20, name='Embedding'))
model.add(MultiHead(keras.layers.LSTM(units=32), layer_num=5, name='Multi-LSTMs'))
model.add(keras.layers.Flatten(name='Flatten'))
model.add(keras.layers.Dense(units=4, activation='softmax', name='Dense'))
model.build()
model.summary()

Use Multiple-Layers

The first argument could also be a list of layers with different configurations, however, they must have the same output shapes.

import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=100, output_dim=20, name='Embedding'))
model.add(MultiHead([
    keras.layers.Conv1D(filters=32, kernel_size=3, padding='same'),
    keras.layers.Conv1D(filters=32, kernel_size=5, padding='same'),
    keras.layers.Conv1D(filters=32, kernel_size=7, padding='same'),
], name='Multi-CNNs'))
model.build()
model.summary()

Linear Transformation

The input data will be mapped to different values of the same shape for each layer when hidden_dim is given.

Regularization

The regularization is used when you expect to extract different features from the parallel layers. You can customize the indices of weights in the layers, the intervals represent the parts of the weights and the factor of the regularization.

For example, the bidirectional LSTM layer has 6 weights by default, and the first 3s belong to the forward layer. The 2nd weight (recurrent kernel) in the forward layer controls the computation of gates for recurrent connections. The kernel for computing cell states lays in units x 2 to units x 3 of the recurrent kernel. We can used the regularization for the kernels:

import keras
from keras_multi_head import MultiHead


model = keras.models.Sequential()
model.add(keras.layers.Embedding(input_dim=5, output_dim=3, name='Embed'))
model.add(MultiHead(
    layer=keras.layers.Bidirectional(keras.layers.LSTM(units=16), name='LSTM'),
    layer_num=5,
    reg_index=[1, 4],
    reg_slice=(slice(None, None), slice(32, 48)),
    reg_factor=0.1,
    name='Multi-Head-Attention',
))
model.add(keras.layers.Flatten(name='Flatten'))
model.add(keras.layers.Dense(units=2, activation='softmax', name='Dense'))
model.build()
  • reg_index: The indices of layer.get_weights(), a single integer or a list of integers.
  • reg_slice: slices or a tuple of slices or a list of the previous choices. If multiple indices are provided in reg_index and reg_slice is not a list, then reg_slice is assumed to be equal for all the indices. The whole array will be used if you leave this argument to None.
  • reg_factor: The factor of the regularization, a float or a list of floats.

Multi-Head Attention

A more specific multi-head layer is provided (since the general one is harder to use). The layer uses scaled dot product attention layers as its sub-layers and only head_num is required:

import keras
from keras_multi_head import MultiHeadAttention

input_layer = keras.layers.Input(
    shape=(2, 3),
    name='Input',
)
att_layer = MultiHeadAttention(
    head_num=3,
    name='Multi-Head',
)(input_layer)
model = keras.models.Model(inputs=input_layer, outputs=att_layer)
model.compile(
    optimizer='adam',
    loss='mse',
    metrics={},
)
model.summary()

The shapes of input and output tensors would be the same if only one layer is presented as input. The input layers will be considered as query, key and value when a list is given:

import keras
from keras_multi_head import MultiHeadAttention

input_query = keras.layers.Input(
    shape=(2, 3),
    name='Input-Q',
)
input_key = keras.layers.Input(
    shape=(4, 5),
    name='Input-K',
)
input_value = keras.layers.Input(
    shape=(4, 6),
    name='Input-V',
)
att_layer = MultiHeadAttention(
    head_num=3,
    name='Multi-Head',
)([input_query, input_key, input_value])
model = keras.models.Model(inputs=[input_query, input_key, input_value], outputs=att_layer)
model.compile(
    optimizer='adam',
    loss='mse',
    metrics={},
)
model.summary()
Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].