# Swedish BERT models
Arbetsförmedlingen (the Swedish Public Employment Service) has developed Swedish BERT models trained on Swedish Wikipedia, a corpus of approximately 2 million articles and 300 million words.
## Available Model Types

- `bert-base-swedish-uncased`: 12-layer, 768-hidden, 12-heads, 110M parameters
- `bert-large-swedish-uncased`: 24-layer, 1024-hidden, 16-heads, 340M parameters
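The listed architecture numbers can be checked against the published model configurations. The following is a minimal sketch; it only assumes the models are reachable under the `af-ai-center` namespace used in the Usage section below:

```python
from transformers import BertConfig

# Fetch only the configuration (no model weights) and compare it
# against the numbers listed above for the base model.
config = BertConfig.from_pretrained('af-ai-center/bert-base-swedish-uncased')
print(config.num_hidden_layers)    # expected: 12
print(config.hidden_size)          # expected: 768
print(config.num_attention_heads)  # expected: 12
```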
## Usage

The models can be used with the `transformers` package like any other built-in or community-uploaded model. This means that both the tokenizer and the model can be instantiated using the `from_pretrained()` method of the BERT-related `transformers` classes, like so:
```python
from transformers import BertTokenizer, BertModel, TFBertModel

pretrained_model_name = 'af-ai-center/bert-base-swedish-uncased'

# Tokenizer (shared between the PyTorch and TensorFlow models)
tokenizer = BertTokenizer.from_pretrained(pretrained_model_name)

# PyTorch
model = BertModel.from_pretrained(pretrained_model_name)

# TensorFlow
model = TFBertModel.from_pretrained(pretrained_model_name)
```
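As a quick end-to-end illustration, the tokenizer and the PyTorch model can then be combined to produce contextual embeddings. This is a minimal sketch, assuming a recent `transformers` version where tokenizers are callable and model outputs expose `last_hidden_state`:

```python
import torch

# Tokenize an example Swedish sentence and run a forward pass.
inputs = tokenizer("Stockholm är Sveriges huvudstad.", return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# One 768-dimensional vector per input token.
print(outputs.last_hidden_state.shape)  # torch.Size([1, seq_len, 768])
```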
## Getting Started

The notebook `getting_started_with_swebert.ipynb` shows in more detail how to use the models. Make sure to run it in a virtual environment with the following packages installed:

```bash
pip install torch tensorflow transformers tokenizers notebook
```
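After installing, a short sanity check (a sketch, not part of the notebook itself) confirms that the three core packages import correctly and reports their versions:

```python
# Verify the environment before opening the notebook.
import torch
import tensorflow as tf
import transformers

print("torch:", torch.__version__)
print("tensorflow:", tf.__version__)
print("transformers:", transformers.__version__)
```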
## Remarks

- Note that the corpus our Swedish BERT models were trained on is significantly smaller than the corpus used for the original English BERT models.
- We are part of an ongoing effort to create more sophisticated Swedish language models; see https://www.ri.se/sv/vad-vi-gor/projekt/sprakmodeller-svenska-myndigheter