Cheap and reliable Node.js hosting starts at $3/month, and $1/month static HTML hosting

A generative network for animal vocalizations. For dimensionality reduction, sequencing, clustering, corpus-building, and generating novel 'stimulus spaces'. All with notebook examples using freely available datasets.

Stars: ✭ 50 (-57.26%)

Mutual labels: jupyter-notebook, segmentation

Mit Deep Learning

Tutorials, assignments, and competitions for MIT Deep Learning related courses.

Stars: ✭ 8,912 (+7517.09%)

Mutual labels: jupyter-notebook, segmentation

Deeplabv3 Plus

Tensorflow 2.3.0 implementation of DeepLabV3-Plus

Stars: ✭ 32 (-72.65%)

Mutual labels: jupyter-notebook, segmentation

Dlcv for beginners

《深度学习与计算机视觉》配套代码

Stars: ✭ 1,244 (+963.25%)

Mutual labels: segmentation, detection

Caffe Model

Caffe models (including classification, detection and segmentation) and deploy files for famouse networks

Stars: ✭ 1,258 (+975.21%)

Mutual labels: segmentation, detection

3dunet abdomen cascade

Stars: ✭ 91 (-22.22%)

Mutual labels: jupyter-notebook, segmentation

Social Media Depression Detector

😔 😞 😣 😖 😩 Detect depression on social media using the ssToT method introduced in our ASONAM 2017 paper titled "Semi-Supervised Approach to Monitoring Clinical Depressive Symptoms in Social Media"

Stars: ✭ 45 (-61.54%)

Mutual labels: jupyter-notebook, detection

Picanet

Stars: ✭ 35 (-70.09%)

Mutual labels: jupyter-notebook, detection

Jacinto Ai Devkit

Training & Quantization of embedded friendly Deep Learning / Machine Learning / Computer Vision models

Stars: ✭ 49 (-58.12%)

Mutual labels: segmentation, detection

Albumentations

Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125

Stars: ✭ 9,353 (+7894.02%)

Mutual labels: segmentation, detection

Multiclass Semantic Segmentation Camvid

Tensorflow 2 implementation of complete pipeline for multiclass image semantic segmentation using UNet, SegNet and FCN32 architectures on Cambridge-driving Labeled Video Database (CamVid) dataset.

Stars: ✭ 67 (-42.74%)

Mutual labels: jupyter-notebook, segmentation

Mtcnn

MTCNN face detection implementation for TensorFlow, as a PIP package.

Stars: ✭ 1,689 (+1343.59%)

Mutual labels: jupyter-notebook, detection

Medicaldetectiontoolkit

The Medical Detection Toolkit contains 2D + 3D implementations of prevalent object detectors such as Mask R-CNN, Retina Net, Retina U-Net, as well as a training and inference framework focused on dealing with medical images.

Stars: ✭ 917 (+683.76%)

Mutual labels: segmentation, detection

Training extensions

Trainable models and NN optimization tools

Stars: ✭ 857 (+632.48%)

Mutual labels: segmentation, detection

Fcn.tensorflow

Tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation (http://fcn.berkeleyvision.org)

Stars: ✭ 1,230 (+951.28%)

Mutual labels: jupyter-notebook, segmentation

View All Similar Projects ➔

(Check out my Medium post (https://goo.gl/Ussdr1) for more details of this project.)

A U-net based on Tensorflow for objection detection (or segmentation) of satellite images

The goal of this project is to develop models for Dstl Satellite Imagery Feature Detection contest on kaggle. The result scores 0.46 on the public test data set and 0.44 on the private test data set, would rank No. 7 out of 419 on the private leaderboard.

The training dataset includes 25 images, each with 20 channels (RGB band (3 channels) + A band (8 channels) + M band (8 channels) + P band (1 channel)), and the corresponding labels of objects. There are 10 types of overlapping objects labeled with contours (wkt type of data), including 0. Buildings, 1. Misc, 2. Road, 3. Track, 4. Trees, 5. Crops, 6. Waterway, 7. Standing water, 8. Vehicle Large, 9. Vehicle Small.

This code converts the contours into masks, and then trains a pixel-wise binary classifier for each class of object. A U-net with batch norm developed in tensorflow is used as the classification model. A combination of cross entropy and soft Jaccard index, and the Adam optimizer are the loss function and the optimizer respectively. The following figures show examples of the training features and labels from one of the training examples. The code to generate these figures can be found in visualization.ipynb.

This figure shows the statistics of percentage area for all classes of all the training data. (Note: on some images, the sum is over 100% because of overlap between classes.)

Prerequisites

Suggested hardware

Nvidia K80 Tesla GPU
61 GB RAM

The model was developed and trained on a p2.xlarge instance on AWS, which comes with the above hardware. At the beginning of the training for each class, all the 25 training images and the corresponding labels are loaded into RAM to avoid file I/O during the training, which can slow down the training. Therefore a large RAM (up to 50 GB) is required. The batch size and patches size of images in training and predictions are also customized for the ~11 GB memory on K80 GPU. These parameters should be adjusted according to your hardware.

Software and Packages

python == 2.7
tensorflow == 1.0.1
descartes == 1.1.0
matplotlib == 2.0.0
numpy == 1.12.0
opencv-python == 3.3.0.9
pandas == 0.20.3
seaborn == 0.7.1
shapely == 1.6.0
simplejson == 3.10.0
tifffile == 0.12.0

To install all the requirements:

pip install -r requirements.txt
conda install -c https://conda.binstar.org/menpo opencv3

Download the data

Download the data from contest website: https://www.kaggle.com/c/dstl-satellite-imagery-feature-detection/data

Put the data into the ./data/ folder.

Train the model

The model is built to train a voxel-wise binary classifier for each of the 10 classes. Change the parameter class_type to a number of 0-9 in ./hypes/hypes.json to switch between classes. Run the following in the terminal to train a model for each class:

python train.py |& tee output.txt

All the print out is saved in output.txt. All other logs for each training is saved at a folder in ./log_dir, with a folder name of ./log_dir/month-day_hour-min_lossfunction, including a TF checkpoint for every 1000-batch, a summary point for every 100-batch, and the hyper parameters for the training. The last TF checkpoint is used to generate predictions. The final version of this code includes all the labeled data for training. You can set the test_names in ./utils/train_utils.py, and exclude them from the train_names parameters to perform cross validation.

Visualize the training

To monitor the training on the fly using tensorboard, run the following code in terminal:

tensorboard --port 6006 --logdir summary_path --host 127.0.0.1

The following figures are examples of learning curves for the training of class 0, Bldg.

Make predictions

Modify the save_path parameter of saver.restore() in inference.py to the path of the last checkpoint and change the class_type in ./hypes/hypes.json to the desired class type to generate predictions:

python inference.py |& tee test_output.txt

All the print out will be saved in test_output.txt. The predictions will be saved in a CSV file ./submission/class_{class_type}.csv.

Merge submission and submit

To merge the prediction files of all classes (e.g. ./submission/class_0.csv for class 0), run the following in terminal:

python merge_submission.py

A few errors of non-noded intersection were encountered during my submission. This can be fixed by running python topology_exception.py for each of the error. The script topology_exception.py will create a hole around the point, which can be found from the error message. You could also run the following in a python console:

repair_topology_exception('submission/valid_submission.csv', 
                           precision=6, 
                           image_id='6100_0_2',
                           n_class=4,
                           point= (0.0073326816112855523, -0.0069418340919529765),
                           side=1e-4)

Results

The online evaluation returns a score of 0.44, which would rank No. 7 on the private leaderboard. The following figures show the comparison between the true label and predicted label.

Remarks

The model was primarily developed based on performance on class 0 (Bldg). It can be further improved for other classes, by customizing model parameters for each class.
Check out my medium post for more details about this project.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].

Stars: ✭ 117

Visit Git Page 🔗Visit User Page 🔗Visit Issues Page (1) 🔗