All Projects → microsoft → arcticseals

microsoft / arcticseals

Licence: MIT license
A deep learning project in cooperation with the NOAA Marine Mammal Lab to detect & classify arctic seals in aerial imagery to understand how they’re adapting to a changing world.

Programming Languages

Jupyter Notebook
11667 projects
python
139335 projects - #7 most used programming language
javascript
184084 projects - #8 most used programming language
C#
18002 projects
Cuda
1817 projects
C++
36643 projects - #6 most used programming language

Projects that are alternatives of or similar to arcticseals

landcover
Land Cover Mapping
Stars: ✭ 180 (+480.65%)
Mutual labels:  aiforearth
SpeciesClassification
AI for Earth Species Classification API
Stars: ✭ 81 (+161.29%)
Mutual labels:  aiforearth
AIforEarthDataSets
Notebooks and documentation for AI-for-Earth-managed datasets on Azure
Stars: ✭ 217 (+600%)
Mutual labels:  aiforearth
aerial wildlife detection
Tools for detecting wildlife in aerial images using active learning
Stars: ✭ 177 (+470.97%)
Mutual labels:  aiforearth
AIforEarth-API-Platform
The AI for Earth API Platform is a distributed infrastructure designed to provide a secure, scalable, and customizable API hosting, designed to handle the needs of long-running/asynchronous machine learning model inference. It is to be used with the AI For Earth API Framework (https://github.com/microsoft/AIforEarth-API-Development).
Stars: ✭ 43 (+38.71%)
Mutual labels:  aiforearth
planetary-computer-sdk-for-python
Planetary Computer SDK for Python
Stars: ✭ 59 (+90.32%)
Mutual labels:  aiforearth
AIforEarth-API-Development
This is an API Framework for AI models to be hosted locally or on the AI for Earth API Platform (https://github.com/microsoft/AIforEarth-API-Platform).
Stars: ✭ 72 (+132.26%)
Mutual labels:  aiforearth

Overview

This is the workspace for the a collaboration between Microsoft AI for Earth, NOAA Fisheries, and the University of Washington, aimed at automating the detection of arctic wildlife in aerial imagery.

Imagery

Imagery is now available publicly on lila.science, an open data repository for labeled images related to conservation biology.

alt text

Labels

The data directory contains the following label/metadata files:

  • train.csv (5,110 records): Hotspot detection data for which we have all corresponding imagery data (see below). Currently all of these hotspots refer to images in dataset ArcticSealsData01.
  • test.csv (1,314 records): Same format and distrbution of train.csv, suitable for cross-validation.

Each record in the CSV files refers to a hotspot that the NOAA thermal detection system picked up and that was classified by a human into either "Animal" (true positive) or "Anomaly" (false positive). Each hotspot is unique (no duplicates). The column schema is as follows:

  • hotspot_id: unique ID
  • timestamp: GMT/UTC timestamp (always corresponds to thermal image timestamp)
  • filt_thermal16: Filename of the 16-bit PNG containing the raw FLIR image data
  • filt_thermal8: Filename of the 8-bit JPG containing the annotated FLIR image data (hotspots circled)
  • filt_color: Filename of the 8-bit JPG containing a color image taken at or near the same time as the thermal image. The timestamp encoded in the filename may be different from the thermal timestamp by up to 60 seconds (but typically less than 1 second).
  • x_pos/y_pos: Location of the hotspot in the thermal image
  • thumb_*: Bounding box of the hotspot in the color image. NOTE: some of these values are negative, as the bounding box is always 512x512 even if the hotspot is at the edge of the image.
  • hotspot_type: "Animal" or "Anomaly"
  • species_id: "Bearded Seal", "Ringed Seal", "UNK Seal", "Polar Bear" or "NA" (for anomalies)

Raw Hotspot Data

In the data directory there is also a raw.csv (14,910 records) containing all hotspot detections from the NOAA 2016 survey flights (includes more seals but also more types of animals, more anomalies, hotspots marked as duplicates, etc.).

Code

The project is meant to accomodate many different approaches, frameworks, languages, etc. Linux is the primary supported dev environment, though some GUI tools are Windows-only.

Organization

Team members are welcome to add whatever code you like to this repo, but please follow these guidelines:

  • Put your source code in its own directory inside the src directory.
  • Add a README.md file to your code directory explaining what your code does and how it works.
  • If there are dependencies that need to be installed and/or build steps that need to be performed, add any necessary code to the build.bat script to run the relevant package manager commands, compile steps, etc., to ensure your code is fully runnable locally.
    • Alternatively, it is also ok if your code only builds from within an IDE; if so just make a note of that in your README.md.
  • If applicable, add a script that runs your code to the root directory. If it takes command line arguments, please show help text if it is run without arguments.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Note that the project description data, including the texts, logos, images, and/or trademarks, for each open source project belongs to its rightful owner. If you wish to add or remove any projects, please contact us at [email protected].