Metadata-Version: 2.4
Name: maite-datasets
Version: 0.0.4
Summary: A collection of Image Classification and Object Detection task datasets conforming to the MAITE protocol.
Author-email: Andrew Weng <andrew.weng@ariacoustics.com>, Ryan Wood <ryan.wood@ariacoustics.com>, Shaun Jullens <shaun.jullens@ariacoustics.com>
License-Expression: MIT
License-File: LICENSE
Classifier: Development Status :: 4 - Beta
Classifier: Framework :: Pytest
Classifier: License :: OSI Approved :: MIT License
Classifier: Operating System :: OS Independent
Classifier: Programming Language :: Python :: 3 :: Only
Classifier: Programming Language :: Python :: 3.9
Classifier: Programming Language :: Python :: 3.10
Classifier: Programming Language :: Python :: 3.11
Classifier: Programming Language :: Python :: 3.12
Requires-Python: >=3.9
Requires-Dist: defusedxml>=0.7.1
Requires-Dist: numpy>=1.24.2
Requires-Dist: pillow>=10.3.0
Requires-Dist: requests>=2.32.3
Requires-Dist: typing-extensions>=4.12
Provides-Extra: tqdm
Requires-Dist: tqdm>=4.66; extra == 'tqdm'
Description-Content-Type: text/markdown

# MAITE Datasets

MAITE Datasets are a collection of public datasets wrapped in a [MAITE](https://mit-ll-ai-technology.github.io/maite/) compliant format.

## Installation

To install and use `maite-datasets` you can use pip:

```bash
pip install maite-datasets
```

For status bar indicators when downloading, you can include the extra `tqdm` when installing:

```bash
pip install maite-datasets[tqdm]
```

## Available Datasets

| Task           | Dataset          | Description                                                         |
|----------------|------------------|---------------------------------------------------------------------|
| Classification | CIFAR10          | [CIFAR10](https://www.cs.toronto.edu/~kriz/cifar.html) dataset.     |
| Classification | MNIST            | A dataset of hand-written digits.                                   |
| Classification | Ships            | A dataset that focuses on identifying ships from satellite images.  |
| Detection      | AntiUAVDetection | A UAV detection dataset in natural images with varying backgrounds. |
| Detection      | MILCO            | A side-scan sonar dataset focused on mine-like object detection.    |
| Detection      | Seadrone         | A UAV dataset focused on open water object detection.               |
| Detection      | VOCDetection     | [Pascal VOC](http://host.robots.ox.ac.uk/pascal/VOC/) dataset.      |

## Usage

Here is an example of how to import MNIST for usage with your workflow.

```python
>>> from maite_datasets.image_classification import MNIST

>>> mnist = MNIST(root="data", download=True)
>>> print(mnist)
MNIST Dataset
-------------
    Corruption: None
    Transforms: []
    Image_set: train
    Metadata: {'id': 'MNIST_train', 'index2label': {0: 'zero', 1: 'one', 2: 'two', 3: 'three', 4: 'four', 5: 'five', 6: 'six', 7: 'seven', 8: 'eight', 9: 'nine'}, 'split': 'train'}
    Path: /home/user/maite-datasets/data/mnist
    Size: 60000

>>> print("tuple("+", ".join([str(type(t)) for t in mnist[0]])+")")
tuple(<class 'numpy.ndarray'>, <class 'numpy.ndarray'>, <class 'dict'>)
```

## Additional Information

For more information on the MAITE protocol, check out their [documentation](https://mit-ll-ai-technology.github.io/maite/).

## Acknowledgement

### CDAO Funding Acknowledgement

This material is based upon work supported by the Chief Digital and Artificial
Intelligence Office under Contract No. W519TC-23-9-2033. The views and
conclusions contained herein are those of the author(s) and should not be
interpreted as necessarily representing the official policies or endorsements,
either expressed or implied, of the U.S. Government.
