Sentinelhub grant: Sponsoring request ID 1c081a - Towards a Smart Eco-epidemiological Model of Dengue in Colombia using Satellite in Collaboration with MIT Critical Data Colombia.

Project proposal (READ MORE)

Project Organisation: MIT Critical Data Colombia.

Model Description

Summary

Here below find all the dataset's versions and descriptions. The dataset employed in this study spans 81 municipalities in Colombia from 2016 to 2018, resulting in a total of 12,636 satellite images. The metadata associated with these images encompasses static sociodemographic variables, indices of poverty, and access to education. Additionally, dynamic epidemiological and climatic metadata were collected, providing a comprehensive spatiotemporal context for the analyses.

Baseline method from satellite extractor: Raw data from Sentinel 2LC1 with recursive artifact removal, clouds removal based on LeastCC and Nearest Interpolation for spatial resolution.

Reading data

The data can be read as (example):

from datasets import load_dataset

dataset = load_dataset("MITCriticalData/Unlabeled_top_10_cities_forward_backward_alg")

Alternatively, for datasets<1.11.0 the lecture of .tiff and .json files is not compatible. In such case we recommend to download the data as:

wget path_to_data/images.zip && unzip images.zip -d images/

wget path_to_data/annotations.zip && unzip annotations.zip -d annotations/

Licensing Information

The dataset is released under the terms of MIT. By using this, you are also bound to the respective Terms of Use and License of the original source.

Author & Mantainer

Sebastián Andrés Cajas Ordóñez

Acknowledgements

This work is supported in part by Oracle for Research through Oracle Cloud Credits and related resources provided by Oracle for Research, as well as the European Space Agency’s Network of Resources Initiative under the sponsoring request ID 1c081a, and MISTI-Colombia, Cali seed fund. LAC is funded by the National Institute of Health through R01 EB017205, DS-I Africa U54 TW012043-01 and Bridge2AI OT2OD032701, and the National Science Foundation through ITEST 2148451.

Contributors

MIT Critical data Colombia Team: Sebastian A. Cajas, David Restrepo, Kuan-Ting Kuo, Dana Moukheiber, Atika Rahman Paddo, Saptarshi Purkayastha, Leo Anthony Celi, Po-Chih Kuo, Juan Sebastián Osorio-Valencia, Kuan-Ting Ku, Braiam Escobar, Diego M. López, Cheng Che Tsai, Wilson Arbey Diaz, Luis Jesús Martínez, Alessa Álvarez, Siyi Tang, Amara Tariq, Imon Banerjee, Aakanksha Rana, Maria Patricia Arbelaez-Montoy, Cheng Che Tsai, Laura Sofía Daza Rosero, Jhon Fredy Romero Núñez, Wilson Arbey Diaz, Luis Jesús Martínez, Saketh Sundar, Alessa Álvarez, Siyi Tang, Amara Tariq, Imon Banerjee, Aakanksha Rana, Ivan Darío Velez, Maria Patricia Arbelaez-Montoya.

Citation

Please cite our work if you find the resources in this repository useful:

@article{cajasmulti,
  title={A Multi-Modal Satellite Imagery Dataset for Public Health Analysis in Colombia},
  author={Cajas, Sebastian A and Restrepo, David and Moukheiber, Dana and Kuo, Kuan Ting and Wu, Chenwei and Chicangana, David Santiago Garcia and Paddo, Atika Rahman and Moukheiber, Mira and Moukheiber, Lama and Moukheiber, Sulaiman and others}
}

@article{kuo2024denguenet,
  title={DengueNet: Dengue Prediction using Spatiotemporal Satellite Imagery for Resource-Limited Countries},
  author={Kuo, Kuan-Ting and Moukheiber, Dana and Ordonez, Sebastian Cajas and Restrepo, David and Paddo, Atika Rahman and Chen, Tsung-Yu and Moukheiber, Lama and Moukheiber, Mira and Moukheiber, Sulaiman and Purkayastha, Saptarshi and others},
  journal={arXiv preprint arXiv:2401.11114},
  year={2024}
}

@article{moukheiber2024multimodal,
  title={A multimodal framework for extraction and fusion of satellite images and public health data},
  author={Moukheiber, Dana and Restrepo, David and Cajas, Sebasti{\'a}n Andr{\'e}s and Montoya, Mar{\'\i}a Patricia Arbel{\'a}ez and Celi, Leo Anthony and Kuo, Kuan-Ting and L{\'o}pez, Diego M and Moukheiber, Lama and Moukheiber, Mira and Moukheiber, Sulaiman and others},
  journal={Scientific Data},
  volume={11},
  number={1},
  pages={634},
  year={2024},
  publisher={Nature Publishing Group UK London}
}