Clusters of atmopsheric and oceanic variables and teleconnections that are candidate drivers for Tropical Cyclogenesis

doi:10.26050/WDCC/CLINT_TC

Dainelli, Filippo

ExperimentDOI
Summary
This project provides the dataset employed for the development of a machine learning framework designed to detect and interpret Tropical Cyclone Genesis (TCG) activity across six major tropical ocean basins: North Atlantic, Northeast Pacific, Northwest Pacific, North Indian, South Indian, and South Pacific.
The dataset includes pre-processed environmental and climatic variables relevant to TCG dynamics, aggregated at the basin level with monthly resolution from January 1980 to December 2022. All data are derived from the ERA5 reanalysis dataset, with a spatial resolution of 2.5° × 2.5°. ERA5 reanalysis data were accessed through the DKRZ data pool, made available by DKRZ Data Management. The atmospheric and oceanic variables provided are absolute vorticity at 850 hPa, maximum potential intensity (MPI), mean sea level pressure (MSLP), relative humidity at 700 hPa, sea surface temperature (SST), relative vorticity at 850 hPa, vertical wind shear between 850 and 200 hPa, and vertical velocity at 500 hPa. Several of these variables are derived from ERA5 primary variables and represent physically meaningful diagnostics used widely in tropical cyclone research. To reduce spatial dimensionality, each variable has been clustered within each basin using the K-means algorithm, and the area-weighted mean value of each cluster is reported as a time series.
Additionally, the dataset includes monthly values of a suite of large-scale climate indices known to influence tropical cyclone activity: Atlantic Meridional Mode (AMM), Niño3.4, North Atlantic Oscillation (NAO), Pacific Decadal Oscillation (PDO), Pacific-North American Pattern (PNA), Southern Oscillation Index (SOI), Tropical Northern Atlantic Index (TNA), Tropical Southern Atlantic Index (TSA), and the Western Pacific Index (WP).
Lastly, for each basin, the dataset contains monthly counts of tropical cyclogenesis events, enabling evaluation of predictive models and interpretability methods.
This dataset is intended to support research in seasonal TCG detection, and it enables reproducibility of the methods developed in the associated study.
Project
CLINT (Climate Intelligence)
Contact
Filippo Dainelli (
 dainelli.filippo@nullgmail.com
)
Spatial Coverage
Longitude 0 to 360 Latitude -40 to 40
Temporal Coverage
1980-01-01 to 2022-12-31
Use constraints
Creative Commons Attribution 4.0 International (https://creativecommons.org/licenses/by/4.0/)
Data Catalog
World Data Center for Climate
Size
26.45 MiB (27735252 Byte)
Format
NetCDF
Status
completely archived
Creation Date
Review Date
2025-10-02
Cite as
Dainelli, Filippo (2025). Clusters of atmopsheric and oceanic variables and teleconnections that are candidate drivers for Tropical Cyclogenesis. World Data Center for Climate (WDCC) at DKRZ. https://doi.org/10.26050/WDCC/CLINT_TC

BibTeX RIS
Funding
European Commission - Horizon 2020 Framework Programme
Grant/Award No: 101003876 - CLImate INTelligence: Extreme events detection, attribution and adaptation design using machine learning
Description
Summary:
Findable: 6 of 7 level;
Accessible: 3 of 7 level;
Interoperable: 6 of 6 level;
Reusable: 5 of 6 level
Method
F-UJI WDCC service v3.5.0 metrics_v0.8
Method Description
Checks performed by WDCC. Metrics documentation: https://doi.org/10.5281/zenodo.15045911 Metric Version: metrics_v0.8
Method Url
Result Date
2025-10-29
Result Date
2025-10-22
Description
1. Number of data sets is correct and > 0: passed;
2. Size of every data set is > 0: passed;
3. The data sets and corresponding metadata are accessible: passed;
4. The data sizes are controlled and correct: passed;
5. The spatial-temporal coverage description (metadata) is consistent to the data: passed;
6. The format is correct: passed;
7. Variable description and data are consistent: passed
Method
WDCC-TQA checklist
Method Description
Checks performed by WDCC. The list of TQA metrics are documented in the 'WDCC User Guide for Data Publication' Chapter 8.1.1
Method Url
Result Date
2025-10-22
Contact typePersonORCIDOrganization
-
-
-
-

Attached Datasets ( 6 )

Details for selected entry

Additional Info

Details for selected entry

Parent project(s)

Climate Intelligence

[Entry acronym: CLINT_TC] [Entry id: 5311644]