7  DAC-GDAC data-management

This chapter describes how data management is organized between the Argo DACs and the GDACs.

7.1 File submission from DAC to GDACs

Each DAC regularly submits all its new files to both the USGODAE and Coriolis GDACs. On both GDACs, each DAC has an ftp account with:

  • a submit directory, where the DAC deposits its files

  • a reject directory, which holds the submitted files that were rejected by the GDAC file format checker.

Seven types of files are accepted by the GDACs:

  • A float metadata file

  • A float trajectory file

  • A float technical data file

  • A float’s cycle file

  • The DAC’s greylist

  • A removal file

  • A compressed file containing a series of the above files

Each GDAC checks the format of every submitted file. If the format is valid, the file is pushed to the GDAC ftp server or processed. Otherwise, the file is moved to the reject directory and an error message is sent to the DAC contact point. Rejected files are kept in the reject directory for at least one month.
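
The accept-or-reject routing described above can be sketched in a few lines of Python. This is only an illustration, not the GDAC implementation: the function name `process_submissions`, the `publish_dir` argument and the `is_valid` callback (standing in for the GDAC file format checker) are hypothetical, and the notification of the DAC contact point is left out.

```python
import shutil
from pathlib import Path
from typing import Callable, List, Tuple


def process_submissions(
    submit_dir: Path,
    reject_dir: Path,
    publish_dir: Path,
    is_valid: Callable[[Path], bool],
) -> Tuple[List[str], List[str]]:
    """Route each submitted file to publish_dir (valid) or reject_dir (invalid).

    is_valid is a placeholder for the format checker (assumption).
    Returns the names of accepted and rejected files.
    """
    accepted: List[str] = []
    rejected: List[str] = []
    for path in sorted(submit_dir.iterdir()):
        if not path.is_file():
            continue  # ignore sub-directories in the submit area
        if is_valid(path):
            shutil.move(str(path), str(publish_dir / path.name))
            accepted.append(path.name)
        else:
            # A real system would also send an error message to the DAC here.
            shutil.move(str(path), str(reject_dir / path.name))
            rejected.append(path.name)
    return accepted, rejected
```

The checker is passed in as a callback so that the routing logic can be exercised with any validation rule.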

7.2 Archived DOI datasets

Once a month, a snapshot of the whole Argo dataset is archived. This snapshot is added to the collection of monthly snapshots available with Argo GDAC DOI:

A digital object identifier (DOI) is a unique identifier for an electronic document or a dataset. Argo data-management assigns DOIs to its documents and datasets for two main objectives:

  • Citation: in a publication the DOI is efficiently tracked by bibliographic surveys

  • Traceability: the DOI is a direct and permanent link to the document or data set used in a publication

Monthly snapshot file naming convention and content

YYYYMM-ArgoData.tar.gz

  • YYYYMM: year and month

Example: 201906-ArgoData.tar.gz is the June 2019 snapshot of Argo data. The snapshot file has to be unzipped (gunzip file.tar.gz) and untarred (tar xvf file.tar), which yields two directories: dac and geo.

  • dac directory: content of the GDAC “dac” branch

    • xxx_core.tar.gz: core Argo files of DAC xxx

    • xxx_bgc.tar.gz: BGC-Argo files of DAC xxx

    • argo_bio-profile_index.txt.gz: index of BGC-Argo profile files

    • argo_bio-traj_index.txt.gz: index of BGC-Argo trajectory files

    • argo_merge-profile_index.txt.gz: index of merged Argo profile files

    • argo_profile_detailled_index.txt.gz: detailed index of Argo profile files

    • argo_synthetic-profile_index.txt.gz: index of synthetic Argo profile files

    • ar_index_global_meta.txt.gz: index of metadata files

    • ar_index_global_prof.txt.gz: index of profile files

    • ar_index_global_tech.txt.gz: index of technical files

    • ar_index_global_traj.txt.gz: index of trajectory files

  • geo directory: content of the GDAC “geo” branch
    One compressed file per day and per ocean
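
As an illustration of the snapshot naming convention, the following Python sketch parses a snapshot file name and unpacks the archive. The function names `parse_snapshot_name` and `extract_snapshot` are hypothetical, and the code assumes that the file name follows the YYYYMM-ArgoData.tar.gz pattern exactly. Python's standard `tarfile` module performs the gunzip and tar steps in a single call.

```python
import re
import tarfile
from pathlib import Path
from typing import Tuple

# Pattern for the monthly snapshot name: YYYYMM-ArgoData.tar.gz
SNAPSHOT_RE = re.compile(r"^(\d{4})(\d{2})-ArgoData\.tar\.gz$")


def parse_snapshot_name(filename: str) -> Tuple[int, int]:
    """Return (year, month) encoded in a snapshot file name."""
    match = SNAPSHOT_RE.match(filename)
    if match is None:
        raise ValueError(f"not a snapshot file name: {filename!r}")
    year, month = int(match.group(1)), int(match.group(2))
    if not 1 <= month <= 12:
        raise ValueError(f"invalid month in {filename!r}")
    return year, month


def extract_snapshot(archive: Path, dest: Path) -> None:
    """Unzip and untar a snapshot into dest (expected to yield dac and geo).

    Only use this on archives obtained from a trusted source.
    """
    with tarfile.open(archive, "r:gz") as tar:
        tar.extractall(dest)
```

For example, `parse_snapshot_name("201906-ArgoData.tar.gz")` returns `(2019, 6)`.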

7.3 Compressed files data submission

A DAC can push to a GDAC a compressed file containing a series of files. The GDAC will process all of its content. This is useful for submitting a large batch of files (for example, delayed-mode data). The compressed file is a tar archive of files or of a directory, compressed with gzip.

Compressed file naming convention

XXX.tar.gz

  • XXX : the compressed file name, with no specific requirement

  • tar : tar suffix

  • gz : gzip suffix

Example: coriolis-201210-DelayedMode.tar.gz
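
A minimal Python sketch of how a DAC might build such a compressed file with the standard `tarfile` module is given below. The function name `build_submission_archive` is hypothetical, and storing the files at the top level of the archive (without their directory path) is an assumption rather than a stated GDAC requirement.

```python
import tarfile
from pathlib import Path
from typing import Iterable


def build_submission_archive(files: Iterable[Path], archive: Path) -> Path:
    """Pack the given files into a gzip-compressed tar archive."""
    if not archive.name.endswith(".tar.gz"):
        raise ValueError("archive name must end with .tar.gz")
    with tarfile.open(archive, "w:gz") as tar:
        for path in files:
            # Assumption: keep only the file name, not its directory path.
            tar.add(path, arcname=path.name)
    return archive
```

The resulting file can then be placed in the DAC's submit directory like any other submission.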