7 DAC-GDAC data-management
This chapter describes the data management organization between Argo DACs and GDACS.
7.1 File submission from DAC to GDACs
Each DAC regularly submits all its new files to both USGODAE and Coriolis GDACs.On both GDACs, each DAC has an ftp account with:
a submit directory to submit files
a reject directory that contains the submitted files that were rejected by GDACs files format checker.
Seven types of files are accepted on GDAC:
A float metadata file
A float trajectory file
A float technical data file
A float’s cycle file
The DAC’s geylist
A removal file
A compressed file containing a series of above files
Each GDAC checks the file format. If agreed, the file is pushed on the GDAC ftp server or processed. Otherwise, the file is moved in the reject directory, an error message is sent to the DAC contact point. Rejected files are kept in the reject directory for one month at least.
7.2 Archived DOI datasets
Once a month, a snapshot of the whole Argo dataset is archived. This snapshot is added to the collection of monthly snapshots available with Argo GDAC DOI:
- Argo float data and metadata from Global Data Assembly Centre (Argo GDAC) https://doi.org/10.17882/42182
A digital object identifier (DOI) is a unique identifier for an electronic document or a dataset. Argo data-management assigns DOIs to its documents and datasets for two main objectives:
Citation: in a publication the DOI is efficiently tracked by bibliographic surveys
Traceability: the DOI is a direct and permanent link to the document or data set used in a publication
Monthly snapshot file naming convention and content
YYYYMM-ArgoData.tar.gz
- YYYYMM year and month
Example : 201906-ArgoData.tar.gz is the June 2019 snapshot of Argo data. The snapshot file has to be unzipped (gunzip file.tar.gz) and untared (tar xvf file.tar) in two directories: dac and geo.
dac directory : content of the GDAC “dac” branch
xxx_core.tar.gz xxx dac core Argo files
xxx_bgc.tar.gz xxx dac bgc-Argo files
argo_bio-profile_index.txt.gz bgc Argo profiles index file
argo_bio-traj_index.txt.gz bgc Argo trajectories file
argo_merge-profile_index.txt.gz merged Argo profiles index
argo_profile_detailled_index.txt.gz Argo profiles detailed index
argo_synthetic-profile_index.txt.gz Synthetic Argo profile files
ar_index_global_meta.txt.gz Index of metadata files
ar_index_global_prof.txt.gz Index of profile files
ar_index_global_tech.txt.gz Index of technical files
ar_index_global_traj.txt.gz Index of trajectories
geo directory: content of the GDAC “geo” branch
One compressed file per day and per ocean
7.3 Compressed files data submission
A DAC can push to GDAC a compressed file containing a series of files. The GDAC will process all its content. This is useful to submit an important batch of files (example : delayed mode data). The compressed file is a tar-ed file or directory compressed with gzip.
Compressed file naming convention
XXX.tar.gz*
XXX : the compressed file name, with no specific requirement
tar : tar suffix
gz : gzip suffix
Example: coriolis-201210-DelayedMode.tar.gz