source: trunk/libIGCM/AA_pack_output @ 1468

Last change on this file since 1468 was 1468, checked in by mafoipsl, 6 years ago

On curie : set output files for all jobs.

On irene :

  • set output files for all jobs like curie
  • add the possibility to use xlarge nodes (for free) for post-processing or skylake nodes
  • add a question to set project used for post-processing and delete -A option in libIGCM_sys_irene
  • display only available projects for computing or post-processing on irene
  • Property licence set to
    The following licence information concerns ONLY the libIGCM tools
    ==================================================================

    Copyright © Centre National de la Recherche Scientifique CNRS
    Commissariat à l'Énergie Atomique CEA

    libIGCM : Library for Portable Models Computation of IGCM Group.

    IGCM Group is the french IPSL Global Climate Model Group.

    This library is a set of shell scripts and functions whose purpose is
    the management of the initialization, the launch, the transfer of
    output files, the post-processing and the monitoring of datas produce
    by any numerical program on any plateforme.

    This software is governed by the CeCILL license under French law and
    abiding by the rules of distribution of free software. You can use,
    modify and/ or redistribute the software under the terms of the CeCILL
    license as circulated by CEA, CNRS and INRIA at the following URL
    "http://www.cecill.info".

    As a counterpart to the access to the source code and rights to copy,
    modify and redistribute granted by the license, users are provided only
    with a limited warranty and the software's author, the holder of the
    economic rights, and the successive licensors have only limited
    liability.

    In this respect, the user's attention is drawn to the risks associated
    with loading, using, modifying and/or developing or reproducing the
    software by the user in light of its specific status of free software,
    that may mean that it is complicated to manipulate, and that also
    therefore means that it is reserved for developers and experienced
    professionals having in-depth computer knowledge. Users are therefore
    encouraged to load and test the software's suitability as regards their
    requirements in conditions enabling the security of their systems and/or
    data to be ensured and, more generally, to use and operate it in the
    same conditions as regards security.

    The fact that you are presently reading this means that you have had
    knowledge of the CeCILL license and that you accept its terms.
  • Property svn:keywords set to Revision Author Date
File size: 13.1 KB
Line 
1#-Q- curie ######################
2#-Q- curie ## CURIE   TGCC/CEA ##
3#-Q- curie ######################
4#-Q- curie #MSUB -r PACKOUTPUT     # Nom du job
5#-Q- curie #MSUB -o PACKOUTPUT.out_%I
6#-Q- curie #MSUB -e PACKOUTPUT.out_%I
7#-Q- curie #MSUB -n 1              # Reservation du processus
8#-Q- curie #MSUB -T 36000          # Limite de temps elapsed du job
9#-Q- curie #MSUB -q ::default_node::
10#-Q- curie #MSUB -c ::default_core::
11#-Q- curie #MSUB -Q normal
12#-Q- curie #MSUB -A ::default_project::
13#-Q- curie set +x
14#-Q- irene ######################
15#-Q- irene ## IRENE   TGCC/CEA ##
16#-Q- irene ######################
17#-Q- irene #MSUB -r PACKOUTPUT     # Job name
18#-Q- irene #MSUB -o PACKOUTPUT.out_%I
19#-Q- irene #MSUB -e PACKOUTPUT.out_%I
20#-Q- irene #MSUB -n 1              # Number of cores
21#-Q- irene #MSUB -T 36000          # Maximum elapsed time
22#-Q- irene #MSUB -q ::default_node::
23#-Q- irene #MSUB -c ::default_core::
24#-Q- irene #MSUB -Q normal
25#-Q- irene #MSUB -A ::default_post_project::
26#-Q- irene #MSUB -m store,work,scratch
27#-Q- irene set +x
28#-Q- ada #!/bin/ksh
29#-Q- ada #######################
30#-Q- ada ## ADA         IDRIS ##
31#-Q- ada #######################
32#-Q- ada # @ job_type = mpich
33#-Q- ada # @ requirements = (Feature == "prepost")
34#-Q- ada # Temps Elapsed max. d'une requete hh:mm:ss
35#-Q- ada # @ wall_clock_limit = 10:00:00
36#-Q- ada # Memory required for ncrcat
37#-Q- ada # @ as_limit = 30Gb
38#-Q- ada # Nom du travail LoadLeveler
39#-Q- ada # @ job_name   = PACKOUTPUT
40#-Q- ada # Fichier de sortie standard du travail
41#-Q- ada # @ output     = $(job_name).$(jobid)
42#-Q- ada # Fichier de sortie d'erreur du travail
43#-Q- ada # @ error      =  $(job_name).$(jobid)
44#-Q- ada # pour recevoir un mail en cas de depassement du temps Elapsed (ou autre pb.)
45#-Q- ada # @ notification = error
46#-Q- ada # @ environment  = $DEBUG_debug ; $BigBrother ; $postProcessingStopLevel ; $MODIPSL ; $libIGCM ; $libIGCM_SX ; $POST_DIR ; $Script_Post_Output ; $SUBMIT_DIR ; $DateBegin ; $DateEnd ; $PeriodPack ; $StandAlone ; $MASTER ; wall_clock_limit=$(wall_clock_limit)
47#-Q- ada # @ queue
48#-Q- lxiv8 ######################
49#-Q- lxiv8 ## OBELIX      LSCE ##
50#-Q- lxiv8 ######################
51#-Q- lxiv8 #PBS -N PACKOUTPUT
52#-Q- lxiv8 #PBS -m a
53#-Q- lxiv8 #PBS -j oe
54#-Q- lxiv8 #PBS -q medium
55#-Q- lxiv8 #PBS -o PACKOUTPUT.$$
56#-Q- lxiv8 #PBS -S /bin/ksh
57#-Q- ifort_CICLAD ######################
58#-Q- ifort_CICLAD ##   CICLAD    IPSL ##
59#-Q- ifort_CICLAD ######################
60#-Q- ifort_CICLAD #PBS -N PACKOUTPUT
61#-Q- ifort_CICLAD #PBS -m a
62#-Q- ifort_CICLAD #PBS -j oe
63#-Q- ifort_CICLAD #PBS -q std
64#-Q- ifort_CICLAD #PBS -S /bin/ksh
65#-Q- default #!/bin/ksh
66#-Q- default ##################
67#-Q- default ## DEFAULT HOST ##
68#-Q- default ##################
69
70#**************************************************************
71# Author: Sebastien Denvil
72# Contact: Sebastien.Denvil__at__ipsl.jussieu.fr
73# $Revision::                                          $ Revision of last commit
74# $Author::                                            $ Author of last commit
75# $Date::                                              $ Date of last commit
76# IPSL (2006)
77#  This software is governed by the CeCILL licence see libIGCM/libIGCM_CeCILL.LIC
78#
79#**************************************************************
80
81#set -eu
82#set -vx
83
84date
85
86#D- Task type DO NOT CHANGE (computing, post-processing or checking)
87TaskType=post-processing
88
89########################################################################
90
91#D- Flag to determine if this job in a standalone mode
92#D- Default : value from AA_job if any
93StandAlone=${StandAlone:=true}
94
95#D- Path to libIGCM
96#D- Default : value from AA_job if any
97# WARNING For StandAlone use : To run this script on some machine (ulam and cesium)
98# WARNING you must check MirrorlibIGCM variable in sys library.
99# WARNING If this variable is true, you must use libIGCM_POST path instead
100# WARNING of your running libIGCM directory.
101libIGCM=${libIGCM:=::modipsl::/libIGCM}
102
103#-D- $hostname of the MASTER job when SUBMIT_DIR is not visible on postprocessing computer.
104MASTER=${MASTER:=ada|curie}
105
106#D- Flag to determine begin date for restart pack
107#D- Default : value from AA_job if any
108DateBegin=${DateBegin:=20000101}
109
110#D- Flag to determine end date for restart pack
111#D- Default : value from AA_job if any
112DateEnd=${DateEnd:=20691231}
113
114#D- Flag to determine pack period
115#D- Default : value from AA_job if any
116PeriodPack=${PeriodPack:=10Y}
117
118#D- Uncomment to run interactively
119#D- For testing purpose, will be remove
120#SUBMIT_DIR=${PWD}
121#RUN_DIR_PATH=${SCRATCHDIR}/Pack_Test
122
123#D- Increased verbosity (1, 2, 3)
124#D- Default : value from AA_job if any
125Verbosity=${Verbosity:=3}
126
127#D- Low level debug : to bypass lib test checks and stack construction
128#D- Default : value from AA_job if any
129DEBUG_debug=${DEBUG_debug:=false}
130
131########################################################################
132
133. ${libIGCM}/libIGCM_debug/libIGCM_debug.ksh
134. ${libIGCM}/libIGCM_card/libIGCM_card.ksh
135. ${libIGCM}/libIGCM_date/libIGCM_date.ksh
136#-------
137. ${libIGCM}/libIGCM_sys/libIGCM_sys.ksh
138. ${libIGCM}/libIGCM_config/libIGCM_config.ksh
139. ${libIGCM}/libIGCM_post/libIGCM_post.ksh
140#-------
141RUN_DIR=${RUN_DIR_PATH}
142IGCM_sys_MkdirWork ${RUN_DIR}
143IGCM_sys_Cd ${RUN_DIR}
144#-------
145( ${DEBUG_debug} ) && IGCM_debug_Check
146( ${DEBUG_debug} ) && IGCM_card_Check
147( ${DEBUG_debug} ) && IGCM_date_Check
148
149########################################################################
150
151#set -vx
152
153# ------------------------------------------------------------------
154# Test if all was right before proceeding further
155# ------------------------------------------------------------------
156IGCM_debug_Verif_Exit
157
158if [ ${StandAlone} = true ] ; then
159    CARD_DIR=${SUBMIT_DIR}
160else
161    CARD_DIR=${RUN_DIR_PATH}
162    IGCM_sys_Get_Master ${SUBMIT_DIR}/config.card ${RUN_DIR_PATH}
163    IGCM_sys_Get_Master ${SUBMIT_DIR}/run.card    ${RUN_DIR_PATH}
164    IGCM_sys_Get_Master ${SUBMIT_DIR}/COMP        ${RUN_DIR_PATH}
165    IGCM_sys_Get_Master ${SUBMIT_DIR}/POST        ${RUN_DIR_PATH}
166fi
167
168#==================================
169# First of all
170#
171# Read libIGCM compatibility version in config.card
172# Read UserChoices section
173# Read Ensemble section
174# Read Post section
175# Define all netcdf output directories
176#==================================
177IGCM_config_CommonConfiguration ${CARD_DIR}/config.card
178
179# ------------------------------------------------------------------
180# Activate BigBrother so as to supervise this job
181# ------------------------------------------------------------------
182IGCM_debug_BigBro_Initialize
183
184#==================================
185# Read ListOfComponents section
186# to drive the loop over find
187IGCM_card_DefineArrayFromSection ${CARD_DIR}/config.card ListOfComponents
188
189#==================================
190# Test and set up directories
191#==================================
192IGCM_sys_TestDirArchive ${R_SAVE}
193[ $? != 0 ] && IGCM_debug_Exit "IGCM_sys_TestDirArchive"
194
195# Where to store used file list /!\ TEMPORARY /!\
196STORE_DEBUG=${R_SAVE}/DEBUG
197
198# Switch to script variables meaning (try to be compatible with ipsl_pack TGCC moving procedure)
199JobName=${config_UserChoices_JobName}
200echo $JobName $DateBegin $DateEnd
201
202# ------------------------------------------------------------------
203# Test if all was right before proceeding further
204# ------------------------------------------------------------------
205IGCM_debug_Verif_Exit
206
207IGCM_debug_Print 1 "Check coherence between PackFrequency and PeriodLength"
208IGCM_post_CheckModuloFrequency PeriodPack config_UserChoices_PeriodLength NbPeriodPerFrequency
209# ------------------------------------------------------------------
210# Test if all was right before proceeding further
211# ------------------------------------------------------------------
212IGCM_debug_Verif_Exit
213
214IGCM_debug_Print 1 "We must process ${NbPeriodPerFrequency} files for each pack"
215
216# Init loop
217date_begin_pack=${DateBegin}
218date_end_simulation=${DateEnd}
219number_pack=1
220
221IGCM_debug_PrintVariables 3 date_begin_pack
222IGCM_debug_PrintVariables 3 date_end_simulation
223
224while [ ${date_begin_pack} -le ${date_end_simulation} ] ; do
225
226  IGCM_debug_PrintVariables 3 number_pack
227  DaysTemp=$( IGCM_date_DaysInCurrentPeriod ${date_begin_pack} ${PeriodPack} )
228  date_end_pack=$( IGCM_date_AddDaysToGregorianDate ${date_begin_pack} $(( ${DaysTemp} - 1 )) )
229
230  for comp in ${config_ListOfComponents[*]} ; do
231    dirList=$( find ${R_BUFR}/${comp}/Output -maxdepth 1 -mindepth 1 -type d )
232    for dir in ${dirList} ; do
233      # dirID is like ATM.Output.MO
234      dirID=$( echo $dir | sed "s:${R_BUFR}/::" | sed "s:/:.:g" )
235      # Sort what's in the directory
236      find ${dir} -type f -name "${JobName}*.nc" -ls | sort -k 11 > liste_files.${dirID}.txt
237      # How much file type. Example : 1M_histmthCOSP.nc, 1M_histmth.nc, 1M_histmthNMC.nc, 1M_paramLMDZ_phy.nc
238      # /!\ fileType include the .nc extension /!\
239      fileType=$( gawk '{print $11}' liste_files.${dirID}.txt | gawk -F$dir/ '{print $2}' | sed "s:${JobName}_[0-9]\{8,9\}_[0-9]\{8,9\}_::g" | sort | uniq )
240      # Loop over the file type and pack them when in between date_begin_pack and date_end_pack
241      for myType in ${fileType} ; do
242        grep ${myType} liste_files.${dirID}.txt > liste_files.${dirID}.${myType}.txt
243        nbfile=0
244        for file in $( gawk '{print $11}' liste_files.${dirID}.${myType}.txt ); do
245          extract_date_file=$( echo ${file}  | sed -e "s/.*${JobName}_[0-9]*_//" )
246          date_file=$( echo ${extract_date_file} | sed 's/\([0-9]\{8\}\)_.*$/\1/g' )
247          # echo pack n°${number_pack}  ${date_file} ${date_begin_pack} ${date_end_pack}
248          if [ ${date_file} -le ${date_end_pack} ] && [ ${date_file} -ge ${date_begin_pack} ] ; then
249            echo ${file} >> liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt
250            ncdump -h ${file} | grep -E 'float|double' | cut -f 1 -d '(' | cut -f 2 -d ' ' >> liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt
251            (( nbfile = nbfile + 1 ))
252          fi
253        done
254
255        if [ ${nbfile} = 0 ] ; then
256          IGCM_debug_Print 1 "We found no file to process"
257          IGCM_debug_Print 1 "We should have found ${NbPeriodPerFrequency} files"
258          IGCM_debug_Print 1 "As some files can be produced only for some selected period we consider we can move to the next file type"
259          continue
260        fi
261
262        # Select list of variables to work with
263        list_var=$( cat liste_variables_${myType}_${date_begin_pack}_${date_end_pack}.txt | sort | uniq -c | awk -v nbfile=$nbfile '{if ($1 != nbfile) {print $2}}' | paste -s -d ',' )
264        liste_file_tmp=$( for i in $( cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ) ; do basename $i ; done )
265        # Create packed files
266        IGCM_debug_Print 1 "Ncrcat ongoing for ${dir} and ${myType}"
267        if [ ! ${nbfile} = ${NbPeriodPerFrequency} ] ; then
268          IGCM_debug_Print 1 "Number of files to process is not equal to what it should be"
269          IGCM_debug_Print 1 "We found ${nbfile} files and it should have been ${NbPeriodPerFrequency} files"
270          IGCM_debug_Exit "ERROR in number of files to process. STOP HERE INCLUDING THE COMPUTING JOB"
271          IGCM_debug_Verif_Exit
272        fi
273        output=${JobName}_${date_begin_pack}_${date_end_pack}_${myType}
274        #cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs ncrcat -v ${list_var} -o ${output}
275        if [ X${list_var} = X ] ; then
276          IGCM_sys_ncrcat -p ${dir} ${liste_file_tmp} --output ${output}
277        else
278          IGCM_sys_ncrcat -x -v ${list_var} -p ${dir} ${liste_file_tmp} --output ${output}
279        fi
280        # ------------------------------------------------------------------
281        # Test if all was right before proceeding further
282        # ------------------------------------------------------------------
283        IGCM_debug_Verif_Exit
284        # Save it
285        IGCM_sys_Put_Out ${output} ${R_SAVE}/$( echo $dir | sed "s:${R_BUFR}/::" )/${output}
286        # Clean file produced by ncrcat
287        IGCM_sys_Rm ${output}
288        # ------------------------------------------------------------------
289        # Test if all was right before proceeding further
290        # ------------------------------------------------------------------
291        IGCM_debug_Verif_Exit
292        # Clean files used by ncrcat
293        cat liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt | xargs rm
294        # Save the list of files that has been pack (ncrcat)
295        #mv liste_pack_${myType}_${date_begin_pack}_${date_end_pack}.txt ${STORE_DEBUG}
296        IGCM_debug_Print 1 "Ncrcat and cleaning done for ${dir} and ${myType}"
297        echo
298      done
299    done
300  done
301  (( number_pack = number_pack + 1 ))
302  # Add 1 day to date_end_pack to have the new date_begin_pack
303  date_begin_pack=$( IGCM_date_AddDaysToGregorianDate ${date_end_pack} 1 )
304done
305
306# Flush post-processing submission
307if [ -f ${R_BUFR}/FlushPost_${DateEnd}.ksh ] ; then
308  . ${R_BUFR}/FlushPost_${DateEnd}.ksh
309  IGCM_FlushPost
310  #IGCM_sys_Rm -f ${R_BUFR}/FlushPost_${DateEnd}.ksh
311fi
312
313# Clean RUN_DIR_PATH (necessary for cesium and titane only)
314IGCM_sys_RmRunDir -Rf ${RUN_DIR_PATH}
315
316# ------------------------------------------------------------------
317# Finalize BigBrother to inform that the jobs end
318# ------------------------------------------------------------------
319IGCM_debug_BigBro_Finalize
320
321date
Note: See TracBrowser for help on using the repository browser.