Changes between Version 54 and Version 55 of Doc/CheckDebug


Timestamp:
11/08/19 10:34:44 (5 years ago)
Author:
acosce
Comment:

--

  • Doc/CheckDebug

    v54 v55  
    9999# Analyzing the Job output : Script_Output # 
    100100Reminder --> This file contains three parts:  
    101  * copying the input files 
     101 * prepare parameters files, and copying the input files 
    102102 * running the model  
    103103 * post processing 
     
    108108####################################### 
    109109 
    110  1st part (copying the input files) 
     110 1st part (prepare parameters files, copying the input files) 
    111111 
    112112####################################### 
     
    130130   * you didn't specify `OnQueue` in the `run.card` file in case you wanted to continue the simulation;  
    131131   * one of the input files was missing (e.g. it doesn't exist, the machine has a problem,...);  
    132    * the frequencies (`RebuildFrequency`, `PackFrequency` ...) do not match `PeriodLength`. 
     132   * the frequencies (`PackFrequency` ...) do not match `PeriodLength`. 
     133   * `PeriodLength` does not match `DateBegin` and `DateEnd`. 
    133134 
    134135 * if the file ends in the middle of the second part, it's most likely because you didn't request enough memory or CPU time. 
     
    142143{{{ 
    143144======================================================================== 
    144 EXECUTION of : mpirun -f ./run_file > out_run_file 2>&1 
     145EXECUTION of : /usr/bin/time ccc_mprun -E-K1  -f ./run_file 
    145146Return code of executable : 1 
    146147IGCM_debug_Exit :  EXECUTABLE 
     148 
     149!!!!!!!!!!!!!!!!!!!!!!!!!! 
     150!!   ERROR TRIGGERED    !! 
     151!!   EXIT FLAG SET      !! 
     152!------------------------! 
     153 
     1540 - IGCM_debug_Exit (_0_) 
     155IGCM_sys_Mkdir : /path_of_your_simulation/Debug 
     156IGCM_sys_Cp : out_execution /path_of_your_simulation/Debug/JobName_PeriodDateBegin_PeriodDateEnd_out_execution_error 
     157======================================================================== 
     158 
     159}}} 
     160If the following message is displayed : 
     161{{{ 
     162======================================================================== 
     163EXECUTION of : mpirun -f ./run_file > out_run_file 2>&1 
     164======================================================================== 
     165}}} 
     166 
     167If a message indicates that the `restartphy.n` file doesn't exist, the model run stopped before the end date of your simulation. If this happens, you must refer to the output log of each model in your simulation. 
     168For example, the output file of the ocean model is stored on the file server under this name: 
     169{{{ 
     170IGCM_sys_Put_Out : ocean.output xxxxxxxx/OCE/Debug/xxxxxxxx_ocean.output 
     171}}} 
     172You can retrieve these output logs in the RUN_DIR directory of your simulation. 
     173 
     174[[NoteBox(tip,In general\, if your simulation stops you can look for the keyword "IGCM_debug_CallStack" in this file. This keyword will come after a line explaining the error you are experiencing., 600px)]] 
     175{{{ 
     176Example :  
     177 
     178--Debug1--> IGCM_comp_Update 
     179 
     180IGCM_debug_Exit :  IGCM_comp_Update missing executable create_etat0_limit.e 
    147181 
    148182!!!!!!!!!!!!!!!!!!!!!!!!!! 
    149183!! IGCM_debug_CallStack !! 
    150184!------------------------! 
    151  
    152 !------------------------! 
    153 IGCM_sys_Cp : out_run_file xxxxxxxxxxxx_out_run_file_error 
    154 ======================================================================== 
    155 }}} 
    156 If the following message is displayed : 
    157 {{{ 
    158 ======================================================================== 
    159 EXECUTION of : mpirun -f ./run_file > out_run_file 2>&1 
    160 ======================================================================== 
    161 }}} 
    162  
    163 If there is a message indicating that the `restartphy.n`" file doesn't exist it means that the model simulation was completed but before the end date of your simulation. If this happens and if your model creates an output log other than the simulation output log, you must refer to this log. 
    164 For example, the output file of the ocean model is stored on the file server under this name: 
    165 {{{ 
    166 IGCM_sys_Put_Out : ocean.output xxxxxxxx/OCE/Debug/xxxxxxxx_ocean.output 
    167 }}} 
    168 For LMDZ your output log is the same as the simulation output log and it has not been copied to the storage space. If your simulation has been performed on $SCRATCHDIR (TGCC) you can retrieve it there. Otherwise, you must restart your simulation using $WORKDIR (IDRIS) as the working directory keeping all needed files. You must also change the RUN_DIR_PATH variable. See [#run_dir_path here] before restarting it. 
    169  
    170  
    171 [[NoteBox(tip,In general\, if your simulation stops you can look for the keyword "IGCM_debug_CallStack" in this file. This keyword will come after a line explaining the error you are experiencing., 600px)]] 
    172 {{{ 
    173 Example :  
    174  
    175 --Debug1--> IGCM_comp_Update 
    176  
    177 IGCM_debug_Exit :  IGCM_comp_Update missing executable create_etat0_limit.e 
    178  
    179 !!!!!!!!!!!!!!!!!!!!!!!!!! 
    180 !! IGCM_debug_CallStack !! 
    181 !------------------------! 
    182185}}} 
    183186 
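The tip above (search `Script_Output` for the keyword `IGCM_debug_CallStack` and read the lines just before it) can be sketched in a few lines of Python. This is only an illustration, not part of libIGCM: the function names `find_callstack_context` and `scan_file`, and the default of five context lines, are assumptions made for this example.

```python
from collections import deque

# Keyword named in the tip above; the error explanation precedes it.
KEYWORD = "IGCM_debug_CallStack"


def find_callstack_context(lines, keyword=KEYWORD, before=5):
    """Return a list of (line_number, preceding_lines) for each keyword hit.

    `lines` is any iterable of text lines; `before` is how many lines
    preceding the keyword are kept as context (hypothetical default).
    """
    recent = deque(maxlen=before)  # sliding window of the last lines seen
    hits = []
    for number, line in enumerate(lines, start=1):
        line = line.rstrip("\n")
        if keyword in line:
            hits.append((number, list(recent)))
        recent.append(line)
    return hits


def scan_file(path, keyword=KEYWORD, before=5):
    """Apply find_callstack_context to a Script_Output file on disk."""
    with open(path, errors="replace") as handle:
        return find_callstack_context(handle, keyword, before)
```

Calling `scan_file` with the path of your own `Script_Output` file returns, for each occurrence of the keyword, its line number and the lines printed just before it, which is where the explanation of the error is expected to appear.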
     
    191194If the simulation failed because the executable exited abnormally, a `Debug` directory is created in the working directory. It contains the text output files of all model components in your configuration. You should read them to look for errors. For example: 
    192195 
    193  * `xxx_out_gcm.e_error` --> lmdz  text output 
     196 * `xxx_out_gcm.e.err` and `xxx_out_gcm.e.out` (or `xxx_out_lmdz.e.err` and `xxx_out_lmdz.e.out`) --> lmdz text output  
    194197 * `xxx_out_orchidee` --> orchidee text output  
    195198 * `xxx_ocean.output` --> nemo text output 
     
    224227# CCYY = year up to which you are deleting everything (this year included). By default, it's the current year in run.card 
    225228}}} 
    226  
    227 ## On Irene : How to use the ddt debugger for the coupled model (or any other MPMD mode)  ## 
    228  
    229 ## Debug on Irene ## 
    230229 
    231230