Opened 11 years ago

Closed 10 years ago

#170 closed defect (fixed)

probleme at obelix : RunChecker and TS

Reported by: jgipsl Owned by: sdipsl
Priority: critical Milestone: libIGCM_v2.3 metrics
Component: system Version:
Keywords: Cc:

Description

using libIGCM trunk rev 964

At obelix, the RunChecker.job crash with following error message :

./RunChecker.job[167]: IGCM_check_SearchCatalog[114]: unset: warning: SUBMIT_DIR: is read only
./RunChecker.job[167]: IGCM_check_SearchCatalog: line 192: SUBMIT_DIR: is read only

Find exemple here :

cd /home/scratch01/jghattas/MYPOSTTEST/modipsl_libItrunk/libIGCM
./RunChecker.job -u jghattas -p /home/scratch01/jghattas/MYPOSTTEST/modipsl_libItrunk/config/ORCHIDEE_OL/testrun

At obelix, also the TS do not work, they are not even launched because the rebuild_fromWorkdir crash in the end on an "export SUBMIT_DIR". Error message :

# Exemple from file /home/scratch01/jghattas/IGCM_OUT/OL2/TEST/test/testrun/Out/rebuild_fromWorkdir.19821231.out
/var/spool/torque/mom_priv/jobs/520464.obelix1.SC[156]: IGCM_FlushRebuild: line 38: SUBMIT_DIR: is read only


And in Script_Output,
there is another error message, minor I think :

# exemple from /home/scratch01/jghattas/IGCM_OUT/OL2/TEST/test/testrun/Out/Script_Output_testrun.000001.30514
cat: write error: Broken pipe

Change History (11)

comment:1 Changed 11 years ago by jgipsl

Commit [968] corrects error on RunChecker and create_TS.
Last error in Script_output remains.

comment:2 Changed 11 years ago by jgipsl

The remaining problem occures when concating the out_orchidee_000x files into one file. The file is producced and looks ok so I don't know if the error message makes a difference.


The probleme occures when the files($file) are to big.
The limit in size is between 175171 and 227169.

The corresponing line is 1057 in libIGCM_comp.ksh :

cat ${file} | sed "s/\(.*\)/${i_}\1/" ${file} >> ${file_out}

comment:3 Changed 11 years ago by sdipsl

try this:

cat ${file} | xargs sed "s/\(.*\)/${i_}\1/" ${file} >> ${file_out}

comment:4 Changed 11 years ago by jgipsl

Doesn't work...

comment:5 Changed 11 years ago by sdipsl

One possible work-around would be to try to increase ulimit -s. it is 10240 by default. On Linux, the maximum amount of space for command arguments is 1/4th of the amount of available stack space. So on obelix what is the value of this.
ulimit -s

If not 10240 then change that
ulimit -s 10240

If you are not allowed to do that then ask the obelix sys admin to do it for you

Trop bizarre de se parler en anglais :-)

comment:6 Changed 11 years ago by jgipsl

C'est déjà mis a unlimited sur obelix... donc pas de chance, pas d'amélioration...

comment:7 Changed 11 years ago by sdipsl

  • Milestone set to libIGCM_v2.2 metrics
  • Priority changed from major to critical

comment:8 Changed 10 years ago by sdipsl

Est ce toujours un problème?

Si oui qu'en pense les admins de obelix? Je trouve que la balle est dans leur camp étant la taille de fichier à partir de laquelle le problème se pose.

comment:9 Changed 10 years ago by jgipsl

Francois Marabelle(LSCE) dit :

Je pense que le script est faux. il faudrait faire:

cat ${file} | sed "s/\(.*\)/${i_}\1/"  >> ${file_out}

au lieu de:

cat ${file} | sed "s/\(.*\)/${i_}\1/" ${file} >> ${file_out}

Josefine : J'ai testé sur obelix, ca marche. Ca ne donne plus des messages d'erreur et le fichier out_orchidee est identique a celui d'avant.

comment:10 Changed 10 years ago by sdipsl

  • Owner changed from somebody to sdipsl
  • Status changed from new to assigned

comment:11 Changed 10 years ago by sdipsl

  • Resolution set to fixed
  • Status changed from assigned to closed
Note: See TracTickets for help on using tickets.