wiki:Doc/ComputingCenters/TGCC/Irene

Version 1 (modified by acosce, 6 years ago) (diff)

--

Working on the Irène machine


Update 18/06/18

Documentation under development

1. On-line users manual

2. Job manager commands

  • ccc_msub job -> submit a job
  • ccc_mdel ID -> kill the job with the specified ID number
  • ccc_mstat -u login -> display all jobs submitted by login
  • ccc_mpp -> display all jobs submitted on the machine. ccc_mpp -n to avoid colors.
  • ccc_mpp -u $(whoami) ->display your jobs.

3. Irene environment and files system

  • Before working on Irène you can source the ipsl environment
    source ~p86ipsl/.envi_irene
    
    --> Be carefule this environment can be update during next weeks according to TGCC recommendations
  • Default compiler is intel/17.0.6.256 with default librairy MPI mpi/openmpi/2.0.4. WARNING : there is a bug in the compiler intel-17.0.4.196
  • Computing nodes: the nodes of partition skylake have 48 cores each, which is 3 times more than the computing nodes from the standard partition of Curie;
  • Filesystem accesses MUST be explicit: from the login nodes, you will see the WORK, SCRATCH, STORE spaces as you probably are used to. However, when submitting any job through ccc_msub or ccc_mprun, you must specify -m work, -m scratch, -m store, or combine them like in -m work,scratch; this constraint has the advantage that your jobs won't be supended if a filesystem you don't need becomes unavailable;
  • Compute nodes are diskless, meaning that /tmp is not hosted on a local hard drive anymore, but on system memory instead. It offers up to 16 GB (compared to 64 GB on Curie). Please note that any data written to it is reduces the size of the memory that remains available for computations. In our case it change the number of core use for post-treatment like pack_output.
  • The default time limit for a job submission is 2hours (7200s) contrary to 24h (86400s) on curie
  • The SCRATCH space purge policy has been simplified: any file that stays 60 days without being read or modified will be purged, as well as any directory that remains empty for 30 days; the purge policy does not depend on disk occupation threshold any longer

4. How to work with last LMDZOR_v6 or last LMDZORINCA_v6 ?