slurm_tips
Differences
This shows you the differences between two versions of the page.
| Next revision | Previous revision | ||
| slurm_tips [2025/07/07 18:39] – created bbruzzo | slurm_tips [2025/11/20 16:44] (current) – bbruzzo | ||
|---|---|---|---|
| Line 1: | Line 1: | ||
| ====== Cheatsheet SLURM ====== | ====== Cheatsheet SLURM ====== | ||
| + | |||
| + | ===== Ver logs de SLURM desde login ===== | ||
| + | < | ||
| ==Actualizar partición== | ==Actualizar partición== | ||
| + | Con el siguiente comando se actualiza temporalmente el tiempo máximo que puede tener un job para ingresar a una partición pero **hasta que se reincie el slurmctld** donde vuelve a tomar el que esté definido en slurm.conf: | ||
| < | < | ||
| + | |||
| + | ===== Monitorear consumo de QOS ===== | ||
| + | |||
| + | < | ||
| + | |||
| + | El output va a ser algo asi: | ||
| + | |||
| + | < | ||
| + | UsageRaw=56629.000000 | ||
| + | GrpJobs=N(1) GrpJobsAccrue=N(0) GrpSubmitJobs=N(1) GrpWall=N(59.72) | ||
| + | GrpTRES=cpu=N(64), | ||
| + | GrpTRESMins=cpu=944(943), | ||
| + | GrpTRESRunMins=cpu=N(64), | ||
| + | MaxWallPJ= | ||
| + | MaxTRESPJ= | ||
| + | MaxTRESPN= | ||
| + | MaxTRESMinsPJ= | ||
| + | MinPrioThresh= | ||
| + | MinTRESPJ= | ||
| + | PreemptMode=OFF | ||
| + | Priority=0 | ||
| + | Account Limits | ||
| + | cuentaprueba | ||
| + | MaxJobsPA=N(1) MaxJobsAccruePA=N(0) MaxSubmitJobsPA=N(1) | ||
| + | MaxTRESPA=cpu=N(64), | ||
| + | User Limits | ||
| + | utest(10054) | ||
| + | MaxJobsPU=N(1) MaxJobsAccruePU=N(0) MaxSubmitJobsPU=N(1) | ||
| + | MaxTRESPU=cpu=N(64), | ||
| + | | ||
| + | Ver la línea: | ||
| + | < | ||
| + | |||
| + | Donde 944 es la cantidad de horas disponibles y 943 es las utilizadas al momento. | ||
| + | |||
| + | ===== Cambiar de estado ===== | ||
| + | Drain: | ||
| + | < | ||
| + | scontrol update NodeName=cn0xx State=DRAIN Reason=" | ||
| + | </ | ||
| + | |||
| + | Undrain: | ||
| + | < | ||
| + | scontrol update NodeName=cn0xx State=DOWN Reason=" | ||
| + | scontrol update NodeName=cn0xx State=RESUME | ||
| + | </ | ||
| + | |||
| + | ===== Agregar usuario administrador ===== | ||
| + | < | ||
| + | sacctmgr modify user name=< | ||
| + | sacctmgr modify user name=< | ||
| + | </ | ||
slurm_tips.1751913570.txt.gz · Last modified: by bbruzzo
