In this newsletter:
- RCAC meeting
- Shaheen Maintenance: 23rd of March and April
- Seminar on Performance and Debugging Analysis Tools. March 3rd 2021.
- AI users survey
- Tip of the week: MPICH_STATS_DISPLAY
- Follow us on Twitter
- Previous Announcements
- Previous Tips
Shaheen Maintenance: 23rd of March and April
We would like to announce our next maintenance session on Shaheen on the 23rd of March 2021, from 8am to 5pm. We plan to apply the latest patches and security updates, reboot the system and repair hardware. Access to the files and login nodes should be possible during the outage.
We would also like to give you advance notice of a longer Shaheen outage towards the end of April. The datacentre team will be performing their annual PPM on the power supply equipment. At the same time, we will upgrade Shaheen's existing project and scratch filesystems. This is an essential step before bringing our newly acquired filesystem online and providing more project storage space. We estimate that the combined Shaheen outage will take around 4 to 6 days. We will communicate the details closer to the date. As always, please contact us at email@example.com should you have any concerns or questions.
RCAC meeting
The project submission deadline for the next RCAC meeting is 28 February 2021. Please note that the RCAC meetings are held once per month. Projects received on or before the submission deadline will be included in the agenda for the subsequent RCAC meeting. The detailed procedures, updated templates and forms are available here: https://www.hpc.kaust.edu.sa/account-applications
Seminar on Performance and Debugging Analysis Tools: March 3rd 2021.
How can I check the performance of my code on CPUs and/or GPUs?
KAUST Supercomputing Core Lab is organizing a seminar on Performance and Debugging Analysis Tools, with contributions from HPE and ARM, on Wednesday, 3 March 2021 from 4 to 6pm KSA Time. This seminar will help the community developing and using C/C++/Fortran and Python codes to debug their code, check its performance and optimize their workflow on KSL HPC systems using CPUs and/or GPUs.
To receive the Zoom link, please register here: https://hpc.kaust.link/Registration_Performance_tools21
More details about the event: https://www.hpc.kaust.edu.sa/KSL_Seminar21_Performance_Debugger
AI users survey
As you are aware, the GPU resources at KSL are in high demand near major conference deadlines. To manage the current resources better and to plan future expansions/extensions, it is important for us to keep up with the changing demand from workloads running on these GPU resources. We have designed a GPU workload characterization user survey to capture the state of your workloads. Please spare some time to complete this survey. We also kindly request AI faculty members to circulate the survey link to their students, postdocs and research scientists. Maximizing the number of responses will help us make better-informed decisions that serve you well in the near and far future.
Tip of the week: MPICH_STATS_DISPLAY
To get a summary of MPI statistics, available through the MPI Tools Interface, set MPICH_STATS_DISPLAY to 1; rank 0 will then write all the output to stderr. If it is set to 2, every rank produces its own statistics summary and writes it to a file on a per-rank basis. By default, only basic information is provided.
For a more detailed summary, you can set MPICH_STATS_VERBOSITY to 2. More details are available in the man page intro_mpi.
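As an illustration, here is a minimal Python sketch of one way to set these variables when launching a job from a script. The helper name mpich_stats_env, the srun launch line, the task count and the executable ./my_app are hypothetical placeholders, not part of the tip itself; only the two MPICH_STATS_* variables and their values come from the text above.

```python
import os
import subprocess


def mpich_stats_env(display=1, verbosity=None, base_env=None):
    """Return a copy of the environment with the MPICH statistics variables set.

    display=1: rank 0 writes the summary to stderr.
    display=2: every rank writes its own summary to a file.
    verbosity=2 asks for the more detailed summary.
    """
    # Copy so that the caller's environment is left untouched.
    env = dict(os.environ if base_env is None else base_env)
    env["MPICH_STATS_DISPLAY"] = str(display)
    if verbosity is not None:
        env["MPICH_STATS_VERBOSITY"] = str(verbosity)
    return env


# Hypothetical usage (launcher, task count and executable are assumptions):
# subprocess.run(["srun", "--ntasks=64", "./my_app"],
#                env=mpich_stats_env(display=1, verbosity=2), check=True)
```

In a batch script the same effect is obtained by exporting the two variables before the launch command. With MPICH_STATS_DISPLAY set to 1 the summary goes to stderr, so it appears wherever your job's error stream is written.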
Follow us on Twitter
Follow all the latest news on HPC within the Supercomputing Lab and at KAUST, on Twitter @KAUST_HPC.