Announcements

KAUST Supercomputing Laboratory Newsletter 13th January 2016

XSEDE HPC Workshop: OpenMP

The registration page for the XSEDE HPC Monthly Workshop Series - January 20th - OpenMP session is up.
The portal registration page can be found here:
https://portal.xsede.org/course-calendar/-/training-user/class/454/sessi...
If there is enough interest, KSL will investigate the possibility of streaming this course live in the Library computer room on the 20th of January.

KAUST Supercomputing Laboratory Newsletter 6th January 2016

XSEDE HPC Workshop: OpenMP

The registration page for the XSEDE HPC Monthly Workshop Series - January 20th - OpenMP session is up.
The portal registration page can be found here:

https://portal.xsede.org/course-calendar/-/training-user/class/454/sessi...

If there is enough interest, KSL will investigate the possibility of streaming this course live in the Library computer room on the 20th of January.

KAUST Supercomputing Laboratory Newsletter 29th December 2015

Firewall Upgrade

Please note the following alert from KAUST IT services:

Firewall upgrade has been scheduled on Thursday 31st December 2015 from 17:00 to 21:00 AST.

All services hosted will face intermittent downtime during the upgrade. These services include KSL website and may affect your login to Shaheen II and Neser.

RCAC Meeting

The deadline for project submission for the next RCAC meeting is Thursday 31st December 2015. Please note that RCAC meeting are held every month.

KAUST Supercomputing Laboratory Newsletter 10th December 2015

RCAC Meeting

The next scheduled RCAC (Research Computing Allocation Committee) meeting is Thursday 17th December.

Retirement of Neser and Associated Scratch Storage

As previously announced, Neser will be decommissioned on January and the last date that jobs will be scheduled to run is 31st December 2015.

We will continue to have the Shaheen I/Neser ‘home' and ‘project' filesystems available until at least 31st July 2016. However, please note that the ‘scratch' filesystem will be taken off-line and deleted on the 1st February 2016.

KAUST Supercomputing Laboratory Newsletter 25th November 2015

Shaheen II Filesystem

We are pleased to confirm that the problem affecting the filesystem availability has been resolved and the system is fully available for use.

Maintenance Session 1st December 2015

The next maintenance session on Shaheen II will be on Tuesday 1st December from 12:00 until 17:00.

XSEDE HPC Workshop: OpenACC

The registration page for the XSEDE HPC Monthly Workshop Series - December 3 - OpenACC session is up.

The portal registration page can be found here:

KAUST Supercomputing Laboratory Newsletter 17th November 2015

SLURM workq_high and workq_low

Please note that workq_high and workq_low will be removed from the system on the 1st December. If you have either of these partitions specified in your job control file, they should be removed. As workq is the default partition, this does not need to be specified.

Neser Last Day of Service

Please note this system will be decommissioned in January, the last day that any job will be able to run is 31st December 2015.

Shaheen II Availability

We are pleased to confirm that Shaheen was brought back online at 08:00 this morning and is fully available for use.

Shaheen II Emergency Shutdown

Dear Users

 

We have encountered a major issue affecting the availability of the Lustre filesystem.

 

Cray have recommended that we perform and immediate shutdown of the system to prevent data loss.

 

We are working on identifying the reason for the failure and will update you when we have more information.

 

Shaheen II hardware is complete!

Dear Shaheen II Users, 

Over the last couple of weeks you have experienced considerable disruption in using Shaheen II due to a combination of scheduled maintenance sessions and unforeseen failures in hardware and software. We sincerely apologize for the inconvenience these might have caused. Our team works hard to minimize these downtimes, keeping as our most important goal to ensure you are highly productive on our systems.

Shaheen II Status

Good Morning

 

We are pleased to confirm that the issues we encountered with system following last week’s maintenance session have now been resolved.

 

The system is now fully available to run jobs.

 

We would also like to remind you that the next maintenance session on Shaheen II will be from 08:00 on the 8th November for 3 days (until 08:00 on the 11th November).

 

Announcements, 27th October 2015

Extended maintenance sessions in October.

Maintenance work on Shaheen II is taking longer than originally envisioned. Service will not be returned by tomorrow but we are hopeful that Shaheen II will be operational by the end of the week. There will be occasional disruption to the CDLs, but at least one CDL will be available for users to login to during this period.

Announcements, 14th October 2015

Extended maintenance sessions in October.

We would like to remind our users of the extended outage on Shaheen II in October for necessary maintenance:

25th October for 3 days

There will be no access to the system during these periods.

Tip of the week: Queues on Shaheen II Cray XC40

Two different queue are available on Shaheen II:

Announcements, 8th September 2015

KSL Workshop Towards High Efficiency Computing with Allinea

KAUST Supercomputing Laboratory presents the Allinea Software workshop on HPC profiling and debugging: "Towards High Efficiency Computing with Allinea" on October 4th, starting at 9am.

Workshop topics include:

Announcements, 11th August 2015

Maintenance Session Tuesday 18th August

The next maintenance session will be on Tuesday 18th August from 12:00 until 17:00. There will be no access to the system during this period. This will affect Shaheen I, Neser and Shaheen II. Important security updates, custom patches and several bug fixes will be applied to the XC40, which will require the whole system to be rebooted.

Tip of the Week: What command did I type before ?

History command

Announcements, 26th May 2015

Shaheen II Cray XC40 Workshop Announcement

Date: 7th June to 11th June 2015

Where: KAUST: Auditorium Al-Haytham (down the steps between Bldg2 and Bldg3)

KAUST Supercomputing Lab and Cray are offering a series of three courses:

*Sunday 7th June to Tuesday 9th, 2015 Introduction to the new Shaheen II Cray XC40

*Wednesday 10th June 2015 Efficient Parallel I/O

*Thursday 11th June 2015 Port and optimize your own code on the Cray XC40

Announcements, 28th April 2015

Shaheen-I job size limitation

We only have a limited number of spare parts for Shaheen, and yesterday we exhausted our stock of node cards.

We have had another node card failure this morning, which means that we are now in the situation where we are ‘cannibalising’ the system to supply parts.

With immediate effect we have taken two node cards offline in rack 00.

This means that we can no longer run 16 rack jobs and the maximum size job that can be run on Shaheen is now 12288 nodes (12 racks).

Announcements, 7th April 2015

Power Outage Thursday 9th April to Monday 13th April

In preparation for the introduction of the new Cray supercomputer, there will be a site-wide power outage to the Data Centre currently housing Shaheen1 and Neser. All services, including Shaheen and Neser, will be shut down from 16:00 on Thursday 9th April until approximately 11:00 on Monday 13th April.

We apologise for the late notice and for any inconvenience that this may cause.

Announcements, 17th March 2015

Shaheen and Neser unavailable 18th-20th March 2015

In preparation for the introduction of the new Cray supercomputer, there will be a site-wide power outage to the Data Centre currently housing Shaheen1 and Neser. All services, including Shaheen1 and Neser, will be shut down from 17:00 on Wednesday 18th March until approximately 11:00 on 20th March.

Announcements, 17th February 2015

Annual Power Maintenance

Due to annual power maintenance in Building 1, all systems will be shutdown from 16:00 on Thursday 26th February until approximately 10:00 on Sunday 1st March.

Tip of the Week: Using TotalView Debugger for Parallel jobs on Neser

In this example, we will be using the Intel compiler along with OpenMPI/1.6.4/intel( intel-compilers/11.1 and openmpi/1.6.4/intel should be loaded). In addition, it is required that you connect to Neser using the ssh –X option, to get the Totalview GUI ( Graphical User Interface).

Announcements, 10th February 2015

Disruption to Shaheen earlier today

Earlier today, a fault occurred on both Shaheen front end nodes, rendering them completely unresponsive. To remedy the situation, both front end nodes were rebooted. We apologise for any disruption this may have caused. The root cause of the problem is still under investigation.

Maintenance Session Tuesday 17th February

The next maintenance session will be on Tuesday 17th February from 12:00 until 17:00. There will be no access to the system during this period.

Announcements, 3rd February 2015

KSL Presents the XSEDE HPC Monthly Workshop on OpenACC This Friday

Registration deadline Tomorrow. For registration and further information click here.

Invitation to attend the second KAUST-NVIDIA workshop on "Accelerating Scientific Applications using GPUs" on February 17th, 2015

The Supercomputing Laboratory is pleased to announce the second KAUST-NVIDIA one-day workshop about accelerating scientific applications using GPUs on February 17th.

The registration is free but required using this link.

Pages