ALICE User Documentation Wiki v02
From ALICE Documentation
Revision as of 19:23, 28 October 2020 by Schulzrf
Welcome to the ALICE HPC user documentation.
ALICE is a computing facility for state-of-the-art research and education of Leiden University. With ALICE you have the world of computing at your fingertips. On this wiki, you can find the information you'll need to get started and become more skilled in using computing to support your research and education.
We appreciate any questions or comments on the content of the documentation so that we can improve the range of information that we supply here.
If you are unsure about where to go next, have a look below.
What is ALICE?
Please check out the About ALICE pages to get some background information, a quick overview and see how to acknowledge it.
What's new with ALICE?
To get information about updates, upgrades, events, planned maintenance and more, have a look at the News page.
Here are the most recent news:
- 06 Oct 2022 - New user wiki: So far, there have been separate user wikis for ALICE HPC cluster and the SHARK HPC cluster at LUMC. However, there is a great deal of overlap in terms of information that you as a user need to work on ALICE or SHARK. Therefore, the support teams of both clusters are starting to move to a new joined HPC user wiki. The new wiki is live and can be found here: https://pubappslu.atlassian.net/wiki/spaces/HPCWIKI/. The old wikis are now frozen and no new content will be added to them. The new wiki provides information specific to each cluster in addition to a user guide and tutorials which apply to both clusters. There is also a news section, a calendar where we publish events, information about user meetings and workshops.
- 21 Sep 2022 - Access to ALICE: On 26 Sept 2022 between 18:00 and 18:30, access to ALICE will not be possible due to maintenance on the University cloud platform.
- 24 Aug 2022 - ALICE available again: Maintenance on ALICE is over. The cluster is online again and available to all users. We apologize for the delay.
- 23 Aug 2022 - ALICE system maintenance not finished and continues tomorrow: We managed to solve many of the issues that we faced yesterday. We are waiting for the completion of synchronization processes which are part of the high-availability setup procedure. If all goes well, we just need to run a few tests to verify that the new high-availability setup is working properly and all the nodes are coming back. Unfortunately, it was not possible to do today anymore. In case the setup fails after all, we are prepared to revert back all the changes and bring ALICE online again. In any case, we expect ALICE to be online again sometime tomorrow afternoon. We are sorry for the delay, but the new high-availability setup is vital for ALICE which is why have been working hard to get it done.
- 22 Aug 2022 - ALICE is offline due to system maintenance - Continues tomorrow: We encountered unexpected technical issues during our highest priority task for this maintenance day, the high-availability setup. Because this is a critical component for the continuing stability of ALICE and we require the cluster to be offline, we decided to continue solving the issues tomorrow and keep the cluster offline.
- 17 Aug 2022 - REMINDER - ALICE system maintenance on 22 Aug 2022: We will perform system maintenance on ALICE on 22 Aug 2022 between 09:00 and 18:00 CEST. Our primary focus will be the high-availability set up of ALICE in addition to other maintenance tasks. This will require us to take all compute and login nodes of the cluster offline. It will not be possible to run any jobs and access data on ALICE. The login nodes will be rebooted and all active terminal or X2Go sessions will be terminated. Until maintenance starts, you can continue to use ALICE as usual and submit jobs. Slurm will also continue to run your job if the requested running time will allow it to finish before the maintenance starts. If you have any questions, please contact the ALICE Helpdesk.
- 01 Aug 2022 - ALICE system maintenance on 22 Aug 2022 - First announcement: We will perform system maintenance on ALICE on 22 Aug 2022 between 09:00 and 18:00 CEST. Our primary focus will be the high-availability set up of ALICE in addition to other maintenance tasks. This will require us to take all compute and login nodes of the cluster offline. It will not be possible to run any jobs and access data on ALICE. Until maintenance starts, you can continue to use ALICE as usual and submit jobs. Slurm will also continue to run your job if the requested running time will allow it to finish before the maintenance starts. If you have any questions, please contact the ALICE Helpdesk.
- 01 Jun 2022 - Disabled access to old scratch storage: As previously announced, we have disabled access to the old scratch storage. We will keep the data available until 30 June 2022. Afterwards, we will start to delete data so that we can repurpose the storage within ALICE. You can request temporary access by contacting the ALICE Helpdesk. See also the wiki page: Data Storage.
System Maintenance on ALICE will take place on 22 Aug 2022 between 09:00 and 18:00 CEST (See the Maintenance Announcement)
Just Getting Started?
If you're new to ALICE, please check out the User Guide.
What more can I do with ALICE?
If you already have experience with ALICE and/or HPC, have a look at the Advanced Guide pages. Please note that many of the pages here are still under construction and subject to change.
What else is there about ALICE?
If you need more information on general topics, such as hardware, storage, and policies, please take a look at the Documentation pages. Please note that many of the pages here are still under construction and subject to change.
Have a question or feedback on ALICE?
If you have a question about ALICE, need help with using it or want to give us some feedback, please see the Support page to know how you can connect with us.
Status of ALICE?
Would you like to know how busy ALICE is and if all nodes are up, then please have a look at the Current Status Overview.