Actions

Difference between revisions of "ALICE User Documentation Wiki v02"

From ALICE Documentation

(Created page with "thumb|Off to research computing Wonderland __NOTOC__ ALICE is the computing facility for excellent research of Leiden University. With ALICE you have the w...")
 
 
(9 intermediate revisions by the same user not shown)
Line 2: Line 2:
 
__NOTOC__
 
__NOTOC__
  
ALICE is the computing facility for excellent research of Leiden University. With ALICE you have the world of computing at your fingertips. On this wiki, you can find the information you'll need to get started and become more skilled in using computing to support your research.
+
'''Welcome to the ALICE HPC user documentation.'''
  
For background information, read the [[About ALICE]] page.  
+
ALICE is a computing facility for state-of-the-art research and education of Leiden University. With ALICE you have the world of computing at your fingertips. On this wiki, you can find the information you'll need to get started and become more skilled in using computing to support your research and education.  
  
 +
We appreciate any questions or comments on the content of the documentation so that we can improve the range of information that we supply here.
  
Please know that this wiki is currently a work in process. We appreciate any questions or comments on the contents so that we can improve the range of information that we supply here.   
+
If you are unsure about where to go next, have a look below.
  
 +
==What is ALICE?==
 +
Please check out the [[About ALICE]] pages to get some background information, a quick overview and see how to acknowledge it.
  
''IMPORTANT NOTE: ALICE is still in a build-up phase. Configurations are still subject to change. You may, therefore, experience unexpected behaviour for the time being.''
+
==What's new with ALICE?==
 +
To get information about updates, upgrades, events, planned maintenance and more, have a look at the [[News]] page.  
  
 +
Here are the most recent news:
  
Use of the ALICE cluster must be acknowledged in any and all publications. For more info see: [[About ALICE]]
+
{{:Latest News}}
 
 
  
 
----
 
----
{{:Maintenance announcements}}
+
{{:Next Maintenance}}
 
----
 
----
  
==Getting Started==
+
==Just Getting Started?==
If you're new to ALICE, please check out the [[Getting Started]] page.
+
If you're new to ALICE, please check out the [[User_Guides|User Guide]].
 
 
==[[Gaining access]]==
 
Access to the cluster and file transfer are done via SSH and SCP/SFTP. Select one of the below links for more detail or click on the heading of the paragraph for a full overview.
 
 
 
*[[Gaining_access#Account|Getting an account]]
 
*[[Gaining_access#Login|Login to ALICE]]
 
*[[Gaining_access#Key_based_authentication|Public key authentication]]
 
*[[Gaining_access#File_systems|Accessible file storage]]
 
*[[Gaining_access#File_transfer_to_and_from_ALICE|File transfer]]
 
 
 
==[[Policies#Access policy|Access Policy]]==
 
Access needs to be granted actively (by the creation of an account on the cluster by the ALICE Cluster workgroup. Use of resources is limited by the scheduler. Depending on the availability of queues ('partitions') granted to a user, priority to the system's resources is regulated on the basis of Faculty/institute/PI levels.
 
 
 
==Software==
 
 
 
===Cluster Monitoring Software and Scheduler===
 
ALICE uses Bright Cluster Manager software for overall cluster management and Slurm as the scheduler.
 
 
 
*[[Bright Cluster Manager|Monitor cluster status with BCM]]
 
*[[Ganglia Cluster Monitoring]]
 
*[[Slurm|Compose, submit and manage jobs with Slurm]]
 
*[[Running interactive jobs]]
 
  
===Installed software===
+
==What more can I do with ALICE?==
[[Accessing software|Globally installed software, modules]]<br />
+
If you already have experience with ALICE and/or HPC, have a look at the [[Advanced Guide]] pages. Please note that many of the pages here are still under construction and subject to change.
  
==Cluster configuration==
+
==What else is there about ALICE?==
Find [[hardware description|here a hardware description]] of the ALICE cluster.
+
If you need more information on general topics, such as hardware, storage, and policies, please take a look at the [[Documentation]] pages. Please note that many of the pages here are still under construction and subject to change.
  
 +
==Have a question or feedback on ALICE?==
 +
If you have a question about ALICE, need help with using it or want to give us some feedback, please see the [[Support]] page to know how you can connect with us.
  
==Frequently Asked Questions==
+
==Status of ALICE?==
Find [[faq|here a list of frequently asked questions]] for the ALICE cluster.
+
Would you like to know how busy ALICE is and if all nodes are up, then please have a look at the [[Current Status Overview]].

Latest revision as of 19:23, 28 October 2020

Off to research computing Wonderland


Welcome to the ALICE HPC user documentation.

ALICE is a computing facility for state-of-the-art research and education of Leiden University. With ALICE you have the world of computing at your fingertips. On this wiki, you can find the information you'll need to get started and become more skilled in using computing to support your research and education.

We appreciate any questions or comments on the content of the documentation so that we can improve the range of information that we supply here.

If you are unsure about where to go next, have a look below.

What is ALICE?

Please check out the About ALICE pages to get some background information, a quick overview and see how to acknowledge it.

What's new with ALICE?

To get information about updates, upgrades, events, planned maintenance and more, have a look at the News page.

Here are the most recent news:

Latest News

  • 25 Feb. 2021 - Next major maintenance window on 08 March 2021: Please have a look at the maintenance page for details on our planned work and how it affects you.
  • 12 Feb. 2021 (Update 22 Feb. 2021) - SSH Connection Stability: If you recently started experiencing that your ssh connection is breaking up after a few minutes of being idle, please check the settings below for you ssh configuration for ALICE. If this does not solve the issue, please contact the ALICE Helpdesk.
    • for Linux, MacOS, Windows using OpenSSH command line connection: Make sure you use "ServerAliveInterval 60" and "ServerAliveCountMax 3" to your ssh config settings.
    • MobaXterm: Go to Settings -> SSH -> SSH settings and enable "SSH keepalive"
    • PuTTY: Go to Settings -> Connection -> Set a non-0 value in "Settings between keepalives" (e.g., 60)
  • 25 Jan. 2021 - Outlook for ALICE in 2021: We have updated the section outlining our expansions plans for ALICE in 2021 (Future plans). Two major items this year will be the addition of a new parallel file storage system and the expansion of the GPU nodes. But there is more on our agenda, so stay tuned...
  • 08 Jan. 2021 - SURF HPC Workshops: SURF is offering HPC-related workshops on various topics. You can find a list of upcoming workshops (and more) on the SURF website (Link). Workshops of interest to HPC users are:
    • Webinar Introduction Supercomputing
    • Webinar Introduction HPC Cloud
    • Using the Amsterdam Modeling Suite in HPC systems
    • SURF Research Week

Next Maintenance

The next major maintenance window will be on 8 March 2021.

What are we going to do?

  • We will integrate an NVIDA Tesla T4 GPU into login node 2. We have purchased two GPUs, i.e., one for each login node. We will start with login node 2 because there are generally fewer users on it.

What does this mean for you?

  • We will have to take login node 2 offline in order to integrate the GPU.
  • All active connections, screens or tmux sessions will be terminated.
  • We hope that we can bring login node 2 back until noon.
  • After a testing phase, we will make the T4 available and announce details on it use.

Do you have questions?

Please contact the (ALICE Helpdesk)


Just Getting Started?

If you're new to ALICE, please check out the User Guide.

What more can I do with ALICE?

If you already have experience with ALICE and/or HPC, have a look at the Advanced Guide pages. Please note that many of the pages here are still under construction and subject to change.

What else is there about ALICE?

If you need more information on general topics, such as hardware, storage, and policies, please take a look at the Documentation pages. Please note that many of the pages here are still under construction and subject to change.

Have a question or feedback on ALICE?

If you have a question about ALICE, need help with using it or want to give us some feedback, please see the Support page to know how you can connect with us.

Status of ALICE?

Would you like to know how busy ALICE is and if all nodes are up, then please have a look at the Current Status Overview.