Difference between revisions of "Latest News"

From ALICE Documentation

 
=== Latest News ===
 
*'''30 Aug. 2021 - Node020 reserved for testing:''' We have been working on the configuration of the new BeeGFS storage system. For this purpose, we have reserved node020 for running tests.
 
*'''23 Jul. 2021 - Leiden University network maintenance on 31 Jul/01 Aug:''' Maintenance on the network of Leiden University will take place on the weekend of 31 July/01 August. During this time ALICE will continue to run, but in total isolation, i.e., with no internet access. This means that you will not be able to log in to ALICE, and jobs cannot, for example, pull code, download data or access license servers. During the maintenance, the status will be tracked here: [[News#Next_Maintenance|Next maintenance]].
 
 
*'''29 Jun. 2021 - ALICE system maintenance finished (Update):''' System maintenance has finished and ALICE is available again.  
 
 
*** The time limit on the mem partition has been changed from Infinite to 14 days.
 
 
*** Resources on the testing partitions are now limited to 15 CPUs per node, a maximum of 150G of memory per node, and a default of 10G of memory per CPU.
 
*'''28 Jun. 2021 - ALICE system maintenance continues tomorrow:''' During our maintenance, we encountered a few issues with the Infiniband switch and login node 01. Because of these issues, we also did not finish updating the GPU nodes. We will continue working on these items tomorrow (Tuesday, 29 June 2021) until at least 12:00. ALICE will remain offline for maintenance.
 
*'''27 Jun. 2021 - ALICE offline for system maintenance:''' More information here [[News#Next_Maintenance|Next maintenance]].
 
*'''25 Jun. 2021 - System maintenance on ALICE:''' ALICE will undergo system maintenance on '''28 June 2021'''. More information here [[News#Next_Maintenance|Next maintenance]].
 
*'''2 Jun. 2021 - Rclone available on ALICE:''' Rclone is available on ALICE and there are instructions on how to set it up to transfer files to and from SurfDrive and ResearchDrive: [[Data_Transfer |Data transfer to and from ALICE]]. This is a new feature and feedback on your experience is very welcome.
 

Revision as of 07:59, 30 August 2021

Latest News

  • 30 Aug. 2021 - Node020 reserved for testing: We have been working on the configuration of the new BeeGFS storage system. For this purpose, we have reserved node020 for running tests.
  • 23 Jul. 2021 - Leiden University network maintenance on 31 Jul/01 Aug: Maintenance on the network of Leiden University will take place on the weekend of 31 July/01 August. During this time ALICE will continue to run, but in total isolation, i.e., with no internet access. This means that you will not be able to log in to ALICE, and jobs cannot, for example, pull code, download data or access license servers. During the maintenance, the status will be tracked under Next maintenance.
  • 29 Jun. 2021 - ALICE system maintenance finished (Update): System maintenance has finished and ALICE is available again.
    • However, one issue remains.
    • Login node 1 is down due to technical issues on the node. Login2 is running and can be used instead. Connections that are intended to login1 are automatically routed to login2. There should be no need to change your ssh configs.
    • The Infiniband network is down due to technical issues on the Infiniband switch.
    • List of changes:
      • Login node 1 is running and the NVIDIA Tesla T4 has been integrated successfully. Instructions on using the T4 will follow soon.
      • Slurm version 20.11.7 is now running on ALICE.
      • EasyBuild 4.4.0 is used for the Intel and AMD branches.
      • The partitions notebook-gpu, notebook-cpu, playground-cpu, playground-gpu have been removed.
      • The time limit on the mem partition has been changed from Infinite to 14 days.
      • Resources on the testing partitions are now limited to 15 CPUs per node, a maximum of 150G of memory per node, and a default of 10G of memory per CPU.
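
The testing-partition limits listed above interact: with the default of 10G per CPU, a job that asks for all 15 CPUs and no explicit memory already reaches the 150G per-node maximum. The following is a minimal Python sketch of that arithmetic, not a description of how Slurm enforces the limits. The function names are hypothetical, the job is assumed to run on a single node, and "G" is treated as a plain unit of gigabytes.

```python
# Limits quoted in the news item above for the testing partitions.
MAX_CPUS_PER_NODE = 15
MAX_MEM_PER_NODE_GB = 150
DEFAULT_MEM_PER_CPU_GB = 10


def effective_memory_gb(cpus, mem_per_cpu_gb=None):
    """Return the memory (in GB) a single-node job would be assigned.

    Hypothetical helper: falls back to the default per-CPU memory
    when the job does not request memory explicitly.
    """
    per_cpu = DEFAULT_MEM_PER_CPU_GB if mem_per_cpu_gb is None else mem_per_cpu_gb
    return cpus * per_cpu


def fits_testing_limits(cpus, mem_per_cpu_gb=None):
    """Check a single-node request against the CPU and memory limits."""
    if cpus < 1 or cpus > MAX_CPUS_PER_NODE:
        return False
    return effective_memory_gb(cpus, mem_per_cpu_gb) <= MAX_MEM_PER_NODE_GB


if __name__ == "__main__":
    # 15 CPUs at the default 10G per CPU exactly reaches the 150G cap.
    print(effective_memory_gb(15), fits_testing_limits(15))
    # 8 CPUs at 20G per CPU would need 160G, which exceeds the cap.
    print(effective_memory_gb(8, 20), fits_testing_limits(8, 20))
```

Under these assumptions, a request only fits if both the CPU count and the product of CPUs and per-CPU memory stay within the limits, so raising the per-CPU memory above the default reduces the number of CPUs that can be used on one node.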