Difference between revisions of "NGI DE CH Operations Center:Operations Meeting:03022012"
Revision as of 17:11, 13 February 2012
Introduction
- Minutes of last meeting
Announcements
- Meetings/conferences
NGI-DE/NGI-CH/D-Grid Workshop in April. Note: there is also a dCache Workshop in April, so the date should be chosen carefully.
The EGI Community Forum (http://go.egi.eu/cf12) will be in Munich 26-30th March 2012 and held in conjunction with the 2nd EMI Technical Conference. Abstract submission was open until 2/12/11.
- Availability/reliability statistics
- Monitoring
Nagios Update 15
- Staged rollout/updates
Round the sites
- NGI-DE
- BMRZ-FRANKFURT (Uni Frankfurt)
- DESY-HH
- DESY-ZN
- FZJuelich
- Goegrid
- GSI
- ITWM (Martin Braun)
ntr
- KIT (GridKa, FZK-LCG2)
- KIT (Uni Karlsruhe)
- LRZ
- MPI-K
- MPPMU
+ gftp crashed every few hours with an "OutOfMemoryError" (using both OpenJDK and Sun JDK). The issue was solved by upgrading the Java JDK 1.6 package.
+ Increased the number of movers in order to reduce pending transfers.
+ CREAM2 upgraded (glite-CREAM moved from 3.2.13-1 to 3.2.14-1.sl5, glite-SGE_utils.x86_64 3.2.3-1.sl5).
+ Security fix on the APEL service.
+ LFC failures due to "Bad magic number": hanging GPFS connections caused LFC timeouts. The workaround was to change the CREAM config to decrease the GPFS load.
- RWTH Aachen
- SCAI
- Uni Bonn
- Uni Dortmund
- Uni Dresden (Ralph Mueller Pfefferkorn)
For about two months we have had problems with our file system, especially the central NFS file system. The NFS system becomes overloaded: hundreds of jobs with hundreds of files.
Paolo/CSCS: We had the same problems. They were fixed by changing the CREAM grubber, and we went from Lustre to GPFS with SSD disks for the metadata and the inode table.
- Uni Freiburg (Anton Gamel)
- problems with gsi ssh -> increased movers
- installed additional dCache servers
- Uni Mainz-Maigrid
- Uni Siegen
- Uni Wuppertal
- SwiNG
- CSCS (Paolo)
- maintenance two days ago: firmware update of the disks, lost 4 disks/CMS pool (in contact with CMS)
- test CERNVMFS in preproduction
- PSI
- Switch
Note: please update your entry at https://wiki.egi.eu/wiki/NGI_DE:Sites if needed.
Status ROD
- Any problematic tickets?
- Handover of the ROD shift
- ROD shift schedule https://wiki.egi.eu/wiki/NGI_DE_CH_Operations_Center:Operations_Teams#Shifts_rotation_table
AOB
If you have additional topics to be discussed during the meeting, please submit them in advance via our email list.