
Difference between revisions of "NGI DE:Germany-QR4"

__NOTOC__
 
{| border="1" cellspacing="0" cellpadding="2"
|-
!scope="col"| Quarterly Report Number
!scope="col"| NGI Name
!scope="col"| Partner Name
!scope="col"| Author
|-
|QR 4
|NGI_DE
|Germany
|Jie Tao
|-
|}
 
<!--
Fill the second line of the table replacing the <...> stuff with your data.
-->
 
==1. MEETINGS AND DISSEMINATION ==
<!--
=====GENERAL GUIDELINES FOR ALL EVENTS REPORTED IN THE FOLLOWING SECTIONS:=====
*please do not provide a list of participants, only give the number of people that attended
*for outcome, please list tangible agreements, decisions instead of listing program points or presentations you made. Otherwise put: “-“
*include your local events only if there was any EGI-related topic on the agenda
*provide an indico URL to your presentation (if available) or to the event itself.
**If your presentation is not available online, please send the slides to erika.swiderski@egi.eu.
Note: Complete the tables below by adding as many rows as needed. -->
 
===1.1. CONFERENCES/WORKSHOPS ORGANISED===
 
 
{| border="1" cellspacing="0" cellpadding="2"
<!-- |+Provided by each SA1 partner. Just report events relevant to SA1 activities.-->
|-
!scope="col"|Date||Location||Title||Participants||Outcome (Short report & Indico URL)
|-
|April 6-7, 2011||Karlsruhe, Germany||CSIRT face-to-face meeting||EGI, NGIs security staff||Activity update, Security monitoring plan, security training, discussions
https://www.egi.eu/indico/conferenceDisplay.py?confId=438
|}
 
===1.2. OTHER CONFERENCES/WORKSHOPS ATTENDED===
 
{| border="1" cellspacing="0" cellpadding="2"
<!-- |+Provided by each SA1 partner. Just report events relevant to SA1 activities.-->
|-
!scope="col"|Date||Location||Title||Participants||Outcome (Short report & Indico URL)
<!-- formatting text -->
|-
|March 17, 2011||Göttingen, Germany||dCache workshop||NGI_DE sites (KIT, FZJ, LRZ-LMU, MPPMU, etc.)||Discussion of dCache issues
|-
|April 11-15, 2011||Vilnius, Lithuania||EGI User Forum/EMI technical conference||NGI_DE sites (KIT, FZJ, SCAI, TUDresden-ZIH, MPPMU, etc.)||Presentations, e.g. UNICORE Tutorial; participation in various workshops
|-
|April 13-14, 2011||Helsinki, Finland||DEISA/PRACE symposium||NGI_DE staff of site LRZ-LMU||http://www.deisa.eu/news_press/symposium
|-
|}
 
<!--
Please fill the fields, replacing the <...> sections with your data. You can add a row by copying the two lines:
|<Date>||<Location>||Title||Participants||Outcome
|-
-->
===1.3. PUBLICATIONS===
<!--List all publications as bullet points, detailing: Publication title, author(s), journal title, number/issue, date. Also mention any articles published further to interviews given by members of your activity.-->
 
{| border="1" cellspacing="0" cellpadding="2"
|-
!scope="col"|Publication title||Journal / Proceedings title||align="left" |Journal references<br> ''Volume number<br> Issue<br><br>Pages from - to''||align="left" |Authors ''<br>1.<br>2.<br>3.<br>Et al?''
<!--
|-
|<Publication title>||<Journal/Proceedings>||Vol:<volume number><br>Issue:<Issue><br>Pg: <from> - <to>||1.<Author 1><br>2.<Author2><br>3.<Author3><br> 
(Copy the two rows above to add a new entry in the table)
-->
|}
 
== 2. ACTIVITY REPORT  ==
<!--''Note: just report activities relevant to this Quarter.''-->
 
===2.1. Progress Summary===
<!-- Provide your text below -->
During the reporting period NGI_DE kept the Grid running smoothly in the region. Availability and reliability remained high (96% on average). Operational problems were discussed in the regular coordination meetings. The ROD team worked well. Grid services were kept up to date, and all sites applied the security updates. The sites regularly participated in the Grid operations meetings as well as in the GOCDB regionalization and UNICORE integration task force meetings.
 
Two VO subgroups, /dteam/NGI_DE and /dteam/NGI_CH, were created. Most sites already support these VO subgroups and have passed the test. A new support unit for NGI_DE monitoring was integrated into the helpdesk to handle problems with the common monitoring tools such as Nagios, the dashboard and MyEGI. The regional Nagios monitoring instance now uses the VO subgroup "/ops/NGI/Germany". All NGI_DE and NGI_CH sites have enabled support for this subgroup.
 
===2.2. Main Achievements===
<!--
Provide your text below
-->
The main achievement of NGI_DE in the reporting period was the successful operation of the Grid infrastructure in the entire region. Some sites participated in the Staged Rollout. All sites patched the security vulnerabilities in time. The regional Nagios instance was updated to the latest version, SAM09. New CREAM CEs were installed. Concrete achievements of individual sites are listed below:
 
* MPPMU replaced the monbox, migrated the CREAM CE and sBDII to new hardware, added some WNs and storage systems, and installed 240 TB of disk space. The site is currently moving towards a gLite 3.2 infrastructure on SL(C)5 (until now SLES has been used on the CEs). The migration/upgrade is planned to be finished in May.
* DESY-HH increased storage capacity.
* At TUDresden-ZIH, the production infrastructure ran stably; some minor problems with dCache were fixed. The site also migrated its monbox to glite-APEL and started to support the VO BIOmed in the reporting period.
* Uni-Freiburg installed a set of new worker nodes and a second CREAM CE. It also installed 400 TB of new storage.
* KIT put new CREAM CEs and a new WMS into production. It also successfully updated the top-level BDII and migrated its monbox to glite-APEL. The site also prepared the annual NGI_DE conference in May.
* WUPPERTAL successfully upgraded to the new dCache release 1.9.12-1 (the new "golden release") and upgraded the cluster file system Lustre to version 1.8.4 (SFS 3.2-3).
* FZJ contributed to the UNICORE Integration Task Force and organized the UNICORE Tutorial for resource providers at the EGI User Forum. An early adoption of EMI 1.0 UNICORE services is planned for the next quarter.
* ITWM upgraded their CEs as well as SEs and performed BDII staged rollout.
* BMRZ-Frankfurt updated CEs, WNs, sBDII etc.
* SCAI migrated its site BDII to gLite 3.2, decommissioned glite-wms2, and prepared the migration of VOMS, SE and LFC. Torque/Maui was also upgraded to 2.5.x.
* At LRZ-LMU, a Dell storage server was brought into production in a heterogeneous environment (SLES10 / SLES11); dCache was kept up to date in the current stable release branch (1.9.5). LRZ-LMU also deployed dCache on the new pools and migrated the whole dCache cluster to BDB-based file metadata. A dCache migration to the next golden release and the use of a new high-performance service node (1 TB RAM) for dCache are planned for the near future.
 
===2.3. Issues and mitigation===
 
{| border="1" cellspacing="0" cellpadding="2"
|-
!scope="col"| Issue Description
!scope="col"| Mitigation Description
|-
|ITWM encountered a small problem with the glite-APEL migration.||The problem was solved by replacing the node type “apel” with “glite-apel” for the monbox in GOCDB.
|-
|FZJ: a ticket https://rt.egi.eu/rt/Ticket/Display.html?id=975 is still open.||A solution is needed for properly adding UNICORE services to the EGI infrastructure.
|-
|SCAI: with the default configuration, slapd on the gLite 3.2 bdii_site sometimes got stuck at 100% CPU and stopped answering requests.||As suggested on the LCG-ROLLOUT mailing list, a change in the cache parameters seems to have helped.
|-
|MPPMU encountered problems getting its new CREAM node into a pre-production system. The goal is to first have a pre-production setup in which the site can run tests, including some standard Nagios tests, and thus verify that everything works before the system is put into production.||The problem is being observed further in order to find a solution.
|}
 
<!--
Please fill the table below. You can add a line by copying the two lines
| Issue Description || Issue mitigation
-->

Latest revision as of 11:29, 10 May 2011
