Alert.png The wiki is deprecated and due to be decommissioned by the end of September 2022.
The content is being migrated to other supports, new updates will be ignored and lost.
If needed you can get in touch with EGI SDIS team using operations @ egi.eu.

Difference between revisions of "Agenda-09-01-2017"

From EGIWiki
Jump to navigation Jump to search
 
(16 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{TOC right}}  
{{Template:Op menubar}} {{Template:Doc_menubar}} {{TOC_right}}
[[Category:Grid Operations Meetings]]


= General information =
= General information =
Line 8: Line 9:
= UMD/CMD/Preview  =
= UMD/CMD/Preview  =


* CMD-OS (OpenStack) released http://repository.egi.eu/category/os-distribution/cmd-os-1/
** Keystone-VOMS 9.0.3
** ooi 0.3.2
** gridsite 2.3.3
** Cloud BDII Information provider 0.6.12
* Xrootd in EPEL-testing ( 4.5.0) looking for sites to test it
* Xrootd in EPEL-testing ( 4.5.0) looking for sites to test it
* Update to frontier-squid-3 in UMD4
** major upgrade and it has some incompatibilities with frontier-squid-2 based versions, as detailed here: https://twiki.cern.ch/twiki/bin/view/Frontier/InstallSquid#Upgrading
** https://ggus.eu/index.php?mode=ticket_info&ticket_id=125691


== Preview repository ==
== Preview repository ==
Released on 2016-12-22
* '''[[Preview 1.7.0]]''' [https://appdb.egi.eu/store/software/preview.repository/releases/1.0/1.7.0/ AppDB info] (sl6): ARC 15.03 update 11, DPM 1.9.0, FTS3 3.5.7, glite-yaim-core-5.1.4-1
* '''[[Preview 2.7.0]]''' [https://appdb.egi.eu/store/software/preview.repository/releases/2.0/2.7.0/ AppDB info] (CentOS 7): ARC 15.03 update 11, DPM 1.9.0, FTS3 3.5.7, glite-yaim-core-5.1.4-1
Note: EGI provides the preview repository '''without any additional quality assurance process''', but the products are released as they are provided by the product team. '''EGI recommends the use of the UMD repositories''', which contain software verified through the quality assurance process of UMD.


= Operations =
= Operations =


== Update to frontier-squid-3 in UMD4 ==
== Feedback from Helpdesk ==
 
*  [2016-12-13] Services using JGlobus fail with RFC proxies from certificates from some CAs
* major upgrade and it has some incompatibilities with frontier-squid-2 based versions, as detailed here: https://twiki.cern.ch/twiki/bin/view/Frontier/InstallSquid#Upgrading
** Affecting  dCache < v2.14, BeStMan
* https://ggus.eu/index.php?mode=ticket_info&ticket_id=125691
** Services using JGlobus fail with RFC proxies having Non-Repudiation key usage flag set, e.g. those created by usual voms-proxy-init from Grid Canada certificate
** https://ggus.eu/?mode=ticket_info&ticket_id=124650


== IPv6 readiness plans ==
== IPv6 readiness plans ==
Line 57: Line 73:


== Monthly Availability/Reliability ==
== Monthly Availability/Reliability ==
*Underperformed sites in the past A/R reports with issues not yet fixed:
** '''AsiaPacific''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=125427 GGUS 125427]
*** MY-USM-GCL, TW-NCUHEP
** '''NGI_DE''' [https://ggus.eu/index.php?mode=ticket_info&ticket_id=123836 GGUS 123836]
***TUDresden-ZIH: set-up a new CREAM-CE, the CA probes were failing. Issues on SRM service. Proposed the suspension.
**'''NGI_NL''': [https://ggus.eu/?mode=ticket_info&ticket_id=123532 GGUS 123532]
***BelGrid-UCL: UNKNOWN status returned by CREAM probes, asked a recomputation
*Sites suspended after past A/R reports:
**SZDG (IDGF)
*Underperformed sites after 3 consecutive months and underperformed NGIs:
**NGI_HR: low A/R overall figures, mainly due to a site [https://ggus.eu/index.php?mode=ticket_info&ticket_id=125837 GGUS 125837]
**NGI_UA: UA-NSCMBR [https://ggus.eu/index.php?mode=ticket_info&ticket_id=125839 GGUS 125839]


== ARGO proposal to use GOCDB as the only source of topology information ==
== ARGO proposal to use GOCDB as the only source of topology information ==
Line 72: Line 103:
** By Q2 2017: support for multiple service endpoints
** By Q2 2017: support for multiple service endpoints


== VAPOR (TO BE UPDATED) ==
== VAPOR ==


* [http://operations-portal.egi.eu/vapor/releases VAPOR 2.1] released in September, it replaces GSTAT
* [http://operations-portal.egi.eu/vapor/releases VAPOR 2.1] released in September, it replaced GSTAT
* important for presenting the amount of computing and storage resources of the infrastructure
* important for presenting the amount of computing and storage resources of the infrastructure
* each NGI should review the information provided by their sites and let us know any inconsistency: http://operations-portal.egi.eu/vapor/resources/GL2ResSummary
** we need your feedback to improve the service
** some known issues will be fixed in the next release
* new version 2.2 is about to be released:
* new version 2.2 is about to be released:
** please test it going on the dev instance http://operations-portal.egi.eu/vapor_dev
** please test it going on the dev instance http://operations-portal.egi.eu/vapor_dev
** working on some known issues of the previous version
* each NGI should review the information provided by their sites and let us know any inconsistency: https://operations-portal.egi.eu/vapor_dev/resources/GL2ResSummary
** we need your feedback to improve the service
** report any comment into https://ggus.eu/index.php?mode=ticket_info&ticket_id=124872
** report any comment into https://ggus.eu/index.php?mode=ticket_info&ticket_id=124872


Line 87: Line 118:
== Next meeting ==
== Next meeting ==


* '''Jan 9th, 2016''' https://indico.egi.eu/indico/event/3139/
* '''Feb 13th, 2016''' https://indico.egi.eu/indico/event/3140/
* '''new calendar available until June 2017''' https://indico.egi.eu/indico/category/32/
* '''new calendar available until June 2017''' https://indico.egi.eu/indico/category/32/

Latest revision as of 15:27, 25 October 2017

Main EGI.eu operations services Support Documentation Tools Activities Performance Technology Catch-all Services Resource Allocation Security


Documentation menu: Home Manuals Procedures Training Other Contact For: VO managers Administrators


General information

UMD/CMD/Preview

Preview repository

Released on 2016-12-22

Note: EGI provides the preview repository without any additional quality assurance process, but the products are released as they are provided by the product team. EGI recommends the use of the UMD repositories, which contain software verified through the quality assurance process of UMD.

Operations

Feedback from Helpdesk

  • [2016-12-13] Services using JGlobus fail with RFC proxies from certificates from some CAs

IPv6 readiness plans

Decommissioning of mon.egi.eu

  • on December 6th 2016 ARGO team decommissioned the old SAM GridMon box mon.egi.eu, housing central ATP and POEM.
    • VO SAM instances not be affected as they are using local ATP and POEM.
    • Remaining NGI SAM instances rely on central ATP and will no longer get topology updates, so this gives you extra incentive to decommission them.

cloudmon.egi.eu dismissed

Software upgrades for OpenStack cloud RCs (TO BE UPDATED)

  • keystone-VOMS and cloud-info-provider updates available, need to be installed on all OpenStack sites
  • as keystone-VOMS last version is only compatible with Liberty and Mitaka, in case OpenStack is Kilo (or older) an upgrade plan of OpenStack has been asked
  • according to EGI policies https://wiki.egi.eu/wiki/PROC16_Decommissioning_of_unsupported_software OpenStack Kilo or older should NOT be running on the infrastructure! we are asking for discussing this point at the next OMB (October 27)
  • as many sites are finding difficulties in planning upgrades against the very tight release cycle of OpenStack, please come with suggestion and reply with details in the tickets in order to shape the best (shared) proposal
  • ticket campaign ONGOING for all OpenStack sites, asked to upgrade to keystone-VOMS >=8.0.3, cloud-info-provider >=0.6, and plans for the future (OpenStack version currently deployed, plans for upgrades, usual specific RC upgrade schedule), UPDATE:
    • INDIGO-CATANIA-STACK and INFN-CATANIA-STACK moving to Mitaka (no plan)
    • IISAS-GPUCloud Liberty
    • FZJ user isolation bug fixed in Newton, not in Mitaka (investigating about a backport to Mitaka), waiting for solution
    • IN2P3-IRES Mitaka
    • CETA-GRID using Icehouse, planning mid-term upgrade (Newton?)
    • IISAS-FedCloud Mitaka from Ubuntu 16.04 LTS installed
    • BIFI upgrading to Mitaka
    • SCAI upgraded to Mitaka
    • INFN-PADOVA-STACK FIXED using Liberty
    • IFCA-LCG2 using Liberty
    • CYFRONET-CLOUD running Juno, evaluating Mitaka
    • TR-FC1-ULAKBIM FIXED using Liberty
    • NCG-INGRID-PT, using Mitaka, up to date
    • RECAS-BARI preparing upgrade to Mitaka from Ubuntu 16.04 LTS (deadline by end of year)
    • 100IT Liberty (evaluating Mitaka)

Monthly Availability/Reliability

  • Underperformed sites in the past A/R reports with issues not yet fixed:
    • AsiaPacific GGUS 125427
      • MY-USM-GCL, TW-NCUHEP
    • NGI_DE GGUS 123836
      • TUDresden-ZIH: set-up a new CREAM-CE, the CA probes were failing. Issues on SRM service. Proposed the suspension.
    • NGI_NL: GGUS 123532
      • BelGrid-UCL: UNKNOWN status returned by CREAM probes, asked a recomputation
  • Sites suspended after past A/R reports:
    • SZDG (IDGF)
  • Underperformed sites after 3 consecutive months and underperformed NGIs:

ARGO proposal to use GOCDB as the only source of topology information

  • Timescale:
    • New GOC-DB release on Dec 7th including a boolean ‘monitored’ flag for the service endpoints
    • Then creation of a web UI view for uncertified sites in ARGO
    • Uncertified sites will be asked to fill in the service endpoints information. Follow the How to add URL service endpoint information into GOC-DB
    • As information is added in the GOCDB, uncertified sites/services will be picked up by the ARGO Monitoring Engine and they will start to be monitored
    • By Q2 2017: support for multiple service endpoints

VAPOR

AOB

Next meeting