Opened 6 months ago

Closed 5 months ago

#3116 closed defect (fixed)

repos.osgeo.org needs admin attention

Reported by: peterafrigis Owned by: jive
Priority: major Milestone: Unplanned
Component: SysAdmin/Repo Keywords: repo
Cc: jive

Description

Automated GeoServer builds are failing for the past 3-4 days with the error message:

Error: 4,838 [ERROR] Failed to execute goal on project gs-grib: Could not resolve dependencies for project org.geoserver.extension:gs-grib:jar:2.25-SNAPSHOT: Failed to collect dependencies at org.geotools:gt-grib:jar:31-SNAPSHOT -> edu.ucar:cdm:jar:4.6.15 -> edu.ucar:httpservices:jar:4.6.15: Failed to read artifact descriptor for edu.ucar:httpservices:jar:4.6.15: The following artifacts could not be resolved: edu.ucar:httpservices:pom:4.6.15 (absent): Could not transfer artifact edu.ucar:httpservices:pom:4.6.15 from/to osgeo-releases (https://repo.osgeo.org/repository/release/): status code: 500, reason phrase: Server Error (500) -> [Help 1]

Jody believes one of the repos is having problems; we access via a cache at repos.osgeo.org

Could the repo admin please take a look at the problem to resolve it?

If you need to contact a GeoServer developer to help test, https://app.gitter.im/#/room/#geoserver_geoserver:gitter.im or geoserver-devel@… or me personally: gs@…

Thank you

Peter

Change History (11)

comment:1 by jive, 6 months ago

Andrea collected some details - with the conclusion that it may be corrupted storage on the repo server? He indicated clearing the cache and allowing it to redownload the artifacts did help.

Checking the pom.xml file causing problems:

So I expect this is a problem with repo.osgeo.org storage / cache.

in reply to:  1 comment:2 by robe, 6 months ago

Replying to jive:

Andrea collected some details - with the conclusion that it may be corrupted storage on the repo server? He indicated clearing the cache and allowing it to redownload the artifacts did help.

Checking the pom.xml file causing problems:

So I expect this is a problem with repo.osgeo.org storage / cache.

@jive,

Is this something you need help with or you have it under control?

comment:3 by robe, 5 months ago

Cc: jive added
Component: SysAdminSysAdmin/Repo
Owner: changed from sac-tickets@… to jive

@jive,

Okay I guess it's best we continue on here instead of the discourse feed. So as you mentioned, the last successful run was Jan 1st

https://build.geoserver.org/view/geotools/job/geotools-main/

So I think that rules out the major upgrade I did on #2932 as that was 3 weeks ago.

I'm going to check the errors in the logs and do a more intensive disk check if I can't determine from the logs. Might also be something about how docker volumes work that something is locked and maybe the container just needs a reboot.

comment:4 by robe, 5 months ago

@jive,

One more question, from the above, I'm not sure how to tell if anything is even wrong.

Can you give me some command or web call I can use to replicate the issue, all the above links when I go there show something, so I think I'm missing something.

comment:5 by robe, 5 months ago

I do see here - https://repo.osgeo.org/#admin/system/tasks that

System - Repository Health Check: ucar-cache

has been failing. The other cache healthchecks return okay. So I guess that gets to your point about the ucar-cache being corrupt. Though that job doesn't say how long it's been failing and it runs every hour.

I also do notice it looks like the docker container restarted 2 weeks ago, and I can't remember restarting it, though it's possible I did. So perhaps something happened during that restart or it restarted on it's own because of some corruption.

comment:6 by jive, 5 months ago

To check:

  1. Visit to artifact for which a problem is reported: https://repo.osgeo.org/#browse/browse:ucar-cache:edu%2Fucar%2Fhttpservices%2F4.6.15%2Fhttpservices-4.6.15.pom
  1. Click on the path download link
  1. Success is a small text file; failure is 500 error
Last edited 5 months ago by jive (previous) (diff)

comment:7 by jive, 5 months ago

Last edited 5 months ago by jive (previous) (diff)

comment:8 by peterafrigis, 5 months ago

I am attempting to follow along. The "download link" is on the right hand side, under Summary > Path > click this link (e.g. https://repo.osgeo.org/repository/ucar-cache/edu/ucar/httpservices/4.6.15/httpservices-4.6.15.pom)

I still get a 500 Internal Server error

comment:9 by jive, 5 months ago

I now see a ucar-all-cache (so maybe someone deleted and remade the cache to point to https://artifacts.unidata.ucar.edu/repository/unidata-all/ )?

It still as "unidata-artifacts" routing rule so will only be used to cache "/edu/ucar/.*"

The cache does not yet contain that artifact; running https://build.geoserver.org/view/geotools/job/geotools-main/ to see if it will use the cache correctly and fetch the expected artifact.

Last edited 5 months ago by jive (previous) (diff)

comment:10 by jive, 5 months ago

Okay build is now managing netCDF module so I think this can be closed.

I found that Andrea had done the change described above (thanks Andrea)

comment:11 by jive, 5 months ago

Resolution: fixed
Status: newclosed
Note: See TracTickets for help on using tickets.