Opened 15 years ago
Closed 15 years ago
#162 closed defect (fixed)
CSW Harvest in 2.4.2 issues
Reported by: | ddnebert | Owned by: | josegar74 |
---|---|---|---|
Priority: | major | Milestone: | v2.4.3 |
Component: | General | Version: | v2.4.2 |
Keywords: | harvest, csw, native | Cc: |
Description
We set up an instance of the server attempted to register two remote catalogs for direct and full periodic harvesting. One is the same version of GeoNetwork, the other is a CSW 2.0.2 solution which is also queryable by CSW externally. The harvest of the other GeoNetwork node appears to work, though there is no feature to select all records, to issue a Null Query (standard practice to get all), or to specify a global BBOX - another trick to fetching all records.
For the CSW 2.0.2 (ebRIM Profile and baseline), I included the service URL: http://geossregistries.info:1090/GEOSSCSW202/discovery?Request=GetCapabilities&Service=CSW&Version=2.0.2 , two search criteria blocks "component" and "service" (without quotes). I set Groups to All and selected all three checkboxes and selected other information resources as the category.
I selected "Run" and get a java error "raised exception when searching" when I hover over the warning icon. This is a less-than-helpful diagnostic. How do I successfully harvest this CSW?
Change History (5)
follow-up: 2 comment:1 by , 15 years ago
Milestone: | → v2.5.0 |
---|---|
Version: | → v2.4.2 |
comment:2 by , 15 years ago
Owner: | changed from | to
---|
comment:3 by , 15 years ago
Milestone: | v2.5.0 → v2.4.2 |
---|---|
Owner: | changed from | to
comment:4 by , 15 years ago
Owner: | changed from | to
---|---|
Status: | new → assigned |
comment:5 by , 15 years ago
Resolution: | → fixed |
---|---|
Status: | assigned → closed |
The http://geossregistries.info:1090/GEOSSCSW202/discovery?Request=GetCapabilities&Service=CSW&Version=2.0.2 server seem having some issues:
- When requested summary records returns a full record (fixed in harvester to manage full record)
- Seem always returning all records independent of filter. For example:
<ogc:Filter xmlns:ogc="http://www.opengis.net/ogc">
<ogc:PropertyIsEqualTo>
<ogc:PropertyName>dc:title</ogc:PropertyName> <ogc:Literal>non valid title</ogc:Literal>
</ogc:PropertyIsEqualTo>
</ogc:Filter>
returns 348 matches. Any value used in title returns same number of records.
- The metadata returned is not correctly escaped: some metadata contains unescaped characters like '&' in abstract, what is causing errors in saxon
Summarizing, fixed Harvester.getRecordInfo to manage full record if returned by the server instead of summary record. The rest seem a problem of this CSW server.
Replying to ddnebert:
Here is the console output of the CSW harvest error:
912252 [Thread-122] WARN geonetwork.harvester - Raised exception when searching
ust be terminated by the matching end-tag "</link>". 912252 [Thread-122] WARN geonetwork.harvester - Raised exception while harvesti ng from : Air Quality Community CSW (CswHarvester) 912252 [Thread-122] WARN geonetwork.harvester - (C) Class : OperationAborted Ex 912252 [Thread-122] WARN geonetwork.harvester - (C) Message : Raised exception
OperationAbortedEx : Raised exception when searching
ter.java:397)
r.java:223)
er.java:94)
wHarvester.java:211)
tractHarvester.java:379)
)