Version 14 (modified by kliu, 13 years ago)

Proposal title

Date 2011/09/20
Contact(s) K. Liu (GMU), D. Nebert (USGS/FGDC), A. Warnock (A/WWW Enterprises), C. Yang (GMU)
Last edited
Status complete
Assigned to release 2.7.x
Resources Indicate if the required resources are available to complete the proposal
Ticket # #XYZ

Overview

The purpose of this proposal is to allow users to harvest metadata from a WAF catalog. In particular, the WAF may contain XML metadata files or OGC GetCapabilities documents.

...

Proposal Type

  • Type: GUI Change, Core Change, Module Change
  • App: GeoNetwork
  • Module: Harvester, Kernel, Harvest Interface
  • Documents:
  • Email discussions:
  • Other wiki discussions:

Voting History

  • Vote proposed by X on Y, result was +/-n (m non-voting members).

Motivations

Some catalogs are published as a WAF. A Web Accessible Folder (WAF) is an HTTP-accessible directory of files, typically metadata files in XML format, in which all files are visible to a web browser or client. Crawlers can parse the file listings and provide a search interface over these documents. In addition, some WAFs contain OGC service GetCapabilities documents, for example: http://mrdata.usgs.gov/wms.html, http://mrdata.usgs.gov/wfs.html

To let users retrieve metadata from WAF catalogs, we added a WAF harvester to the current GeoNetwork.

Proposal

We extend the existing WebDAV server harvester into a combined Web Accessible Folder / WebDAV server harvester. To enable WAF harvesting, a subtype list box is added to the Web Accessible Folder / WebDAV server harvest management page. If the harvesting URL points to a WAF, administrators can choose the WAF harvest subtype. Some example WAF URLs:

  • http://capita.wustl.edu/DataspaceMetadata_ISO/
  • http://mrdata.usgs.gov/wms.html
  • http://mrdata.usgs.gov/wfs.html

...

Backwards Compatibility Issues

New libraries added

jsoup.jar is used to parse the WAF page.
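To give a rough idea of what parsing a WAF page involves, here is a minimal sketch that pulls the links to XML files out of an HTML directory listing and resolves them against the folder URL. It is not the harvester's actual code: the class name WafListingSketch, the method extractXmlLinks, and the example.org URL are hypothetical, and it uses java.util.regex from the standard library instead of jsoup only so that it runs without extra dependencies. A real HTML parser such as jsoup is more robust against irregular markup.

```java
import java.net.URI;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical illustration; not part of GeoNetwork.
public class WafListingSketch {

    // Captures the href value of anchor tags. This is a simplification:
    // it assumes quoted attribute values and reasonably regular markup.
    private static final Pattern HREF = Pattern.compile(
            "<a\\s[^>]*?href\\s*=\\s*[\"']([^\"']+)[\"']",
            Pattern.CASE_INSENSITIVE);

    /** Returns the absolute URLs of all links in the listing that end in ".xml". */
    public static List<String> extractXmlLinks(String listingHtml, String baseUrl) {
        URI base = URI.create(baseUrl);
        List<String> links = new ArrayList<>();
        Matcher m = HREF.matcher(listingHtml);
        while (m.find()) {
            String href = m.group(1);
            if (!href.toLowerCase(Locale.ROOT).endsWith(".xml")) {
                continue; // skip parent-directory links, sort links, non-XML files
            }
            try {
                // Resolve relative links against the folder URL.
                links.add(base.resolve(href).toString());
            } catch (IllegalArgumentException e) {
                // href is not a valid URI (e.g. contains spaces); ignore it in this sketch
            }
        }
        return links;
    }

    public static void main(String[] args) {
        // A made-up listing resembling a typical web server directory index.
        String html = "<html><body>"
                + "<a href=\"../\">Parent Directory</a>"
                + "<a href=\"record1.xml\">record1.xml</a>"
                + "<a href=\"readme.txt\">readme.txt</a>"
                + "</body></html>";
        for (String link : extractXmlLinks(html, "http://example.org/waf/")) {
            System.out.println(link);
        }
    }
}
```

Each URL returned this way would then be fetched and handed to the metadata import step. A WAF entry that is an OGC GetCapabilities document rather than a plain metadata file would need separate handling, which this sketch does not attempt.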

Risks

Participants

  • Kai Liu (GMU)
  • Douglas D. Nebert (USGS/FGDC)
  • A. Warnock (A/WWW Enterprises)
  • Chaowei Yang (GMU)
