Changes between Version 40 and Version 41 of rdfstore


Ignore:
Timestamp:
Oct 29, 2012, 5:45:04 AM (12 years ago)
Author:
simonp
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • rdfstore

    v40 v41  
    33|| '''Date''' || 2012-10-28 ||
    44|| '''Contact(s)''' || Simon Pigot ||
    5 || '''Last edited''' || ||
     5|| '''Last edited''' || 2012-10-29 ||
    66|| '''Status''' || draft, being discussed, in progress ||
    77|| '''Assigned to release''' || Not yet assigned to a release ||
     
    6565TBA
    6666
    67 === Issues ===
     67=== Issues with current patch ===
    6868
    69 Object Identifiers: One of the stated key advantages of RDF is that objects are identified once and then reused. In the work done to date, I don't see how converting a record to RDF will identify the individual objects for reuse eg. if a piece of contact info is present in two different metadata records, then how is that object uniquely identified? Perhaps the object identifier could be derived from an md5sum on the content of the object?
     69==== XML2RDF ====
     70
     71Object Identifiers: One of the stated key advantages of RDF is that objects are identified once and then reused. In the work done to date, I can see that Responsible Parties are assigned identifiers based on the name of the responsible person . Presumably the RDF store would refuse to ingest the same responsible party if it was repeated in another record. But there are a few problems with this approach:
     72 - Name of responsible party may not be present (it's optional) or may be hidden
     73 - The RDF URI is hardcoded to use a CSIRO URL - probably should be GeoNetwork or some other URL
     74Perhaps the object identifier could be derived from an md5sum on the content of the object or it may use the XLink URL if the fragment being processed is XLinked.
     75
     76Doesn't support XLinked records?
     77
     78Patch uses Apache JENA RDF store but GeoNetwork already uses the OpenRDF-Sesame RDF store for its SKOS vocabularies - would be better to use one of these RDF stores, not both? 
     79
     80When storing metadata records in the database table, they are allocated an integer ID in DataManager. I need to check that the DataManagerRDF code continues this practice otherwise the permissions system and other parts of GeoNetwork will not be able to manage the record properly.
     81
     82==== Implications for wider adoption of RDF Store and Encoding in GeoNetwork ====
    7083
    7184Profile support in ISO19115 mapping: introduce additional rdf namespaces/concepts?
    72 
    73 Relationship to DCAT proposal?
    7485
    7586Speed of RDF triple stores versus Lucene? Free text search in Apache JENA RDF triple store/sparql queries is supported by using Lucene to help - see LARQ sub-project: http://jena.apache.org/documentation/larq/index.html
     
    7788Spatial searching: At present we do mixed spatial and textual searches for OGC CSW query support by filtering Lucene searches with query results from spatial database via GeoTools. How would this work in SPARQL? OGC GeoSPARQL would be part of the approach here I suppose: http://code.google.com/p/geospatialweb/ How mature is the GeoSPARQL implementation for Apache JENA or OpenRDF-sesame? (see links section: these projects don't look like they are mainstream parts of RDF frameworks yet?). Also, we use GeoTools to parse the OGC filter language - how would this be converted to a GeoSPARQL query? (hmm).
    7889
    79 Two RDF triple stores now used in !GeoNetwork: OpenRDF/sesame and now Apache JENA?
    8090
    81 Need to tie metadata permissions system used in GeoNetwork (in database) to record in RDF triple store.
     91Relationship to DCAT proposal?
    8292
    8393=== Backwards Compatibility Issues ===