Changes between Version 40 and Version 41 of rdfstore
- Timestamp:
- Oct 29, 2012, 5:45:04 AM (12 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
rdfstore
v40 v41 3 3 || '''Date''' || 2012-10-28 || 4 4 || '''Contact(s)''' || Simon Pigot || 5 || '''Last edited''' || ||5 || '''Last edited''' || 2012-10-29 || 6 6 || '''Status''' || draft, being discussed, in progress || 7 7 || '''Assigned to release''' || Not yet assigned to a release || … … 65 65 TBA 66 66 67 === Issues ===67 === Issues with current patch === 68 68 69 Object Identifiers: One of the stated key advantages of RDF is that objects are identified once and then reused. In the work done to date, I don't see how converting a record to RDF will identify the individual objects for reuse eg. if a piece of contact info is present in two different metadata records, then how is that object uniquely identified? Perhaps the object identifier could be derived from an md5sum on the content of the object? 69 ==== XML2RDF ==== 70 71 Object Identifiers: One of the stated key advantages of RDF is that objects are identified once and then reused. In the work done to date, I can see that Responsible Parties are assigned identifiers based on the name of the responsible person . Presumably the RDF store would refuse to ingest the same responsible party if it was repeated in another record. But there are a few problems with this approach: 72 - Name of responsible party may not be present (it's optional) or may be hidden 73 - The RDF URI is hardcoded to use a CSIRO URL - probably should be GeoNetwork or some other URL 74 Perhaps the object identifier could be derived from an md5sum on the content of the object or it may use the XLink URL if the fragment being processed is XLinked. 75 76 Doesn't support XLinked records? 77 78 Patch uses Apache JENA RDF store but GeoNetwork already uses the OpenRDF-Sesame RDF store for its SKOS vocabularies - would be better to use one of these RDF stores, not both? 79 80 When storing metadata records in the database table, they are allocated an integer ID in DataManager. I need to check that the DataManagerRDF code continues this practice otherwise the permissions system and other parts of GeoNetwork will not be able to manage the record properly. 81 82 ==== Implications for wider adoption of RDF Store and Encoding in GeoNetwork ==== 70 83 71 84 Profile support in ISO19115 mapping: introduce additional rdf namespaces/concepts? 72 73 Relationship to DCAT proposal?74 85 75 86 Speed of RDF triple stores versus Lucene? Free text search in Apache JENA RDF triple store/sparql queries is supported by using Lucene to help - see LARQ sub-project: http://jena.apache.org/documentation/larq/index.html … … 77 88 Spatial searching: At present we do mixed spatial and textual searches for OGC CSW query support by filtering Lucene searches with query results from spatial database via GeoTools. How would this work in SPARQL? OGC GeoSPARQL would be part of the approach here I suppose: http://code.google.com/p/geospatialweb/ How mature is the GeoSPARQL implementation for Apache JENA or OpenRDF-sesame? (see links section: these projects don't look like they are mainstream parts of RDF frameworks yet?). Also, we use GeoTools to parse the OGC filter language - how would this be converted to a GeoSPARQL query? (hmm). 78 89 79 Two RDF triple stores now used in !GeoNetwork: OpenRDF/sesame and now Apache JENA?80 90 81 Need to tie metadata permissions system used in GeoNetwork (in database) to record in RDF triple store. 91 Relationship to DCAT proposal? 82 92 83 93 === Backwards Compatibility Issues ===