Changes between Version 1 and Version 2 of loadingxmlcontentcharsetdetection


Ignore:
Timestamp:
Oct 7, 2011, 2:45:06 AM (13 years ago)
Author:
simonp
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • loadingxmlcontentcharsetdetection

    v1 v2  
    1818 * '''Module''': Jeeves
    1919
    20 === Links ===
    21  * '''Documents''':
    22  * '''Email discussions''':
    23  * '''Other wiki discussions''':
    24 
    2520=== Voting History ===
    26  * Vote proposed by X on Y, result was +/-n (m non-voting members).
     21 * Vote proposed by Simon on 2011/10/07, result was +/-n (m non-voting members).
    2722
    2823----
    2924
    3025== Motivations ==
    31 The current configuration is .... A change to ....
     26
     27GeoNetwork should be able to load and convert XML files that contain characters from character sets other than UTF-8. For example, loading a file with characters from the WINDOWS-1252 charset causes batch import to fail with a message like:
     28
     29
    3230
    3331== Proposal ==
    34 An in depth proposal can be found here : link
    35 ...
     32
     33jeeves.utils.Xml - loadFile method needs to be modified to read the file as a stream of bytes, detect the character set and convert to UTF-8 as required. By default this character set detection capability is enabled by setting the java system property jeeves.filecharsetdetectandconvert. 
    3634
    3735=== Backwards Compatibility Issues ===
    3836
     37None because character set detection and conversion can be disabled on startup by setting the java system property jeeves.filecharsetdetectandconvert to disabled. eg. export JAVA_OPTS="-Djeeves.filecharsetdetectandconvert=disabled" if using tomcat or by editing bin/start-geonetwork.sh for jetty.
     38
    3939=== New libraries added ===
    40 Explain which and why new libraries are required for that proposal ...
     40
     41juniversalchardet - character set detection jar
    4142
    4243== Risks ==
    4344
     45None known.
     46
    4447== Participants ==
    45  * List of participants and role (if necessary) in current GIP
     48 * Simon Pigot