Changes between Version 1 and Version 2 of loadingxmlcontentcharsetdetection
- Timestamp:
- 10/07/11 02:45:06 (13 years ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
loadingxmlcontentcharsetdetection
v1 v2 18 18 * '''Module''': Jeeves 19 19 20 === Links ===21 * '''Documents''':22 * '''Email discussions''':23 * '''Other wiki discussions''':24 25 20 === Voting History === 26 * Vote proposed by X on Y, result was +/-n (m non-voting members).21 * Vote proposed by Simon on 2011/10/07, result was +/-n (m non-voting members). 27 22 28 23 ---- 29 24 30 25 == Motivations == 31 The current configuration is .... A change to .... 26 27 GeoNetwork should be able to load and convert XML files that contain characters from character sets other than UTF-8. For example, loading a file with characters from the WINDOWS-1252 charset causes batch import to fail with a message like: 28 29 32 30 33 31 == Proposal == 34 An in depth proposal can be found here : link 35 ... 32 33 jeeves.utils.Xml - loadFile method needs to be modified to read the file as a stream of bytes, detect the character set and convert to UTF-8 as required. By default this character set detection capability is enabled by setting the java system property jeeves.filecharsetdetectandconvert. 36 34 37 35 === Backwards Compatibility Issues === 38 36 37 None because character set detection and conversion can be disabled on startup by setting the java system property jeeves.filecharsetdetectandconvert to disabled. eg. export JAVA_OPTS="-Djeeves.filecharsetdetectandconvert=disabled" if using tomcat or by editing bin/start-geonetwork.sh for jetty. 38 39 39 === New libraries added === 40 Explain which and why new libraries are required for that proposal ... 40 41 juniversalchardet - character set detection jar 41 42 42 43 == Risks == 43 44 45 None known. 46 44 47 == Participants == 45 * List of participants and role (if necessary) in current GIP48 * Simon Pigot