In order to read the text correctly I have (at a suggestion from somewhere on the list) used the uniencode/unidecode construction, as in:
Code:
-- the item is a text file, so extract its contents into the variable temp
revZipExtractItemToVariable pArchive, sOPS & aMan[sNItemRef]["href"], "temp"
-- process the html: convert the UTF-8 data to the native encoding
put unidecode(uniencode(temp, "UTF8")) into temp
put temp into aMan[sNItemRef]["content_html"]
I tried removing the "unidecode(uniencode(temp, "UTF8"))" line, but that didn't work: it gave me errors in the XML.
So I don't know whether this construction is now wrong, or whether the zip or XML libraries have changed.
First things first. Is "unidecode(uniencode(temp, "UTF8"))" still the right way to go, or should I be doing something else?