develooper Front page | perl.perl4lib | Postings from June 2006

MARC::Charset problem

Thread Next
From:
moconnor59
Date:
June 22, 2006 05:18
Subject:
MARC::Charset problem
Message ID:
1150968872.749367.237550@r2g2000cwb.googlegroups.com
Hi,

I'm using MARC::Charset::marc8_to_utf8() v0.95 to transcode some
Library of Congress data to utf8, however I'm finding a problem with
character 'ΓΈ' (hex 0xB2 - lowercase scandinavian o / latin small
letter o with stroke), this character is transcoding to 0xF8 - which is
not valid utf8 - when it should transcode to 0xC3B8. (According to the
documentation, 0xF8 seems to be the ucs transcoding of this character).

Is this a bug in MARC::Charset or am I missing something?

Thanks,

Michael


Thread Next


nntp.perl.org: Perl Programming lists via nntp and http.
Comments to Ask Bjørn Hansen at ask@perl.org | Group listing | About