photools.com Community

IMatch Discussion Boards => General Discussion and Questions => Topic started by: RobiWan on November 24, 2017, 09:13:19 PM

Title: Thesaurus import with german umlauts
Post by: RobiWan on November 24, 2017, 09:13:19 PM
Hello,

Has anyone tried to import a thesaurus that contains German umlauts? The import itself works for me, but the umlauts all come out broken.

Title: Re: Thesaurus import with german umlauts
Post by: thrinn on November 24, 2017, 09:24:42 PM
Works for me. Just make sure that your input file is properly encoded (UTF-8).
Title: Re: Thesaurus import with german umlauts
Post by: sinus on November 25, 2017, 08:45:37 AM
Quote from: thrinn on November 24, 2017, 09:24:42 PM
Works for me. Just make sure that your input file is properly encoded (UTF-8).

Yes, correct, me too.
Title: Re: Thesaurus import with german umlauts
Post by: RobiWan on November 25, 2017, 10:33:43 AM
Quote from: thrinn on November 24, 2017, 09:24:42 PM
Just make sure that your input file is properly encoded (UTF-8).

And how can I do that? In LR I can only select "Export Keywords", and LR creates a simple text file. I can open this file in editors like Notepad and all umlauts are displayed correctly.

Title: Re: Thesaurus import with german umlauts
Post by: Mario on November 25, 2017, 11:05:00 AM
Save it in Windows Notepad (File > Save As) and choose the encoding "UTF-8" at the bottom of the dialog.
Usually Lr saves the file with UTF-8 encoding automatically, but maybe not in your case.
Title: Re: Thesaurus import with german umlauts
Post by: RobiWan on November 25, 2017, 12:44:49 PM
I can't believe it. I have tried it three times, on Windows 10 and OS X, each time with the latest Classic CC version.
Title: Re: Thesaurus import with german umlauts
Post by: Mario on November 25, 2017, 01:54:15 PM
Lr does not write a UTF-8 BOM to mark the file as UTF-8 encoded. IMatch therefore assumes it is ANSI-encoded.
I will change that to assume UTF-8 instead for the next release.
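
To illustrate why the missing BOM breaks the umlauts, here is a minimal Python sketch of BOM-based detection with an ANSI fallback. It is not IMatch's actual code: the function name `decode_by_bom` is hypothetical, and Windows-1252 (`cp1252`) is only assumed as the ANSI code page.

```python
import codecs

def decode_by_bom(data: bytes, fallback: str = "cp1252") -> str:
    """Pick the encoding from the BOM; without a BOM, assume an ANSI code page.

    Hypothetical helper; cp1252 as the ANSI code page is an assumption.
    """
    if data.startswith(codecs.BOM_UTF8):
        # Strip the three BOM bytes, then decode the rest as UTF-8.
        return data[len(codecs.BOM_UTF8):].decode("utf-8")
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec reads and removes the BOM itself.
        return data.decode("utf-16")
    # No BOM: treat the bytes as ANSI, which misreads BOM-less UTF-8.
    return data.decode(fallback, errors="replace")

keyword = "K\u00f6ln"                       # "Köln"
without_bom = keyword.encode("utf-8")       # what a BOM-less UTF-8 export contains
with_bom = codecs.BOM_UTF8 + without_bom    # the same text with a UTF-8 BOM

intact = decode_by_bom(with_bom)      # "Köln"
broken = decode_by_bom(without_bom)   # "KÃ¶ln": each umlaut becomes two characters
```

The two UTF-8 bytes of "ö" (0xC3 0xB6) are read as two separate Windows-1252 characters, which is the typical "broken umlaut" pattern.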

Open the file in Notepad and save it as Unicode. This will write the file as Unicode with a BOM, and the import should work.
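
For anyone who prefers a script over Notepad, the same workaround can be sketched in Python. This assumes the exported keyword file really is UTF-8 (with or without a BOM); the function name `resave_with_bom` and the file names in the usage line are hypothetical. Whether IMatch also accepts a UTF-8 BOM, rather than Notepad's "Unicode" (UTF-16), is an assumption based on the explanation above.

```python
from pathlib import Path

def resave_with_bom(src: str, dst: str, target: str = "utf-8-sig") -> None:
    """Rewrite a UTF-8 text file so that it starts with a BOM.

    target="utf-8-sig" writes UTF-8 with a BOM; target="utf-16" writes
    UTF-16 with a BOM, similar to Notepad's "Unicode" option.
    """
    # Decoding with "utf-8-sig" drops an existing BOM, so it is never doubled.
    text = Path(src).read_bytes().decode("utf-8-sig")
    # Working on bytes keeps line endings and the tab indentation untouched.
    Path(dst).write_bytes(text.encode(target))
```

Usage would look like `resave_with_bom("keywords.txt", "keywords_bom.txt")`, after which the new file is the one to import.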
Title: Re: Thesaurus import with german umlauts
Post by: RobiWan on November 25, 2017, 09:16:25 PM
Quote from: Mario on November 25, 2017, 01:54:15 PM
Open the file in Notepad and save it as Unicode. This will write the file as Unicode with a BOM, and the import should work.

Yes this works. Thank you.

Quote from: Mario on November 25, 2017, 01:54:15 PM
Lr does not write a UTF-8 BOM to mark the file as UTF-8 encoded.

That is true, but I think it is not necessary to write a BOM header to tell applications that a file is UTF-8 encoded.
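
A rough Python sketch of what BOM-less detection could look like: try a strict UTF-8 decode first and only fall back to ANSI if that fails. This is a common heuristic, not a description of what IMatch does or will do; `decode_guessing_utf8` is a hypothetical name and `cp1252` is an assumed fallback code page.

```python
def decode_guessing_utf8(data: bytes, fallback: str = "cp1252") -> str:
    """Prefer UTF-8 even without a BOM; fall back to an ANSI code page.

    Heuristic only: text with umlauts saved as ANSI is almost never valid
    UTF-8, so a successful strict decode is a strong hint for UTF-8.
    """
    try:
        # Strict decode; "utf-8-sig" also removes a BOM if one is present.
        return data.decode("utf-8-sig")
    except UnicodeDecodeError:
        # Not valid UTF-8, so assume the legacy ANSI code page.
        return data.decode(fallback, errors="replace")

utf8_bytes = "K\u00f6ln".encode("utf-8")    # BOM-less UTF-8, as exported by Lr
ansi_bytes = "K\u00f6ln".encode("cp1252")   # the same keyword saved as ANSI

# Both byte sequences decode to the same, intact keyword.
from_utf8 = decode_guessing_utf8(utf8_bytes)
from_ansi = decode_guessing_utf8(ansi_bytes)
```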