What it does
The code page conversion transform translates your print job from one character set, or code page, to another. Another term for character set is data encoding. The characters in a data file are typically stored in a single data encoding, and a program that reads the file (or a user looking at the document) must use that same encoding to interpret the data as it was intended.
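To illustrate why the encoding matters, here is a minimal Python sketch, independent of RPM, that reads the same bytes under two different code pages. The sample character and the choice of "cp1252" (the usual Western European Windows code page) are assumptions made for the example.

```python
# One Traditional Chinese character stored as Big5 bytes.
raw = "中".encode("big5")

# Read with the encoding the data was written in: the original character.
as_big5 = raw.decode("big5")

# Read with a Western European code page: each byte becomes an unrelated symbol.
as_cp1252 = raw.decode("cp1252")

print(len(raw), repr(as_big5), repr(as_cp1252))
```

The bytes never change; only the interpretation does. This is why the transform needs to be told the input character set.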
- Input character set: this is the code page for the incoming data.
For example, we often see Traditional Chinese data, which is typically encoded in Big5. In this case, you would select "Big5" as the input character set.
- Output character set: this is the code page or encoding for the result data.
You need to know which encoding the program or device that receives the result data expects. A vendor can't recommend a setting without knowing that. As an example, RPM uses this transform internally in the Text Markup transform. We let the user select the input character set in the Text Markup dialog. RPM then automatically selects "UTF-8" as the output character set, because UTF-8 works best for our text markup processing.
Let's say you have a Big5 text file on an English Windows system, where programs do not assume Big5 by default. You would use RPM to translate the file from Big5 to UCS-2, then open the result in a program like Notepad to view it.
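The same kind of conversion can be sketched outside RPM in a few lines of Python. This is only an illustration of the idea, not RPM's implementation; the function name `convert_code_page` and its parameters are hypothetical. Python has no codec named UCS-2, so the sketch uses "utf-16", which produces the same bytes as UCS-2 for characters in the Basic Multilingual Plane and adds a byte order mark.

```python
def convert_code_page(data: bytes, input_charset: str, output_charset: str) -> bytes:
    """Decode data from the input character set, then re-encode it in the output one."""
    text = data.decode(input_charset)    # bytes -> Unicode text
    return text.encode(output_charset)   # Unicode text -> bytes in the new encoding


# Sample "print job": two Traditional Chinese characters stored as Big5.
big5_job = "中文".encode("big5")

# Big5 -> UTF-8, the pairing described for the Text Markup transform.
utf8_job = convert_code_page(big5_job, "big5", "utf-8")

# Big5 -> UTF-16 with a byte order mark, as a stand-in for UCS-2.
utf16_job = convert_code_page(big5_job, "big5", "utf-16")
```

If the input character set does not match the data, `decode` may raise `UnicodeDecodeError` or silently produce the wrong characters, so the input setting has to match the encoding the file was actually written in.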