Re: Word wrapping in PO

bug-gnu-utils

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Word wrapping in PO

From:	Bruno Haible
Subject:	Re: Word wrapping in PO
Date:	Mon, 17 Dec 2007 12:35:48 +0100
User-agent:	KMail/1.5.4

Hi,

Dwayne Bailey wrote:
> Although in most cases translators are not reading the PO format
> directly but are rather using a PO editor.  There are cases where a
> translator wants to work directly with the format.
> ...
> However, in a few cases the way that Gettext currently wraps words in
> the PO format makes it hard to read in the raw format.

You can always avoid this wrapping by preprocessing your PO file with
"msgcat --width=10000".

> The following
> diff snippet shows how a good wrapping was converted to a much harder to
> read form:
> 
> --- manager/chrome/pippki/pippki.dtd.po (revision 9338)
> +++ manager/chrome/pippki/pippki.dtd.po (working copy)
> @@ -95,9 +95,9 @@
>  "form data, personal certificates, and private keys will be forgotten.
> Are "
>  "you sure you want to reset your master password?"
>  msgstr ""
> -"Indien u u meesterwagwoord terugstel, sal al u gestoorde web- en "
> -"e-poswagwoorde, vormdata, persoonlike sertifikate en private sleutels
> "
> -"vergeet word. Is u seker dat u die meesterwagwoord wil teruglaai?"
> +"Indien u u meesterwagwoord terugstel, sal al u gestoorde web- en e-"
> +"poswagwoorde, vormdata, persoonlike sertifikate en private sleutels
> vergeet "
> +"word. Is u seker dat u die meesterwagwoord wil teruglaai?"
> 
>  #. Values for getpassword.xul
>  #: getPassword.title
> 
> This might not be clear with wrapping in my email.  In summary:
> 
> ...web- en "
> "e-poswagwoorde, ...
> 
> becomes
> 
> web en e-"
> "poswagwoorde...
> 
> The second is much harder to read.

I don't see how to do line breaking here that would avoid this special case,
without using a dictionary-based approach. I don't want a dictionary-based
line breaking in gettext since gettext has to support many languages, the
installed size of > 30 dictionaries would be huge, and the line breaking
feature in gettext is of minor usefulness anyway.

> Suggested solution: wrap only on spaces and do fancy word breaking only
> in cases where it is needed.

Sorry, this does not work for Chinese. The algorithm that gettext uses is
the Unicode line breaking algorithm [1], chosen because it produces
acceptable results in most scripts and most languages.

Bruno

[1] http://www.unicode.org/reports/tr14/

[Prev in Thread]

Current Thread

[Next in Thread]

Word wrapping in PO, Dwayne Bailey, 2007/12/12
- Re: Word wrapping in PO, Bruno Haible <=
  - Re: Word wrapping in PO, Dwayne Bailey, 2007/12/18
    - Re: Word wrapping in PO, Bruno Haible, 2007/12/26

Prev by Date: GNU diff vs. bzr diff
Next by Date: Time command not reporting memory usage :<
Previous by thread: Word wrapping in PO
Next by thread: Re: Word wrapping in PO
Index(es):
- Date
- Thread