bug-standards
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH] standards: rewrite section on quoting


From: Paolo Bonzini
Subject: Re: [PATCH] standards: rewrite section on quoting
Date: Wed, 21 Dec 2011 12:04:58 +0100
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:7.0.1) Gecko/20110930 Thunderbird/7.0.1

On 12/21/2011 12:44 AM, Karl Berry wrote:
> It never occurred to me to associate '...' in error messages with the
> British quoting style and "..." with American.  I know well that that is
> true typographically, but ASCII and computer messages in general are
> such a limited domain, compared to, say, book printing.
> 
> However, I myself don't see any particular problem with the GCS allowing
> either "..." or '...'.
> 
> Paolo, since Paul clearly feels strongly about this, can you change the
> wording to allow either one?  Then we can see what rms says.

Sure.  I just removed any reference to Unicode in the English locale,
since there's no way that it can be right. :)

Paolo

------------------- 8< ------------------------

>From b1e6cfcb360cd3cf8e6b2993fc518746efad4791 Mon Sep 17 00:00:00 2001
From: Paolo Bonzini <address@hidden>
Date: Tue, 20 Dec 2011 09:42:36 +0100
Subject: [PATCH v2] standards: rewrite section on quoting

GCC has been using apostrophes for quotes in the C locale for seven
years, and gnulib is following suit.  Describe this in the GNU coding
standards and add instructions about translating quotes and providing
parseable output.

* doc/standards.texi (Quote Characters): Add note about translating
quote characters and providing parseable output.  Do not anymore suggest
the grave accent as the opening quote character.  Suggest adding an
option such as GNU ls's --quoting-style.
---
 doc/standards.texi |   39 +++++++++++++++++++--------------------
 1 files changed, 19 insertions(+), 20 deletions(-)

diff --git a/doc/standards.texi b/doc/standards.texi
index 99d76f0..e9099c3 100644
--- a/doc/standards.texi
+++ b/doc/standards.texi
@@ -3274,12 +3274,28 @@ Quote Characters
 @cindex quote characters
 @cindex locale-specific quote characters
 @cindex left quote
address@hidden opening quote
address@hidden single quotes
 @cindex grave accent
 
 In the C locale, GNU programs should stick to plain ASCII for quotation
-characters in messages to users: preferably 0x60 (@samp{`}) for left
-quotes and 0x27 (@samp{'}) for right quotes.  It is ok, but not
-required, to use locale-specific quotes in other locales.
+characters in messages to users: preferably 0x27 (@samp{'}) for both
+opening and closing quotes.
+
+It is ok, but not required, to use locale-specific quotes in other
+locales.  Remember however that you should never use Unicode in the
+C locale.
+
+If you support internationalization, translators should be able to provide
+their own quote characters.  By convention, the string @samp{"`"} will
+translate to the opening quote and the string @samp{"'"} will translate
+to the closing quote.
+
+If the output of your program is ever likely to be parsed by another
+program, provide an option that makes this parsing reliable.  For example
+you could escape special characters using conventions from the C language
+or the Bourne shell.  See for example the option @option{--quoting-style}
+of GNU @code{ls}.
 
 The @uref{http://www.gnu.org/software/gnulib/, Gnulib} @code{quote} and
 @code{quotearg} modules provide a reasonably straightforward way to
@@ -3287,23 +3303,6 @@ Quote Characters
 other issues, such as quoting a filename that itself contains a quote
 character.  See the Gnulib documentation for usage details.
 
-In any case, the documentation for your program should clearly specify
-how it does quoting, if different than the preferred method of @samp{`}
-and @samp{'}.  This is especially important if the output of your
-program is ever likely to be parsed by another program.
-
-Quotation characters are a difficult area in the computing world at
-this time: there are no true left or right quote characters in Latin1;
-the @samp{`} character we use was standardized there as a grave
-accent.  Moreover, Latin1 is still not universally usable.
-
-Unicode contains the unambiguous quote characters required, and its
-common encoding UTF-8 is upward compatible with Latin1.  However,
-Unicode and UTF-8 are not universally well-supported, either.
-
-This may change over the next few years, and then we will revisit
-this.
-
 
 @node Mmap
 @section Mmap
-- 
1.7.7.1




reply via email to

[Prev in Thread] Current Thread [Next in Thread]