[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: non portable sed scripts
From: |
Paul Eggert |
Subject: |
Re: non portable sed scripts |
Date: |
Mon, 22 May 2006 10:28:36 -0700 |
User-agent: |
Gnus/5.1008 (Gnus v5.10.8) Emacs/21.4 (gnu/linux) |
Ralf Wildenhues <address@hidden> writes:
>> ``[...] more than one script file, and it fails with longer scripts.
>> Moreover, Solaris @command{/usr/xpg4/bin/sed} dumps core with long
>> scripts.''
>
> Maybe we should remove that altogether: I haven't been able to analyze
> this failure sufficiently to state good circumstances in which it
> happens (and I don't like to propagate possibly incorrect information).
Sounds reasonable.
> you removed mention of bugs of specific grep implementations:
I put those back in by moving it to the Limitations section, and
installed this patch for the doc changes. (I'll look at
sed-safe.patch next.)
2006-05-22 Paul Eggert <address@hidden>,
Ralf Wildenhues <address@hidden>,
Stepan Kasal <address@hidden>
* doc/autoconf.texi (Particular Programs): Do not promise that
we always prefer the GNU version of the program, and that we
search according to PATH; both rules can have exceptions.
Update description of AC_PROG_GREP, AC_PROG_EGREP, AC_PROG_FGREP,
AC_PROG_SED. Move descriptions of limitations
to the Limitations of Usual Tools section.
(Limitations of Usual Tools) <sed>: Mention script length
limitations with Solaris /usr/ucb/sed.
<grep>: Fix wording for empty alternative. Mention that -c and
-l should not be combined, and that -E and -F should not be
combined.
--- autoconf.texi.~1.1020~ 2006-05-22 09:46:56.000000000 -0700
+++ autoconf.texi 2006-05-22 10:22:48.000000000 -0700
@@ -3458,32 +3458,27 @@ best implementation.
@defmac AC_PROG_GREP
@acindex{PROG_GREP}
@ovindex GREP
-On AIX the default @code{grep} silently truncates long lines on the
-input before matching. On Solaris, @code{/usr/bin/grep} does not
-understand the @option{-e} option. On NeXT, @code{grep} understands only a
-single @option{-e} option. This macro looks for @sc{gnu} Grep or
-else the best available @code{grep} or @code{ggrep} in the user's
address@hidden which accepts the longest input lines possible, and which
-accepts and respects multiple @option{-e} options. Set the
-output variable @code{GREP} to whatever is chosen.
+Look for the best available @code{grep} or @code{ggrep} that accepts the
+longest input lines possible, and that supports multiple @option{-e} options.
+Set the output variable @code{GREP} to whatever is chosen.
address@hidden of Usual Tools}, for more information about
+portability problems with the @command{grep} command family.
@end defmac
@defmac AC_PROG_EGREP
@acindex{PROG_EGREP}
@ovindex EGREP
-Check whether @code{$GREP -E} works, or else search the user's
address@hidden for @code{egrep}, and @code{gegrep}, in that order, and set
-output variable @code{EGREP} to the one that accepts the longest input
-lines.
+Check whether @code{$GREP -E} works, or else look for the best available
address@hidden or @code{gegrep} that accepts the longest input lines possible.
+Set the output variable @code{EGREP} to whatever is chosen.
@end defmac
@defmac AC_PROG_FGREP
@acindex{PROG_FGREP}
@ovindex FGREP
-Check whether @code{$GREP -F} works, or else search the user's
address@hidden for @code{fgrep}, and @code{gfgrep}, in that order, and set
-output variable @code{FGREP} to the one that accepts the longest input
-lines.
+Check whether @code{$GREP -F} works, or else look for the best available
address@hidden or @code{gfgrep} that accepts the longest input lines possible.
+Set the output variable @code{FGREP} to whatever is chosen.
@end defmac
@defmac AC_PROG_INSTALL
@@ -3632,9 +3627,10 @@ is found, and otherwise to @samp{:} (do
@defmac AC_PROG_SED
@acindex{PROG_SED}
@ovindex SED
-Set output variable @code{SED} to a Sed implementation on @env{PATH} that
-truncates as few characters as possible. If @sc{gnu} Sed is found,
-use that instead.
+Set output variable @code{SED} to a Sed implementation that conforms to
+Posix and does not have arbitrary length limits. Report an error if no
+acceptable Sed is found. @xref{Limitations of Usual Tools}, for more
+information about portability problems with Sed.
@end defmac
@defmac AC_PROG_YACC
@@ -12778,7 +12774,7 @@ Portable extended regular expressions sh
characters in the string @samp{$()address@hidden|}. For example,
@address@hidden
is not portable, even though it typically matches @address@hidden
-The empty alternative is not portable, use @samp{?} instead. For
+The empty alternative is not portable. Use @samp{?} instead. For
instance with Digital Unix v5.0:
@example
@@ -12954,7 +12950,9 @@ while @acronym{GNU} @command{find} repor
Portable scripts can rely on the @command{grep} options @option{-c},
@option{-l}, @option{-n}, and @option{-v}, but should avoid other
options. For example, don't use @option{-w}, as Posix does not require
-it and Irix 6.5.16m's @command{grep} does not support it.
+it and Irix 6.5.16m's @command{grep} does not support it. Also,
+portable scripts should not combine @option{-c} with @option{-l},
+as Posix does not allow this.
Some of the options required by Posix are not portable in practice.
Don't use @samp{grep -q} to suppress output, because many @command{grep}
@@ -12968,9 +12966,12 @@ doesn't exist) of @code{grep} to @file{/
status of @code{grep} to determine whether it found a match.
Some traditional @command{grep} implementations do not work on long
-input lines. Also, many implementations do not support multiple regexps
+input lines. On AIX the default @code{grep} silently truncates long
+lines on the input before matching.
+
+Also, many implementations do not support multiple regexps
with @option{-e}: they either reject @option{-e} entirely (e.g., Solaris)
-or honor only the last pattern (e.g., @acronym{IRIX} 6.5). To
+or honor only the last pattern (e.g., @acronym{IRIX} 6.5 and NeXT). To
work around these problems, invoke @code{AC_PROG_GREP} and then use
@code{$GREP}.
@@ -12989,7 +12990,10 @@ implementations and with address@hidden
Traditional @command{grep} implementations (e.g., Solaris) do not
support the @option{-E} or @option{-F} options. To work around these
problems, invoke @code{AC_PROG_EGREP} and then use @code{$EGREP}, and
-similarly for @code{AC_PROG_FGREP} and @code{$FGREP}.
+similarly for @code{AC_PROG_FGREP} and @code{$FGREP}. Even if you are
+willing to require support for Posix @command{grep}, your script should
+not use both @option{-E} and @option{-F}, since Posix does not allow
+this combination.
Portable @command{grep} regular expressions should use @samp{\} only to
escape characters in the string @samp{$()address@hidden@}}. For example,
@@ -13214,6 +13218,8 @@ should not contain comments. HP-UX sed
(not counting @samp{:} commands) and
48 labels, which can not be circumvented by using more than one script
file. It can execute up to 19 reads with the @samp{r} command per cycle.
+Solaris @command{/usr/ucb/sed} rejects usages that exceed an limit of
+about 6000 bytes for the internal representation of commands.
Avoid redundant @samp{;}, as some @command{sed} implementations, such as
address@hidden 1.4.2's, incorrectly try to interpret the second
- Re: non portable sed scripts, (continued)
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/20
- Re: non portable sed scripts, Paul Eggert, 2006/05/21
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/21
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/22
- Re: non portable sed scripts, Paul Eggert, 2006/05/22
- Re: non portable sed scripts, Stepan Kasal, 2006/05/22
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/22
- Re: non portable sed scripts, Stepan Kasal, 2006/05/22
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/22
- Re: non portable sed scripts, Stepan Kasal, 2006/05/22
- Re: non portable sed scripts,
Paul Eggert <=
- Re: non portable sed scripts, Paul Eggert, 2006/05/23
- Re: non portable sed scripts, Tim Rice, 2006/05/23
- Re: non portable sed scripts, Stepan Kasal, 2006/05/23
- Re: non portable sed scripts, Ralf Wildenhues, 2006/05/23
- Re: non portable sed scripts, Tim Rice, 2006/05/23
Re: non portable sed scripts, Ralf Wildenhues, 2006/05/23
Re: non portable sed scripts, Paul Eggert, 2006/05/23