bug-textutils
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: bug involving non-ASCII characters used in uniq


From: Bob Proulx
Subject: Re: bug involving non-ASCII characters used in uniq
Date: Mon, 11 Aug 2003 01:06:54 -0600
User-agent: Mutt/1.3.28i

Roger Levy wrote:
> I've found that the version of uniq that comes with Redhat 9 doesn't
> seem to work when there are non-ASCII characters in the lines
> (specifically, I've had identical lines fail to be identified when
> they contain characters from the GB2312 character set, simplified
> Chinese).  On the other hand, sort -u works fine on these lines.  Is
> this a known problem?  I would be happy to get involved in any effort
> necessary to fix this problem.

Thanks for the report.  If you could reduce the problem to a small (a
few lines) input file that would make reproducing the problem much
easier.  If it can be reproduced then we can understand it.  Read the
FAQ[1] section on bug reporting.

Among other things we would need to know what version of glibc you are
using and what version of uniq you are using.  I don't think any of us
on this list are running RH systems and so we don't know what goes
into any particular release.  You have to let us know the versions of
the individual components which are related.
  
  uniq --version | head -n 1
  ldd --version | head -n 1

Since you mention RH I always have to ask if this is a locale problem.
Try setting your locale to "C" and see if that works correctly.  RH
and locale issues are an often reported problems.  It probably won't
be a complete solution for you but it will be a clue as to the cause
of the problem.

  export LC_ALL=C

Thanks
Bob

[1] http://www.gnu.org/software/coreutils/faq/coreutils-faq.html




reply via email to

[Prev in Thread] Current Thread [Next in Thread]