tr on OS X
Brandon Allbery
allbery at kf8nh.com
Sat Apr 3 01:00:57 PDT 2010
On Apr 3, 2010, at 03:46 , Scott Haneda wrote:
> Just out of curiosity, if I wanted to do something like:
> $cat /dev/random | tr "a" "b"
>
> How do I get `tr` to work? I do not see anything in the man page
> that would allow me to treat the binary data as correctly.
$ LANG=C tr a b </dev/random
If a multibyte localization is configured (OSX and most Linuxes
default to UTF8, which is a variable length encoding) then tr will
become annoyed when it sees illegal character sequences. In single-
byte encodings like ISO8859/1 you're probably fine as long as you
don't use things like [:upper:], but LANG=C disables all of this and
processes the data as a raw stream of bytes.
tr on many systems was until recently not locale-aware because it was
designed internally to deal with bytes, not characters. Recent GNU
tr, and BSD tr (which is what OSX uses), were rewritten to be locale-
capable. and therefore trip over the illegal-UTF8-sequences problem
when used on binary data unless LANG=C is used to disable all locale
processing.
--
brandon s. allbery [solaris,freebsd,perl,pugs,haskell] allbery at kf8nh.com
system administrator [openafs,heimdal,too many hats] allbery at ece.cmu.edu
electrical and computer engineering, carnegie mellon university KF8NH
-------------- next part --------------
A non-text attachment was scrubbed...
Name: PGP.sig
Type: application/pgp-signature
Size: 195 bytes
Desc: This is a digitally signed message part
URL: <http://lists.macosforge.org/pipermail/macports-users/attachments/20100403/d83132f9/attachment.bin>
More information about the macports-users
mailing list