bash replaces special characters in a variable

Question

You probably have a "BOM" (byte order mark), which Unicode-locale-based systems use to indicate whether the data is "little-endian" or "big-endian".

See https://en.wikipedia.org/wiki/Byte_order_mark

Thankfully, this one appears to come from a UTF-8 locale, which is a good thing if you only expect ASCII characters 1-177 (octal, i.e. 1-127 decimal)...

You can get rid of it by inserting a sed that is (temporarily) forced to use the C locale, so that it can "see" the BOM as raw bytes:

LC_ALL=C sed '1s/^\xEF\xBB\xBF//'

For example, use it like this:

incoming_program | LC_ALL=C sed '1s/^\xEF\xBB\xBF//' | somecmd
  # or
< incomingfile LC_ALL=C sed '1s/^\xEF\xBB\xBF//' > outputfile
  #  < incomingfile : feeds the content of "incomingfile" to sed as stdin
  #  sed then modifies only the first line, replacing the BOM with ""
  #    (the rest is not touched by sed and is passed through as-is)
  #  > outputfile : directs sed's output (i.e. incomingfile without the BOM) to "outputfile"
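
To check that the BOM is really there and really gone, something along these lines might help. It is only a sketch: the names strip_bom and sample and the sample text are made up for illustration, and the BOM is built with printf octal escapes (an assumption made here so the snippet does not depend on sed understanding \xEF). The last lines show the same idea for text that is already in a shell variable.

```shell
# UTF-8 BOM bytes (EF BB BF), written in octal so any POSIX printf can emit them.
bom=$(printf '\357\273\277')

# strip_bom (hypothetical name): remove a leading BOM from the first line of stdin.
strip_bom() {
    LC_ALL=C sed "1s/^${bom}//"
}

# Made-up sample input: two lines, the first one prefixed with a BOM.
sample() {
    printf '\357\273\277hello\nworld\n'
}

# Dump the first three bytes in hex, before and after stripping.
sample | head -c 3 | od -An -tx1
sample | strip_bom | head -c 3 | od -An -tx1

# The same idea for text already held in a shell variable:
# drop the BOM prefix with parameter expansion.
text=$(sample)
clean=${text#"$bom"}
printf '%s\n' "$clean"
```

The first od line should show the bytes ef bb bf, and the second one the first three bytes of the actual text. Input without a BOM passes through unchanged, since the substitution only matches at the very start of line 1.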

Answer 1