파일을 반복하고 각 문자의 발생 횟수를 계산하려면 어떻게 해야 합니까?

Question 1

나는 다음과 같은 것을 선택할 것입니다 :

grep -o . file | sort | uniq -c

  1 d
  1 e
  1 H
  3 l
  2 o
  1 r
  1 W

또는 대문자와 소문자를 단일 문자로 처리하려는 경우:

grep -o . file | sort | uniq -ic | tr [:lower:] [:upper:]

  1 D
  1 E
  1 H
  3 L
  2 O
  1 R
  1 W

| tr [:lower:] [:upper:]예상되는 출력으로 모두 대문자를 인쇄하는 옵션이 있습니다.

Answer

나는 다음과 같은 것을 선택할 것입니다 :

grep -o . file | sort | uniq -c

  1 d
  1 e
  1 H
  3 l
  2 o
  1 r
  1 W

또는 대문자와 소문자를 단일 문자로 처리하려는 경우:

grep -o . file | sort | uniq -ic | tr [:lower:] [:upper:]

  1 D
  1 E
  1 H
  3 L
  2 O
  1 R
  1 W

| tr [:lower:] [:upper:]예상되는 출력으로 모두 대문자를 인쇄하는 옵션이 있습니다.

Question 2

파일의 각 문자 수를 계산하려면GNU awk

awk 'BEGIN{FS=""} {for (i=1; i<=NF; i++){a[$i]++}}END{for (i in a){print i,":", a[i]}}' file

문자를 대소문자를 구분하지 않고 처리 tolower하거나 toupper다음을 사용할 수 있습니다.

awk 'BEGIN{FS=""} {for (i=1; i<=NF; i++){a[tolower($i)]++}}END{for (i in a){print i,":", a[i]}}' file

샘플 출력

c : 1
d : 3
e : 2
f : 2
h : 1
i : 12
l : 1
m : 1
n : 8
o : 2
p : 1
r : 4
s : 1
t : 6
u : 2
{ : 3
} : 3

Answer

파일의 각 문자 수를 계산하려면GNU awk

awk 'BEGIN{FS=""} {for (i=1; i<=NF; i++){a[$i]++}}END{for (i in a){print i,":", a[i]}}' file

문자를 대소문자를 구분하지 않고 처리 tolower하거나 toupper다음을 사용할 수 있습니다.

awk 'BEGIN{FS=""} {for (i=1; i<=NF; i++){a[tolower($i)]++}}END{for (i in a){print i,":", a[i]}}' file

샘플 출력

c : 1
d : 3
e : 2
f : 2
h : 1
i : 12
l : 1
m : 1
n : 8
o : 2
p : 1
r : 4
s : 1
t : 6
u : 2
{ : 3
} : 3

Question 3

다른 답변을 선호하지만 휴대용 답변이 누락되었으므로 awk를 사용하십시오.

awk '
{
    m=1
    #$0=toupper($0)
    while(m<=length($0)){ #While there are still chars unparsed in the line
        ch=substr($0,m,1) #Get one char of the line
        cnt[ch]++         #Increment its counter
        m++               #Point to the next char
    }
}
END{for(ch in cnt)print cnt[ch],"\t",ch}
' file

대소문자를 구분하지 않으려면 행의 주석 처리를 제거하십시오.

샘플 파일의 출력:

Answer

다른 답변을 선호하지만 휴대용 답변이 누락되었으므로 awk를 사용하십시오.

awk '
{
    m=1
    #$0=toupper($0)
    while(m<=length($0)){ #While there are still chars unparsed in the line
        ch=substr($0,m,1) #Get one char of the line
        cnt[ch]++         #Increment its counter
        m++               #Point to the next char
    }
}
END{for(ch in cnt)print cnt[ch],"\t",ch}
' file

대소문자를 구분하지 않으려면 행의 주석 처리를 제거하십시오.

샘플 파일의 출력:

파일을 반복하고 각 문자의 발생 횟수를 계산하려면 어떻게 해야 합니까?

답변1

답변2

답변3

관련 정보