별도의 단어 목록을 사용하여 파일에서 단어 추출

Question

awk 함수의 반환 값을 사용하여 in 의 행에 in 의 하위 문자열이 포함되어 index있는지 확인할 수 있습니다 .b.txta.txt

index(in, find)

    Search the string in for the first occurrence of the string find, and return 
the position in characters where that occurrence begins in the string in.

예를 들어:

awk '
  NR==FNR{strings[$1]; next}
  {
    m = ""
    for(s in strings){
      if(index($0,s) > 0) m = (m=="") ? s : m ", " s
    }
  }
  m != "" {print $0, ">", m}
' a.txt b.txt
threetwo > three, two
onetwothree > three, two, one
twozero > two

a.txtawk에서는 배열 순회 순서(이 경우 구성된 하위 문자열 배열)가 보장되지 않습니다.

Answer 1

awk 함수의 반환 값을 사용하여 in 의 행에 in 의 하위 문자열이 포함되어 index있는지 확인할 수 있습니다 .b.txta.txt

index(in, find)

    Search the string in for the first occurrence of the string find, and return 
the position in characters where that occurrence begins in the string in.

예를 들어:

awk '
  NR==FNR{strings[$1]; next}
  {
    m = ""
    for(s in strings){
      if(index($0,s) > 0) m = (m=="") ? s : m ", " s
    }
  }
  m != "" {print $0, ">", m}
' a.txt b.txt
threetwo > three, two
onetwothree > three, two, one
twozero > two

a.txtawk에서는 배열 순회 순서(이 경우 구성된 하위 문자열 배열)가 보장되지 않습니다.

별도의 단어 목록을 사용하여 파일에서 단어 추출

답변1

관련 정보