Search for a pattern and append the line to another file

Answer 1

You can use your existing code. Store each line in an array and match on its fifth element.

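# For each non-empty line of allKO.txt, use its fifth word as the search pattern.
while read -r line; do
    [ -z "$line" ] && continue
    read -r -a patlist <<< "$line"    # split on whitespace without glob expansion
    pat=${patlist[4]}                 # arrays are zero-indexed, so [4] is the fifth word
    # --label names standard input, so -H prefixes each match with the allKO.txt line
    grep -H --label="$line" -- "$pat" < KEGG.annotations
done < allKO.txt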

This returns:

Metabolism Carbohydrate metabolism Glycolisis K07448:>aai:AARI_33320  mrr; restriction system protein Mrr; K07448 restriction system protein
Metabolism Protein metabolism protesome K02217:>aai:AARI_26600  ferritin-like protein; K02217 ferritin [EC:1.16.3.1]
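
To make the array step concrete, here is a minimal bash sketch of how a line is split into words and how the fifth one is selected. The sample line is hypothetical (modelled on the output above), and the variable count is introduced only for illustration.

```shell
#!/usr/bin/env bash
# Hypothetical sample line, modelled on the allKO.txt rows shown above.
line='Metabolism Carbohydrate metabolism Glycolisis K07448'

# Split the line on whitespace into an array (bash-specific).
read -r -a patlist <<< "$line"

# Bash arrays are zero-indexed, so index 4 holds the fifth word.
pat=${patlist[4]}
count=${#patlist[@]}
printf 'words: %s, pattern: %s\n' "$count" "$pat"
```

If a line has fewer than five words, pat is empty, and grep with an empty pattern matches every line, so a guard on $pat may be worth adding.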

Answer 2

This seems to do what you want:

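# Read the five whitespace-separated fields of each allKO.txt line.
while read -r w1 w2 w3 w4 ID
do
    # Print the allKO.txt fields followed by a space, without a newline.
    printf '%s ' "$w1 $w2 $w3 $w4 $ID"
    # grep prints every matching annotation line, each ending in a newline.
    if ! grep -- "$ID" KEGG.annotations
    then
        echo    # no match: finish the line ourselves
    fi
done < allKO.txt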

This writes the output to the screen. To capture it in a file, add an output redirection (>) to the end of the last line, for example > test1.
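
As a small sketch of where that redirection goes, the loop below reads a hypothetical one-line input file and sends everything the loop prints to test1. The scratch directory and the file name in.txt are assumptions made only so the example does not touch real files.

```shell
# Work in a scratch directory so no real files are overwritten.
tmpdir=$(mktemp -d)
printf 'a b c d K00001\n' > "$tmpdir/in.txt"   # hypothetical one-line input

# The redirection after "done" applies to the whole loop,
# so everything the loop body prints ends up in test1.
while read -r w1 w2 w3 w4 ID
do
    printf '%s\n' "$ID"
done < "$tmpdir/in.txt" > "$tmpdir/test1"

captured=$(cat "$tmpdir/test1")
echo "$captured"
rm -rf "$tmpdir"
```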

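Based on your example, the key/ID field ("schema") is the fifth of five fields in allKO.txt, which you said is tab-separated, hence read w1 w2 w3 w4 ID. I assume that none of the fields contains spaces.
Write (printf) the line from allKO.txt (that is, its fields) with a trailing space but no terminating newline.
Search (grep) the KEGG.annotations file for the ID (the fifth field of the allKO.txt line). Each match is printed as a complete line, including its newline.
If grep fails, write a newline, because printf did not write one.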
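
The three steps can be tried end to end with throwaway data. The sketch below is only an illustration: the two allKO.txt rows and the single KEGG.annotations row are hypothetical samples, and the files live in a scratch directory created with mktemp.

```shell
tmpdir=$(mktemp -d)
# Hypothetical sample data, modelled on the examples in this answer.
printf 'Metabolism\tProtein\tmetabolism\tproteasome\tK02217\n' > "$tmpdir/allKO.txt"
printf 'This\tID\tdoes\tnot-exist\tK99999\n' >> "$tmpdir/allKO.txt"
printf '>aai:AARI_26600 ferritin-like protein; K02217 ferritin [EC:1.16.3.1]\n' > "$tmpdir/KEGG.annotations"

result=$(
    while read -r w1 w2 w3 w4 ID
    do
        # Step 1: the allKO.txt fields, a trailing space, no newline.
        printf '%s ' "$w1 $w2 $w3 $w4 $ID"
        # Step 2: grep prints each matching annotation line with its newline.
        if ! grep -- "$ID" "$tmpdir/KEGG.annotations"
        then
            # Step 3: nothing matched, so end the line here.
            echo
        fi
    done < "$tmpdir/allKO.txt"
)
printf '%s\n' "$result"
rm -rf "$tmpdir"
```

The first output line carries the annotation for K02217. The second holds only the allKO.txt fields for K99999, followed by the trailing space.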

With this approach, a line whose ID does not exist in KEGG.annotations is simply written to the output on its own:

Metabolism Protein metabolism proteasome K02217  >aai:AARI_26600 ferritin-like protein; K02217 ferritin [EC:1.16.3.1]
This ID doesn’t exist: K99999

An ID that occurs more than once in KEGG.annotations produces additional output lines, without the data from allKO.txt being repeated:

Metabolism Protein metabolism proteasome K02217  >aai:AARI_26600 ferritin-like protein; K02217 ferritin [EC:1.16.3.1]
This is a hypothetical additional line from KEGG.annotations that mentions “K02217”.
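
The multiple-match behaviour can be checked with a small sketch like the one below. All data in it is hypothetical: one allKO.txt row and two annotation lines that both mention its ID.

```shell
tmpdir=$(mktemp -d)
# Hypothetical data: one ID that appears on two annotation lines.
printf 'a\tb\tc\td\tK02217\n' > "$tmpdir/allKO.txt"
printf 'first line with K02217\nsecond line with K02217\n' > "$tmpdir/KEGG.annotations"

multi=$(
    while read -r w1 w2 w3 w4 ID
    do
        printf '%s ' "$w1 $w2 $w3 $w4 $ID"
        if ! grep -- "$ID" "$tmpdir/KEGG.annotations"
        then
            echo
        fi
    done < "$tmpdir/allKO.txt"
)
printf '%s\n' "$multi"
rm -rf "$tmpdir"
```

Only the first match is prefixed with the allKO.txt fields. The second match appears on a line of its own, exactly as grep printed it.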
