일치하는 항목을 기준으로 다른 문자열을 다른 파일의 문자열 목록으로 바꿉니다.

Question 1

file1바꾸려는 텍스트가 있고 file2대체 텍스트가 있고 ID=둘 사이에서 조회를 수행 할 수 있다고 가정하면 다음 awk 스크립트를 사용할 수 있습니다(더 인기 있는 것 같습니다).

awk -F'\t' '
  NR==FNR{
    a[$1]=$2                                   # fills the array a with the replacement text
    next
  }
  $3=="gene"{                                  # check only lines with 'gene'
    id=gensub("ID=([^;]*);.*","\\1",1,$9);     # extract the id string
    if(id in a)                                # if the id is part of the array a
       gsub(id,a[id])                          # replace it
  }
  1                                            # print the line
' file2 file1

Answer

file1바꾸려는 텍스트가 있고 file2대체 텍스트가 있고 ID=둘 사이에서 조회를 수행 할 수 있다고 가정하면 다음 awk 스크립트를 사용할 수 있습니다(더 인기 있는 것 같습니다).

awk -F'\t' '
  NR==FNR{
    a[$1]=$2                                   # fills the array a with the replacement text
    next
  }
  $3=="gene"{                                  # check only lines with 'gene'
    id=gensub("ID=([^;]*);.*","\\1",1,$9);     # extract the id string
    if(id in a)                                # if the id is part of the array a
       gsub(id,a[id])                          # replace it
  }
  1                                            # print the line
' file2 file1

Question 2

인기 없는 선택: Tcl. Tcl에는 string map이를 수행하는 훌륭한 명령이 있습니다 .정확히이것. 불행하게도 Tcl은 실제로 Perl 스타일의 단일 라이너용으로 제작되지 않았습니다.

echo '
    # read the mapping file into a list
    set fh [open "mapping" r]
    set content [read $fh]
    close $fh
    set mapping [regexp -all -inline {\S+} $content]

    # read the contents of the data file
    # and apply mapping to field 9 when field 3 is "gene"
    set fh [open "file" r]
    while {[gets $fh line] != -1} {
        set fields [split $line \t]
        if {[lindex $fields 2] eq "gene"} {
            lset fields 8 [string map $mapping [lindex $fields 8]]
        }
        puts [join $fields \t]
    }
    close $fh
' | tclsh

awk를 사용하여 다음과 같이 작성합니다.

awk -F'\t' -v OFS='\t' '
    NR == FNR {repl[$1]= $2; next}
    $3 == "gene" {
        for (seek in repl) 
            while ((idx = index($9, seek)) > 0) 
                $9 = substr($9, 1, idx-1) repl[seek] substr($9, idx + length(seek))
    }
    {print}
' mapping file

Answer

인기 없는 선택: Tcl. Tcl에는 string map이를 수행하는 훌륭한 명령이 있습니다 .정확히이것. 불행하게도 Tcl은 실제로 Perl 스타일의 단일 라이너용으로 제작되지 않았습니다.

echo '
    # read the mapping file into a list
    set fh [open "mapping" r]
    set content [read $fh]
    close $fh
    set mapping [regexp -all -inline {\S+} $content]

    # read the contents of the data file
    # and apply mapping to field 9 when field 3 is "gene"
    set fh [open "file" r]
    while {[gets $fh line] != -1} {
        set fields [split $line \t]
        if {[lindex $fields 2] eq "gene"} {
            lset fields 8 [string map $mapping [lindex $fields 8]]
        }
        puts [join $fields \t]
    }
    close $fh
' | tclsh

awk를 사용하여 다음과 같이 작성합니다.

awk -F'\t' -v OFS='\t' '
    NR == FNR {repl[$1]= $2; next}
    $3 == "gene" {
        for (seek in repl) 
            while ((idx = index($9, seek)) > 0) 
                $9 = substr($9, 1, idx-1) repl[seek] substr($9, idx + length(seek))
    }
    {print}
' mapping file

일치하는 항목을 기준으로 다른 문자열을 다른 파일의 문자열 목록으로 바꿉니다.

답변1

답변2

관련 정보