단일 텍스트 파일에는 쉘 또는 bash 스크립트를 사용하는 여러 작업이 필요합니다

Question

awk다음은 나열된 단계를 수행하는 스크립트 입니다 . 하나의 스크립트에서 모든 작업을 수행 awk하면 여러 번 실행하고 중간 결과를 파일이나 변수에 저장할 필요가 없다는 이점이 있습니다 .

BEGIN { OFS = FS = "\t" }
NR == 1 {
    # Add new column headers

    # First four "mod" headers
    for (i = 1; i <= 4; ++i)
        $(NF + 1) = "mod" i

    # Then a "comment" header
    $(NF + 1) = "comment"

    # Output and continue with next input line
    print
    next
}

# Ignore lines that don't have "Exo" in the first column
$1 != "Exo" { next }

{
    # Working our way "backwards" from column 13 down to 1

    # Shift the last two columns right by three steps
    $13 = $10
    $12 = $9

    # Set column 11 to column 6, or to -1 if it's a dot
    if ($6 == ".")
        $11 = -1
    else
        $11 = $6 

    # Empty the comment column
    $10 = ""

    # Move column 8 into column 9
    $9 = $8

    # Split column 9 into columns 8 and 9
    split($9, a, ":")
    $9 = a[2]
    $8 = a[1]

    # Split column 7 into columns 6 and 7
    split($7, a, ":")
    $7 = a[2]
    $6 = a[1]

    # Column 5 remains unmodified

    # Put -1 in column 4 if it's a dot
    if ($4 == ".") $4 = -1

    # Columns 1, 2, 3 remains unmodified   
}

# Output if we want this line
$7 <= 0.01 { print }

실행하세요:

$ awk -f script.awk Test.txt
Chr     Start   End     Alt     Value   mod1    mod2    mod3    mod4    comment
Exo     0       10      -1      1.50    20      -2      30      0.9             -1      50:50   50
Exo     1       20      -1      1.50    20      -1      30      -1              -1      50:50   50
Exo     3       40      -1      1.50    20      -1      30      -2              -1      50:50   50

나는 여러분의 코드에서 여러분이 Exo이 줄에만 관심이 있다고 가정하고 있으므로 스크립트에서 해당 줄만 보도록 했습니다. 나는 Alttha 열(및 원래 이름이 지정되지 않은 첫 번째 열)의 모든 지점을 로 변경해야 -1하며 코드를 보고 변경할 수도 있다고 가정합니다 .

Answer 1

awk다음은 나열된 단계를 수행하는 스크립트 입니다 . 하나의 스크립트에서 모든 작업을 수행 awk하면 여러 번 실행하고 중간 결과를 파일이나 변수에 저장할 필요가 없다는 이점이 있습니다 .

BEGIN { OFS = FS = "\t" }
NR == 1 {
    # Add new column headers

    # First four "mod" headers
    for (i = 1; i <= 4; ++i)
        $(NF + 1) = "mod" i

    # Then a "comment" header
    $(NF + 1) = "comment"

    # Output and continue with next input line
    print
    next
}

# Ignore lines that don't have "Exo" in the first column
$1 != "Exo" { next }

{
    # Working our way "backwards" from column 13 down to 1

    # Shift the last two columns right by three steps
    $13 = $10
    $12 = $9

    # Set column 11 to column 6, or to -1 if it's a dot
    if ($6 == ".")
        $11 = -1
    else
        $11 = $6 

    # Empty the comment column
    $10 = ""

    # Move column 8 into column 9
    $9 = $8

    # Split column 9 into columns 8 and 9
    split($9, a, ":")
    $9 = a[2]
    $8 = a[1]

    # Split column 7 into columns 6 and 7
    split($7, a, ":")
    $7 = a[2]
    $6 = a[1]

    # Column 5 remains unmodified

    # Put -1 in column 4 if it's a dot
    if ($4 == ".") $4 = -1

    # Columns 1, 2, 3 remains unmodified   
}

# Output if we want this line
$7 <= 0.01 { print }

실행하세요:

$ awk -f script.awk Test.txt
Chr     Start   End     Alt     Value   mod1    mod2    mod3    mod4    comment
Exo     0       10      -1      1.50    20      -2      30      0.9             -1      50:50   50
Exo     1       20      -1      1.50    20      -1      30      -1              -1      50:50   50
Exo     3       40      -1      1.50    20      -1      30      -2              -1      50:50   50

나는 여러분의 코드에서 여러분이 Exo이 줄에만 관심이 있다고 가정하고 있으므로 스크립트에서 해당 줄만 보도록 했습니다. 나는 Alttha 열(및 원래 이름이 지정되지 않은 첫 번째 열)의 모든 지점을 로 변경해야 -1하며 코드를 보고 변경할 수도 있다고 가정합니다 .

단일 텍스트 파일에는 쉘 또는 bash 스크립트를 사용하는 여러 작업이 필요합니다

답변1

관련 정보