Compare the values in one column against all values in another column

Answer 1

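The script below does the following, which I think is what you want:

If a contig from file1 is not present in file2, print all lines for that contig.
If it is present in file2, then for each of its values in file1, print the value only if it is more than 10 away from every value for that contig in file2, that is, less than the file2 value - 10 or greater than the file2 value + 10.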

#!/usr/bin/env perl
use strict;
use warnings;

my (%file1, %file2);

## Read file1, the first argument.
open(my $fh1, '<', $ARGV[0]) or die "Cannot open '$ARGV[0]': $!\n";
while (<$fh1>) {
    ## Remove the trailing newline.
    chomp;
    ## Split the line on whitespace into the @F array.
    my @F = split ' ';
    ## Skip blank or incomplete lines.
    next unless @F >= 2;

    ## Save all lines in the %file1 hash.
    ## $F[0] is the contig name and $F[1] the value.
    ## The hash will store a list of all values
    ## associated with this contig.
    push @{ $file1{$F[0]} }, $F[1];
}
close($fh1);
## Read file2, the second argument.
open(my $fh2, '<', $ARGV[1]) or die "Cannot open '$ARGV[1]': $!\n";
while (<$fh2>) {
    ## Remove the trailing newline.
    chomp;
    ## Save the fields into the @F array.
    my @F = split ' ';
    ## Skip blank or incomplete lines.
    next unless @F >= 2;
    ## Again, save all values associated with each
    ## contig into the %file2 hash.
    push @{ $file2{$F[0]} }, $F[1];
}
close($fh2);

## For each of the contigs in file1 (sorted for a stable output order)
foreach my $contig (sort keys %file1) {
    ## If this contig exists in file2
    if (exists $file2{$contig}) {
        ## Get the list of values for that contig
        ## in each of the two files.
        my @f2_vals = @{ $file2{$contig} };
        my @f1_vals = @{ $file1{$contig} };
        ## For each of file1's values for this contig
        VAL1: foreach my $val1 (@f1_vals) {
            ## For each of file2's values for this contig
            foreach my $val2 (@f2_vals) {
                ## Skip to the next value from file1 if this one
                ## is within 10 of the current file2 value.
                unless (($val1 < $val2 - 10) || ($val1 > $val2 + 10)) {
                    next VAL1;
                }
            }
            ## We only get here if the value is more than 10 away
            ## from every file2 value for this contig. If so, we
            ## should print the value from file1.
            print "$contig $val1\n";
        }
    }
    ## If this contig is not in file2, print the
    ## lines from file1. This will print all lines
    ## from file1 whose contig was not in file2.
    else {
        print "$contig $_\n" for @{ $file1{$contig} };
    }
}

Save the script to a text file (for example, foo.pl), make it executable (chmod a+x foo.pl), and run it like this:

./foo.pl file1 file2

On your example, it returns:

$ ./foo.pl file1 file2
Contig2 68
Contig3 102
Contig7 79
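
To see the filtering rule in isolation, here is a minimal sketch that applies the same test to in-memory data instead of files. The helper names (group_by_contig, is_far_from_all, filter_values), the margin parameter, and the sample contigs and values are all illustrative assumptions; they are not the data from the question and not part of the script above.

```perl
#!/usr/bin/env perl
use strict;
use warnings;

## Hypothetical sample data (NOT the data from the question).
my @file1_lines = ("ContigA 50", "ContigA 95", "ContigB 7");
my @file2_lines = ("ContigA 45", "ContigA 120");

## Build a hash of arrays: contig name => list of values.
sub group_by_contig {
    my %values;
    for my $line (@_) {
        my ($contig, $value) = split ' ', $line;
        push @{ $values{$contig} }, $value;
    }
    return %values;
}

## True if $val is more than $margin away from every value in @$others.
## An empty list counts as "far from all".
sub is_far_from_all {
    my ($val, $others, $margin) = @_;
    my $near = grep { abs($val - $_) <= $margin } @$others;
    return $near == 0;
}

## Return the "contig value" lines from the first data set that pass
## the test. A contig missing from the second set keeps all its values.
sub filter_values {
    my ($f1, $f2, $margin) = @_;
    my @kept;
    for my $contig (sort keys %$f1) {
        my $others = $f2->{$contig} // [];
        for my $val (@{ $f1->{$contig} }) {
            push @kept, "$contig $val"
                if is_far_from_all($val, $others, $margin);
        }
    }
    return @kept;
}

my %f1 = group_by_contig(@file1_lines);
my %f2 = group_by_contig(@file2_lines);
print "$_\n" for filter_values(\%f1, \%f2, 10);
```

With this sample data the sketch prints "ContigA 95" and "ContigB 7": 50 is dropped because it is within 10 of 45, 95 is more than 10 away from both 45 and 120, and ContigB has no entry in the second data set. A value exactly 10 away counts as within range, matching the comparisons in the script above.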
