awk를 사용하여 두 개의 csv 파일을 비교하고 값을 추가합니다.

Question 1

이것은 연결된 질문보다 훨씬 간단합니다. 필요한 것은 다음과 같습니다.

awk -F, -v OFS=, 'NR==FNR{a[$1$2$3]=$4; next}{print $0,a[$1$2$3]}' file1 file2

설명하다

-F,: 입력 필드 구분 기호를 쉼표로 설정합니다.
-v OFS=,: 출력 필드 구분 기호를 쉼표로 설정합니다. 기본적으로 이는 쉼표로 구분된 출력을 인쇄하는 데 유용합니다.
NR==FNR: NR은 현재 줄 번호, FNR은 현재 파일의 줄 번호입니다. 첫 번째 파일을 읽는 경우에만 둘 다 동일합니다.
a[$1$2$3]=$4; next: 이것이 첫 번째 파일인 경우(위 참조) 키가 연결된 첫 번째, 두 번째 및 세 번째 필드인 배열에 네 번째 필드를 저장합니다.
print $0,a[$1$2$3]:현재 행( )과 처음 세 필드와 연관된 배열 $0의 값을 인쇄합니다. a첫 번째 파일에 해당하는 네 번째 필드입니다.

Answer

이것은 연결된 질문보다 훨씬 간단합니다. 필요한 것은 다음과 같습니다.

awk -F, -v OFS=, 'NR==FNR{a[$1$2$3]=$4; next}{print $0,a[$1$2$3]}' file1 file2

설명하다

-F,: 입력 필드 구분 기호를 쉼표로 설정합니다.
-v OFS=,: 출력 필드 구분 기호를 쉼표로 설정합니다. 기본적으로 이는 쉼표로 구분된 출력을 인쇄하는 데 유용합니다.
NR==FNR: NR은 현재 줄 번호, FNR은 현재 파일의 줄 번호입니다. 첫 번째 파일을 읽는 경우에만 둘 다 동일합니다.
a[$1$2$3]=$4; next: 이것이 첫 번째 파일인 경우(위 참조) 키가 연결된 첫 번째, 두 번째 및 세 번째 필드인 배열에 네 번째 필드를 저장합니다.
print $0,a[$1$2$3]:현재 행( )과 처음 세 필드와 연관된 배열 $0의 값을 인쇄합니다. a첫 번째 파일에 해당하는 네 번째 필드입니다.

Question 2

awk -F',' ' # start awk and use comma as a field separator
    FNR == NR { # if processed so far number of rows in current file if equal to overall processed number of rows do things in block {} 
        if (FNR == 1) {next} # if it is first row then continue (skip to next row)
        a[$1] = $2; # create an array indexed with first field, with value of second field
        b[$1] = $3; # another array
        next; # go to next row
    } # end of block executed only for first file
    { # beginning of block which will be executed without any initial conditions
        if (FNR == 1) {print;next} # if first row of file then print it and go to next one
        if (a[$1] == $2) { # if array value which correspond to field first is equal to second field do something (array 'a' has been set in first file, and now we input index to file from second file knowing that first fields of those files are the same)
            print $1,$2,$3,b[$1]; # print field 1-3 and array b[$1]
        }
        else { # if array is not equal
            print $1,a[$1],b[$1],b[$1]; # print stuff
        }
    }
  ' OFS=',' file1.csv file2.csv # OFS means output field separator, so we want to have comma in result too.

Answer

awk -F',' ' # start awk and use comma as a field separator
    FNR == NR { # if processed so far number of rows in current file if equal to overall processed number of rows do things in block {} 
        if (FNR == 1) {next} # if it is first row then continue (skip to next row)
        a[$1] = $2; # create an array indexed with first field, with value of second field
        b[$1] = $3; # another array
        next; # go to next row
    } # end of block executed only for first file
    { # beginning of block which will be executed without any initial conditions
        if (FNR == 1) {print;next} # if first row of file then print it and go to next one
        if (a[$1] == $2) { # if array value which correspond to field first is equal to second field do something (array 'a' has been set in first file, and now we input index to file from second file knowing that first fields of those files are the same)
            print $1,$2,$3,b[$1]; # print field 1-3 and array b[$1]
        }
        else { # if array is not equal
            print $1,a[$1],b[$1],b[$1]; # print stuff
        }
    }
  ' OFS=',' file1.csv file2.csv # OFS means output field separator, so we want to have comma in result too.

awk를 사용하여 두 개의 csv 파일을 비교하고 값을 추가합니다.

답변1

설명하다

답변2

관련 정보