awk: split out code blocks, then loop over multiple blocks if they exist

Question 1

I'm not sure, since I don't see the expected output in your question, but you asked "Can awk find the nth iteration of a "{" and return everything up to the next "}" character?", so this may be what you want (using awk, and assuming { and } cannot appear anywhere else in the input):

$ awk -v n=2 -v RS='}' 'NR==n{gsub(/.*\{\n|\n$/,""); print}' samp3.txt
first       "John"
address     "125 Main Street"
last    "Jacob"
age "30"
gender      "male"
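
The command works because RS='}' makes awk treat everything up to each closing brace as one record, so NR==n selects the nth block, and the gsub() strips the leading text through "{" plus the final newline. A minimal sketch that wraps this in a shell function follows; the name nth_block is hypothetical, and the added exit (not in the command above) simply stops reading once the block has been printed.

```shell
# nth_block N FILE: print the body of the Nth {...} block in FILE.
# Hypothetical helper; assumes { and } appear only as block delimiters.
nth_block() {
    awk -v n="$1" -v RS='}' '
        NR == n {
            gsub(/.*\{\n|\n$/, "")   # drop everything through "{\n" and the final newline
            print
            exit                     # block n is printed, no need to read further
        }
    ' "$2"
}
```

Calling nth_block 2 samp3.txt should print the same lines as the command above.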

To call it from a shell loop:

$ for i in {1..3}; do
    awk -v n="$i" -v RS='}' 'NR==n{gsub(/.*\{\n|\n$/,""); print}' samp3.txt
    echo "-----"
done
first       "John"
address     "124 Main Street"
last    "Jones"
special     "supervisor"
age "35"
gender      "male"
-----
first       "John"
address     "125 Main Street"
last    "Jacob"
age "30"
gender      "male"
-----
first       "John"
address     "523 Main Street"
last    "Jingle"
age "40"
gender      "male"
-----

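However, there is almost certainly a better way to get what you want than calling awk multiple times in a loop. For example, call awk once to print each chunk with a } terminator, then read those chunks into a shell array for further processing: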

$ readarray -d '}' -t arr < <(awk 'BEGIN{RS=ORS="}"} {gsub(/.*\{\n|\n$/,"")} $0~/[^[:space:]]/' samp3.txt)
$ for i in "${arr[@]}"; do printf '%s\n' "$i"; echo "-----"; done
first       "John"
address     "124 Main Street"
last    "Jones"
special     "supervisor"
age "35"
gender      "male"
-----
first       "John"
address     "125 Main Street"
last    "Jacob"
age "30"
gender      "male"
-----
first       "John"
address     "523 Main Street"
last    "Jingle"
age "40"
gender      "male"
-----

Really, though, whatever you are doing in that shell loop should most likely be done inside the single call to awk instead.
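
As a rough sketch of that last point, the per-block work can move into awk itself. Here the "work" is only printing a separator after each block, to mirror the loops above; the function name print_blocks is hypothetical, and the same assumption applies that { and } appear only as block delimiters.

```shell
# print_blocks FILE: print every {...} block body, each followed by "-----",
# using one awk process for the whole file.
print_blocks() {
    awk -v RS='}' '
        { gsub(/.*\{\n|\n$/, "") }   # strip the text through "{\n" and the final newline
        /[^[:space:]]/ {             # skip the whitespace-only record after the last }
            print
            print "-----"            # replace this with the real per-block processing
        }
    ' "$1"
}
```

Running print_blocks samp3.txt should give the same output as the two loops, without starting awk once per block and without a shell array.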

Question 2

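The assumptions in my code may not be accurate, which means it can fail in many cases. There may well be more efficient solutions.

Assumption 1: each GROUP block is separated from the next by a blank line.

Assumption 2: you want to perform some action on each block.

Assumption 3: the GROUP numbers are incrementing (otherwise you may end up with many empty files).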

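for i in {1..5}; do
  # RS="" is paragraph mode: each blank-line-separated block is one record.
  # Note: the regex GROUP1 would also match GROUP10 and up, if such groups exist.
  awk -v RS="" -v inc="GROUP$i" '$0 ~ inc' "$inputfile" |
    sed '/\/\|{\|}/d' > "output$i.txt"   # \| is GNU sed; drops the //GROUP, { and } lines
done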

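Your example contained GROUP1 and GROUP4; I added a GROUP5 and wrote a for loop that counts through the range 1-5. Each number in that range is used as the key for picking out the matching GROUP block. If you have more groups, widen the range accordingly.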
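
If the group numbers are not known in advance, one way to avoid hard-coding the range (and the empty files for missing numbers) is to read the numbers from the file first. This is only a sketch: the name list_groups is hypothetical, and it assumes every header starts at the beginning of a line in the form //GROUP followed by digits.

```shell
# list_groups FILE: print each GROUP number found in FILE, one per line,
# sorted numerically with duplicates removed.
list_groups() {
    sed -n 's|^//GROUP\([0-9][0-9]*\).*|\1|p' "$1" | sort -nu
}
```

The loop header could then read for i in $(list_groups "$inputfile"); do instead of for i in {1..5}; do.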

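awk is used inside the loop to extract each chunk, sed then cleans it up (awk could do it all in one go, but I'm still learning), and each chunk is written to its own output file, numbered to match its GROUP.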
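
For comparison, here is one possible single-pass awk version of the same idea, offered as a sketch rather than a drop-in replacement. The name split_groups is hypothetical. It assumes each //GROUP header appears only once (a repeated header would overwrite the earlier file) and that braces sit alone on their lines. Unlike the sed step, it skips only lines that consist of a single brace, and it creates no files for group numbers that are absent.

```shell
# split_groups FILE: write the body lines of each //GROUP<N> section
# to output<N>.txt in the current directory.
split_groups() {
    awk '
        /^\/\/GROUP[0-9]+/ {                 # header line such as //GROUP4
            if (out != "") close(out)        # finish the previous output file
            n = $0
            sub(/^\/\/GROUP/, "", n)         # keep what follows the GROUP prefix
            sub(/[^0-9].*/, "", n)           # and cut anything after the digits
            out = "output" n ".txt"
            next
        }
        /^[{}]$/ || /^[[:space:]]*$/ { next }   # skip brace lines and blank lines
        out != "" { print > out }               # body line: write to the current file
    ' "$1"
}
```

With the input file shown below, split_groups "$inputfile" should produce output1.txt, output4.txt and output5.txt with the same contents as the loop.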

Input file

//GROUP1
{
first       "John"
address     "124 Main Street"
last    "Jones"
special     "supervisor"
age "35"
gender      "male"
}

//GROUP4
{
first       "John"
address     "125 Main Street"
last    "Jacob"
age "30"
gender      "male"
}
{
first       "John"
address     "523 Main Street"
last    "Jingle"
age "40"
gender      "male"
}

//GROUP5
{
first       "Maria"
address     "188 John Street"
last    "Phones"
special     "Supervisors supervisor"
age "35"
gender      "Female"
}

Output

cat output1.txt
first       "John"
address     "124 Main Street"
last    "Jones"
special     "supervisor"
age "35"
gender      "male"

cat output4.txt
first       "John"
address     "125 Main Street"
last    "Jacob"
age "30"
gender      "male"
first       "John"
address     "523 Main Street"
last    "Jingle"
age "40"
gender      "male"

cat output5.txt
first       "Maria"
address     "188 John Street"
last    "Phones"
special     "Supervisors supervisor"
age "35"
gender      "Female"

Answer