버전 2 확인

Question 1

나는 Perl 전문가는 아니지만 여기에 가능한 해킹이 있습니다. 올인원 같은데소스 파일, 출력 문자열에서 단일 문자 ack만 처리하는 것 같습니다 . $여러 문자를 허용하도록 이를 변경하면 확실히 작동하지만 간단하게 유지하려면 를 사용할 수 있습니다 0..9. 예를 들어, and as 및 ( 로 표시됨 ) abc...을 허용하도록 변경했습니다.$a$b$10$11diff -u

@@ -188,7 +188,7 @@
         $opt_output =~ s/\\r/\r/g;
         $opt_output =~ s/\\t/\t/g;
 
-        my @supported_special_variables = ( 1..9, qw( _ . ` & ' +  f ) );
+        my @supported_special_variables = ( 1..9, qw( a b _ . ` & ' +  f ) );
         @special_vars_used_by_opt_output = grep { $opt_output =~ /\$$_/ } @supported_special_variables;
 
         # If the $opt_output contains $&, $` or $', those vars won't be
@@ -924,6 +924,8 @@
                 # on them not changing in the process of doing the s///.
 
                 my %keep = map { ($_ => ${$_} // '') } @special_vars_used_by_opt_output;
+                $keep{a} = $10;
+                $keep{b} = $11;
                 $keep{_} = $line if exists $keep{_}; # Manually set it because $_ gets reset in a map.
                 $keep{f} = $filename if exists $keep{f};
                 my $special_vars_used_by_opt_output = join( '', @special_vars_used_by_opt_output );

그러나 10번째 일치 항목만 원하는 경우 $+다음과 같이 사용할 수 있습니다.마지막으로 성공한 검색 패턴의 마지막 대괄호와 일치하는 텍스트.

Answer

나는 Perl 전문가는 아니지만 여기에 가능한 해킹이 있습니다. 올인원 같은데소스 파일, 출력 문자열에서 단일 문자 ack만 처리하는 것 같습니다 . $여러 문자를 허용하도록 이를 변경하면 확실히 작동하지만 간단하게 유지하려면 를 사용할 수 있습니다 0..9. 예를 들어, and as 및 ( 로 표시됨 ) abc...을 허용하도록 변경했습니다.$a$b$10$11diff -u

@@ -188,7 +188,7 @@
         $opt_output =~ s/\\r/\r/g;
         $opt_output =~ s/\\t/\t/g;
 
-        my @supported_special_variables = ( 1..9, qw( _ . ` & ' +  f ) );
+        my @supported_special_variables = ( 1..9, qw( a b _ . ` & ' +  f ) );
         @special_vars_used_by_opt_output = grep { $opt_output =~ /\$$_/ } @supported_special_variables;
 
         # If the $opt_output contains $&, $` or $', those vars won't be
@@ -924,6 +924,8 @@
                 # on them not changing in the process of doing the s///.
 
                 my %keep = map { ($_ => ${$_} // '') } @special_vars_used_by_opt_output;
+                $keep{a} = $10;
+                $keep{b} = $11;
                 $keep{_} = $line if exists $keep{_}; # Manually set it because $_ gets reset in a map.
                 $keep{f} = $filename if exists $keep{f};
                 my $special_vars_used_by_opt_output = join( '', @special_vars_used_by_opt_output );

그러나 10번째 일치 항목만 원하는 경우 $+다음과 같이 사용할 수 있습니다.마지막으로 성공한 검색 패턴의 마지막 대괄호와 일치하는 텍스트.

Question 2

세 가지 대체 솔루션:

버전 2 확인

ack 버전 2에서는 변수 $10 $11등이 유효한 것 같습니다.

$ echo 'abcdefghijklmn' | 
  ack '(.)(.)(.)(.)(.)(.)(.)(.)(.)(.)(.)' \
  --output '$1 $2 $3 $11'

a b c k

$ ack --version
ack 2.24
Running under Perl 5.28.1 at /usr/bin/perl

그중에서도 얻으려면겹치는문자열은 다음과 같습니다:

echo 'abcdefghijklmn' |
    ack '(.)(?=(.)(.)(.)(.)(.)(.)(.)(.)(.)(.))' \
    --output '$1 $2 $3 $11'
a b c k
b c d l
c d e m
d e f n

펄5

그러나 다음을 통해 Perl에서 직접 동일한 작업을 수행할 수 있습니다.

echo 'abcdefghijklmn' | 
    perl -ne 'while($_ =~ /(.)(?=(.)(.)(.)(.)(.)(.)(.)(.)(.)(.))/g ){
        print $1," ",$2," ",$11," ","\n" }'
a b k
b c l
c d m
d e n

따라서 단어를 찾아 인쇄하려면(하나 이상의 공백으로 구분):

echo "word1 word2 word3 word4 word5 word6" |
    perl -ne 'while($_ =~ /(\S+) +(?=(\S+) +(\S+) +(\S+))/g ){$,=" ";print $1,$2,$3,$4,"\n" }'

word1 word2 word3 word4 
word2 word3 word4 word5 
word3 word4 word5 word6

인쇄된 줄에는 뒤에 공백이 있습니다(상관하지 않기를 바랍니다).

펄6

:ov또는 (겹침) 수정자를 사용하여 Perl6(Raku)을 사용해 볼 수도 있습니다.

echo "one two three four five" | 
    perl6 -ne 'my @var = $_.match(/ <|w> \w+ [" "+ \w+]**2 <|w> /, :ov); say @var.join("\n") ;'

one two three
two three four
three four five

단일 숫자를 변경하면 다른 개수도 일치됩니다.

echo "one two three four five" | 
perl6 -ne 'my @var = $_.match(/ <|w> \w+ [" "+ \w+]**3 <|w> /, :ov); say @var.join("\n") ;'

one two three four
two three four five

결과

Perl5를 사용하면 결과는 다음과 같습니다.

perl -ne 'while($_ =~ /(\S+) +(?=(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+))/g ){
 $,=" ";
 print $1,$2,$3,$4,$5,$6,$7,$8,$9,$10,"\n" 
}' TWAIN_Mark_complete_parsed.txt | 
    sort | 
    uniq -c | 
    sort -rn >Twain_10grams5.txt

Perl6은 이러한 대규모 테스트 텍스트를 완료할 수 없습니다(메모리가 너무 많음)(Perl6은 아직 너무 새롭습니다). ack를 사용하는 것은 perl5보다 훨씬 느리지만 파일은 동일합니다.

head -n 10 Twain_10grams5.txt
     17 to mrs jane clemens and mrs moffett in st louis 
      8 ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- 
      7 in his home had been wounded and bruised almost to 
      7 his home had been wounded and bruised almost to death 
      7 happiness in his home had been wounded and bruised almost 
      6 shelley's happiness in his home had been wounded and bruised 
      5 was by the social fireside in the time of the 
      5 thing indeed if you would like to listen to it 
      5 laughable thing indeed if you would like to listen to 
      5 it was in this way that he found out that

Answer

세 가지 대체 솔루션:

버전 2 확인

ack 버전 2에서는 변수 $10 $11등이 유효한 것 같습니다.

$ echo 'abcdefghijklmn' | 
  ack '(.)(.)(.)(.)(.)(.)(.)(.)(.)(.)(.)' \
  --output '$1 $2 $3 $11'

a b c k

$ ack --version
ack 2.24
Running under Perl 5.28.1 at /usr/bin/perl

그중에서도 얻으려면겹치는문자열은 다음과 같습니다:

echo 'abcdefghijklmn' |
    ack '(.)(?=(.)(.)(.)(.)(.)(.)(.)(.)(.)(.))' \
    --output '$1 $2 $3 $11'
a b c k
b c d l
c d e m
d e f n

펄5

그러나 다음을 통해 Perl에서 직접 동일한 작업을 수행할 수 있습니다.

echo 'abcdefghijklmn' | 
    perl -ne 'while($_ =~ /(.)(?=(.)(.)(.)(.)(.)(.)(.)(.)(.)(.))/g ){
        print $1," ",$2," ",$11," ","\n" }'
a b k
b c l
c d m
d e n

따라서 단어를 찾아 인쇄하려면(하나 이상의 공백으로 구분):

echo "word1 word2 word3 word4 word5 word6" |
    perl -ne 'while($_ =~ /(\S+) +(?=(\S+) +(\S+) +(\S+))/g ){$,=" ";print $1,$2,$3,$4,"\n" }'

word1 word2 word3 word4 
word2 word3 word4 word5 
word3 word4 word5 word6

인쇄된 줄에는 뒤에 공백이 있습니다(상관하지 않기를 바랍니다).

펄6

:ov또는 (겹침) 수정자를 사용하여 Perl6(Raku)을 사용해 볼 수도 있습니다.

echo "one two three four five" | 
    perl6 -ne 'my @var = $_.match(/ <|w> \w+ [" "+ \w+]**2 <|w> /, :ov); say @var.join("\n") ;'

one two three
two three four
three four five

단일 숫자를 변경하면 다른 개수도 일치됩니다.

echo "one two three four five" | 
perl6 -ne 'my @var = $_.match(/ <|w> \w+ [" "+ \w+]**3 <|w> /, :ov); say @var.join("\n") ;'

one two three four
two three four five

결과

Perl5를 사용하면 결과는 다음과 같습니다.

perl -ne 'while($_ =~ /(\S+) +(?=(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+) +(\S+))/g ){
 $,=" ";
 print $1,$2,$3,$4,$5,$6,$7,$8,$9,$10,"\n" 
}' TWAIN_Mark_complete_parsed.txt | 
    sort | 
    uniq -c | 
    sort -rn >Twain_10grams5.txt

Perl6은 이러한 대규모 테스트 텍스트를 완료할 수 없습니다(메모리가 너무 많음)(Perl6은 아직 너무 새롭습니다). ack를 사용하는 것은 perl5보다 훨씬 느리지만 파일은 동일합니다.

head -n 10 Twain_10grams5.txt
     17 to mrs jane clemens and mrs moffett in st louis 
      8 ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- 
      7 in his home had been wounded and bruised almost to 
      7 his home had been wounded and bruised almost to death 
      7 happiness in his home had been wounded and bruised almost 
      6 shelley's happiness in his home had been wounded and bruised 
      5 was by the social fireside in the time of the 
      5 thing indeed if you would like to listen to it 
      5 laughable thing indeed if you would like to listen to 
      5 it was in this way that he found out that

버전 2 확인

문제 배경

질문/내가 시도한 것

예상/원하는 출력

댓글에서 편집

댓글 분석

시스템 세부정보

답변1

답변2

버전 2 확인

펄5

펄6

결과

관련 정보