如何使用正则表达式从文件中读取信息并打印出来?

时间:2022-09-06 13:13:28

I am working with the following data:

我正在使用以下数据:

__DATA__
Branch 1: 10..11

      13 E 0.496 -> Q 0.724
      18 S 0.507 -> R 0.513
      19 N 0.485 -> S 0.681

Branch 2: 11..12

      81 R 0.891 -> Q 0.639
      88 Y 0.987 -> S 0.836

From the above data, I want to read numbers present before a character and then print them out. So for instance in Branch 1 I want to read 13 , 18, 19 and from Branch 2, I want to read 81 88.

从上面的数据,我想读取一个字符前的数字,然后打印出来。因此,例如在分支1中我想阅读13,18,19和分支2,我想阅读81 88。

I have written the following code for reading Branch 1 but it doesn't seem to work. Any suggestions?

我已经编写了以下代码来阅读第1分支,但它似乎没有用。有什么建议?

#!/usr/bin/perl
use strict;
use warnings;

my $find = 'Branch 1:';
$a = '0';

open (FILE, "/user/Desktop/file") || die "can't open file \n";
while (my $body = <FILE>) {
    if ($body =~ m/$find/) {
        my $value = my $body=~ m/\d{2}\s[A-Z]/;
        print "$value \n";
    }
    else {
        $a++; # only to check it reading anything
        print "$a \n";
    }
}
__END__

3 个解决方案

#1


The following seems to do what you want:

以下似乎做你想要的:

#!/usr/bin/perl

use strict;
use warnings;

while ( <DATA> ) {
    if ( /^(Branch [0-9]+): / or /^\s+([0-9]+) [A-Z]/ ) {
        print "$1\n";
    }
}

__DATA__
Branch 1: 10..11

      13 E 0.496 -> Q 0.724
      18 S 0.507 -> R 0.513
      19 N 0.485 -> S 0.681

Branch 2: 11..12

      81 R 0.891 -> Q 0.639
      88 Y 0.987 -> S 0.836

#2


m/\d{2}\s[A-Z]0/

should be

m/\d{2}\s[A-Z]/

or possibly

m/\d{2}\s[A-Z]\s0/

or even

m/\d{2}\s[A-Z]\s\d/

The point being that your code is expecting a 0 immediately after the letter, but there is a space inbetween.

关键是你的代码在字母后面紧跟一个0,但中间有一个空格。

#3


one thing is that you do not have a space between your [A-Z] and the 0

有一点是你的[A-Z]和0之间没有空格

#1


The following seems to do what you want:

以下似乎做你想要的:

#!/usr/bin/perl

use strict;
use warnings;

while ( <DATA> ) {
    if ( /^(Branch [0-9]+): / or /^\s+([0-9]+) [A-Z]/ ) {
        print "$1\n";
    }
}

__DATA__
Branch 1: 10..11

      13 E 0.496 -> Q 0.724
      18 S 0.507 -> R 0.513
      19 N 0.485 -> S 0.681

Branch 2: 11..12

      81 R 0.891 -> Q 0.639
      88 Y 0.987 -> S 0.836

#2


m/\d{2}\s[A-Z]0/

should be

m/\d{2}\s[A-Z]/

or possibly

m/\d{2}\s[A-Z]\s0/

or even

m/\d{2}\s[A-Z]\s\d/

The point being that your code is expecting a 0 immediately after the letter, but there is a space inbetween.

关键是你的代码在字母后面紧跟一个0,但中间有一个空格。

#3


one thing is that you do not have a space between your [A-Z] and the 0

有一点是你的[A-Z]和0之间没有空格