从文本文件中获取数据

时间:2021-08-13 09:47:10

I'm trying to extract data from a text file with the following structure:

我正在尝试使用以下结构从文本文件中提取数据:

Employee: John C.
  2013-01-01  10  $123
  2013-01-02  12  $120
  2013-01-03  8  $150
Employee: Michael G.
  2013-01-01  5  $13
  2013-01-05  11  $20
  2013-01-10  2  $155

As you can see, the pattern is a table header containing the Employee name and then table content containing all of its transactions, then the pattern repeats.

如您所见,模式是一个包含Employee名称的表头,然后是包含其所有事务的表内容,然后重复模式。

To extract transactions I do this:

要提取交易,我这样做:

awk '/^  [A-Z]/{print $1"\t"$2"\t"$3}'

This gives this result:

这给出了这个结果:

  2013-01-01  10  $123
  2013-01-02  12  $120
  2013-01-03  8   $150
  2013-01-01  5   $13
  2013-01-05  11  $20
  2013-01-10  2   $155

How can I create a two pass extraction that returns this:

如何创建一个返回此的两遍提取:

  2013-01-01  10  $123  John C.
  2013-01-02  12  $120  John C.
  2013-01-03  8   $150  John C.
  2013-01-01  5   $13   Michael G.
  2013-01-05  11  $20   Michael G.
  2013-01-10  2   $155  Michael G.

2 个解决方案

#1


5  

One way with awk:

awk的一种方法:

awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file

Test:

$ cat file
Employee: John C.
  2013-01-01  10  $123
  2013-01-02  12  $120
  2013-01-03  8  $150
Employee: Michael G.
  2013-01-01  5  $13
  2013-01-05  11  $20
  2013-01-10  2  $155
$ awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
  2013-01-01  10  $123  John C.
  2013-01-02  12  $120  John C.
  2013-01-03  8  $150  John C.
  2013-01-01  5  $13  Michael G.
  2013-01-05  11  $20  Michael G.
  2013-01-10  2  $155  Michael G.

#2


2  

Code for GNU sed:

GNU sed代码:

sed '/:/{s/[^:]\+://;H;x;s/.*\n//;d};G;s/\n//' file

#1


5  

One way with awk:

awk的一种方法:

awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file

Test:

$ cat file
Employee: John C.
  2013-01-01  10  $123
  2013-01-02  12  $120
  2013-01-03  8  $150
Employee: Michael G.
  2013-01-01  5  $13
  2013-01-05  11  $20
  2013-01-10  2  $155
$ awk -F":" '/^Employee/{a=$NF;next}{print $0,a}' file
  2013-01-01  10  $123  John C.
  2013-01-02  12  $120  John C.
  2013-01-03  8  $150  John C.
  2013-01-01  5  $13  Michael G.
  2013-01-05  11  $20  Michael G.
  2013-01-10  2  $155  Michael G.

#2


2  

Code for GNU sed:

GNU sed代码:

sed '/:/{s/[^:]\+://;H;x;s/.*\n//;d};G;s/\n//' file