记录最后一个Bash命令到脚本中的文件

I write lots of small scripts to manipulate files on a Bash-based server. I would like to have a mechanism by which to log which commands created which files in a given directory. However, I don't just want to capture every input command, all the time.

我编写了许多小脚本来操作基于bashbased服务器上的文件。我希望有一种机制来记录在给定目录中创建哪些命令的命令。但是，我并不是想一直捕获每个输入命令。

Approach 1: a wrapper script that uses a Bash builtin (a la history or fc -ln -1) to grab the last command and write it to a log file. I have not been able to figure out any way to do this, as the shell builtin commands do not appear to be recognized outside of the interactive shell.

方法1:一个包装器脚本，使用Bash builtin (la history或fc -ln -1)获取最后一个命令并将其写入日志文件。我还没有找到任何方法来实现这一点，因为shell内置命令似乎在交互shell之外是无法识别的。

Approach 2: a wrapper script that pulls from ~/.bash_history to get the last command. This, however, requires setting up the Bash shell to flush every command to history immediately (as per this comment) and seems also to require that the history be allowed to grow inexorably. If this is the only way, so be it, but it would be great to avoid having to edit the ~/.bashrc file on every system where this might be implemented.

方法2:来自~/的包装器脚本。获取最后一个命令的bash_history。然而，这需要设置Bash shell，以便立即将每个命令刷新到history(正如本文所述)，而且似乎还需要允许历史无情地增长。如果这是唯一的方法，那就这样吧，但是最好避免编辑~/。bashrc文件适用于可能实现此功能的每个系统。

Approach 3: use script. My problem with this is that it requires multiple commands to start and stop the logging, and because it launches its own shell it is not callable from within another script (or at least, doing so complicates things significantly).

方法三:使用脚本。我的问题是，它需要多个命令来启动和停止日志记录，而且因为它启动了自己的shell，所以它不能从另一个脚本中调用(或者至少这样做会使事情变得非常复杂)。

I am trying to figure out an implementation that's of the form log_this.script other_script other_arg1 other_arg2 > file, where everything after the first argument is logged. The emphasis here is on efficiency and minimizing syntax overhead.

我正在尝试找出一个log_this表单的实现。脚本other_arg2 >文件，其中记录了第一个参数之后的所有内容。这里的重点是效率和最小化语法开销。

EDIT: iLoveTux and I both came up with similar solutions. For those interested, my own implementation follows. It is somewhat more constrained in its functionality than the accepted answer, but it also auto-updates any existing logfile entries with changes (though not deletions).

编辑:我和iLoveTux都有类似的解决方案。对于感兴趣的人，下面是我自己的实现。它的功能比公认的答案更受限制，但它也会使用更改(但不删除)自动更新任何现有的logfile条目。

Sample usage:

示例用法:

$ cmdlog.py "python3 test_script.py > test_file.txt"

creates a log file in the parent directory of the output file with the following:

在输出文件的父目录中创建一个日志文件，其格式如下:

2015-10-12@10:47:09 test_file.txt   "python3 test_script.py > test_file.txt"

Additional file changes are added to the log;

将附加的文件更改添加到日志中;

$ cmdlog.py "python3 test_script.py > test_file_2.txt"

the log now contains

日志现在包含

2015-10-12@10:47:09 test_file.txt   "python3 test_script.py > test_file.txt"
2015-10-12@10:47:44 test_file_2.txt "python3 test_script.py > test_file_2.txt"

Running on the original file name again changes the file order in the log, based on modification time of the files:

再次运行在原始文件名上，根据文件的修改时间更改日志中的文件顺序:

$ cmdlog.py "python3 test_script.py > test_file.txt"

produces

生产

2015-10-12@10:47:44 test_file_2.txt "python3 test_script.py > test_file_2.txt"
2015-10-12@10:48:01 test_file.txt   "python3 test_script.py > test_file.txt"

Full script:

完整脚本:

#!/usr/bin/env python3

'''
A wrapper script that will write the command-line
args associated with any files generated to a log
file in the directory where the files were made.

'''
import sys
import os
from os import listdir
from os.path import isfile, join
import subprocess
import time
from datetime import datetime

def listFiles(mypath):
    """
    Return relative paths of all files in mypath

    """
    return [join(mypath, f) for f in listdir(mypath) if
            isfile(join(mypath, f))]

def read_log(log_file):
    """
    Reads a file history log and returns a dictionary
    of {filename: command} entries.

    Expects tab-separated lines of [time, filename, command]

    """
    entries = {}
    with open(log_file) as log:
        for l in log:
            l = l.strip()
            mod, name, cmd = l.split("\t")
            # cmd = cmd.lstrip("\"").rstrip("\"")
            entries[name] = [cmd, mod]
    return entries

def time_sort(t, fmt):
    """
    Turn a strftime-formatted string into a tuple
    of time info

    """
    parsed = datetime.strptime(t, fmt)
    return parsed

ARGS = sys.argv[1]
ARG_LIST = ARGS.split()

# Guess where logfile should be put
if (">" or ">>") in ARG_LIST:
    # Get position after redirect in arg list
    redirect_index = max(ARG_LIST.index(e) for e in ARG_LIST if e in ">>")
    output = ARG_LIST[redirect_index + 1]
    output = os.path.abspath(output)
    out_dir = os.path.dirname(output)
elif ("cp" or "mv") in ARG_LIST:
    output = ARG_LIST[-1]
    out_dir = os.path.dirname(output)
else:
     out_dir = os.getcwd()

# Set logfile location within the inferred output directory
LOGFILE = out_dir + "/cmdlog_history.log"

# Get file list state prior to running
all_files = listFiles(out_dir)
pre_stats = [os.path.getmtime(f) for f in all_files]

# Run the desired external commands
subprocess.call(ARGS, shell=True)

# Get done time of external commands
TIME_FMT = "%Y-%m-%d@%H:%M:%S"
log_time = time.strftime(TIME_FMT)

# Get existing entries from logfile, if present
if LOGFILE in all_files:
    logged = read_log(LOGFILE)
else:
    logged = {}

# Get file list state after run is complete
post_stats = [os.path.getmtime(f) for f in all_files]
post_files = listFiles(out_dir)

# Find files whose states have changed since the external command
changed = [e[0] for e in zip(all_files, pre_stats, post_stats) if e[1] != e[2]]
new = [e for e in post_files if e not in all_files]
all_modded = list(set(changed + new))

if not all_modded:  # exit early, no need to log
    sys.exit(0)

# Replace files that have changed, add those that are new
for f in all_modded:
    name = os.path.basename(f)
    logged[name] = [ARGS, log_time]

# Write changed files to logfile
with open(LOGFILE, 'w') as log:
    for name, info in sorted(logged.items(), key=lambda x: time_sort(x[1][1], TIME_FMT)):
        cmd, mod_time = info
        if not cmd.startswith("\""):
            cmd = "\"{}\"".format(cmd)
        log.write("\t".join([mod_time, name, cmd]) + "\n")

sys.exit(0)

2 个解决方案

#1

OK, so you don't mention Python in your question, but it is tagged Python, so I figured I would see what I could do. I came up with this script:

好，你的问题中没有提到Python，但是它被标记为Python，所以我想我会看看我能做什么。我想到了这个剧本:

import sys
from os.path import expanduser, join
from subprocess import Popen, PIPE

def issue_command(command):
    process = Popen(command, stdout=PIPE, stderr=PIPE, shell=True)
    return process.communicate()

home = expanduser("~")
log_file = join(home, "command_log")

command = sys.argv[1:]
with open(log_file, "a") as fout:
    fout.write("{}\n".format(" ".join(command)))

out, err = issue_command(command)

which you can call like (if you name it log_this and make it executable):

你可以像这样调用(如果你命名它log_this并使它可执行):

$ log_this echo hello world

and it will put "echo hello world" in a file ~/command_log, note though that if you want to use pipes or redirection you have to quote your command (this may be a real downfall for your use case or it may not be, but I haven't figured out how to do this just yet without the quotes) like this:

它将“回声hello world”文件~ / command_log,不过要注意,如果你想使用管道或重定向你引用命令(这可能是一个真正的失败你的用例或它可能不是,但是我还没有想出如何做这个没有引号)是这样的:

$ log_this "echo hello world | grep h >> /tmp/hello_world"

but since it's not perfect, I thought I would add a little something extra.

但由于它并不完美，我想我应该再加一点。

The following script allows you to specify a different file to log your commands to as well as record the execution time of the command:

下面的脚本允许您指定一个不同的文件来记录您的命令，并记录命令的执行时间:

#!/usr/bin/env python
from subprocess import Popen, PIPE
import argparse
from os.path import expanduser, join
from time import time


def issue_command(command):
    process = Popen(command, stdout=PIPE, stderr=PIPE, shell=True)
    return process.communicate()

home = expanduser("~")
default_file = join(home, "command_log")

parser = argparse.ArgumentParser()
parser.add_argument("-f", "--file", type=argparse.FileType("a"), default=default_file)
parser.add_argument("-p", "--profile", action="store_true")
parser.add_argument("command", nargs=argparse.REMAINDER)
args = parser.parse_args()

if args.profile:
    start = time()
    out, err = issue_command(args.command)
    runtime = time() - start
    entry = "{}\t{}\n".format(" ".join(args.command), runtime)
    args.file.write(entry)
else:
    out, err = issue_command(args.command)
    entry = "{}\n".format(" ".join(args.command))
    args.file.write(entry)

args.file.close()

You would use this the same way as the other script, but if you wanted to specify a different file to log to just pass -f <FILENAME> before your actual command and your log will go there, and if you wanted to record the execution time just provide the -p (for profile) before your actual command like so:

你会使用这个其他脚本一样,但是如果你想指定一个不同的文件记录传递- f <文件名> 在你实际指挥你的日志会去那里,如果你想记录执行时间前提供- p(剖面)实际的命令如下所示:

$ log_this -p -f ~/new_log "echo hello world | grep h >> /tmp/hello_world"

I will try to make this better, but if you can think of anything else this could do for you, I am making a github project for this where you can submit bug reports and feature requests.

我将尽量使它更好，但是如果您能想到其他任何可以为您做的事情，我正在做一个github项目，您可以提交bug报告和功能请求。

#2

You can use the tee command, which stores its standard input to a file and outputs it on standard output. Pipe the command line into tee, and pipe tee's output into a new invocation of your shell:

您可以使用tee命令，该命令将其标准输入存储到文件中，并将其输出到标准输出。将命令行导入到tee中，并将管道tee的输出转换为您的shell的新调用:

echo '<command line to be logged and executed>' | \
    tee --append /path/to/your/logfile | \
    $SHELL

i.e., for your example of other_script other_arg1 other_arg2 > file,

即。，对于other_arg2 >文件的示例，

echo 'other_script other_arg1 other_arg2 > file' | \
    tee --append /tmp/mylog.log | \
    $SHELL

If your command line needs single quotes, they need to be escaped properly.

如果命令行需要单引号，则需要正确地转义它们。

#1