014_zk路径过滤分析

时间:2023-12-14 20:16:44

一、线上zk访问延迟特别高需要统计一段时间内的zk写入路径top10,实现如下:

#!/usr/bin/env python
# -*- coding:utf-8 -*-
import re,traceback def gen_range_hosts(path,n):
new_path= ""
try:
re_match = re.match(r'(.*)"path":"(.*)","version"', path, re.M | re.I)
if re_match is not None:
new_path = re_match.group(2)
except:
print "++++++++++++{n}++++++++++++{path}".format(n=n, path=path)
traceback.print_exc() return new_path def main():
with open('./publisher.log', 'r') as f:
n = 1
for line in f.readlines():
n +=1
new_line = line.strip()
if new_line.find("path") != -1:
print gen_range_hosts(new_line,n) if __name__ == '__main__':
main()
'''
<1>过滤日志命令:
cat newlog.log |egrep -v "^$"|sort |uniq -c|sort -rn >> okok.log
'''

二、可以根据指定时间过滤日志路径的功能需要实现。