论文--搜索批量下载_linux

时间:2023-01-14 14:53:49
 //搜索论文用的shell命令sed 6步轻松搞定
@wget -O networks.html url // 每一页25或者更多的pdf@cat  networks.html |grep "/stamp/stamp.jsp?tp=&arnumber=" | sed -e 's#"><img #\n #;s/^[ \t]*<a href="/ /;s# */stamp/#http://ieeexplore.ieee.org/stamp/#'|grep http://iee >myurl.txt
url.txt结果:25行或者50行100http://ieeexplore.ieee.org/ielx5/9121/28917/01301384.pdf
//将相对下载地址转换为 绝对地址@cat myurl.txt  |xargs wget 得到包含绝对地址的文件如:stamp.jsp\?tp\=\&arnumber\=4566619 
//如果在别的目录里面提取方法:@ ls |grep stamp >relative.txt//提取绝对地址:@cat relative.txt|xargs cat |grep http://iee|sed 's#[ \t]*<frame src="##;s#" frameborder=0./>##' >abs.txt如:http://ieeexplore.ieee.org/ielx5/5488974/5489112/05489243.pdf?tp=&arnumber=5489243&isnumber=5489112
//进行下载当前目录下@cat abs.txt |xargs wget //删除垃圾文件@ls |grep stamp |xargs rm
@//修改文件名
# for i in $(ls .|grep isnumber); do mv  $i $(echo $i |sed 's#?.*##'); done