1、hadoop会以块的形式存储在HDFS系统。通过命令可以查看所在节点和块的位置:
[root@master softpackage]# hadoop fs -put scala-2.10.4.tgz /[root@master softpackage]# hadoop fsck /scala-2.10.4.tgz -files -locations -blocksDEPRECATED: Use of this script to execute hdfs command is deprecated.Instead use the hdfs command for it.
Connecting to namenode via http://master:50070/fsck?ugi=root&files=1&locations=1&blocks=1&path=%2Fscala-2.10.4.tgzFSCK started by root (auth:SIMPLE) from /192.168.86.133 for path /scala-2.10.4.tgz at Fri Jun 09 11:14:14 EDT 2017/scala-2.10.4.tgz 29937534 bytes, 1 block(s): Under replicated BP-1810807976-192.168.86.133-1496888566245:blk_1073741829_1005. Target Replicas is 3 but found 2 replica(s).0. BP-1810807976-192.168.86.133-1496888566245:blk_1073741829_1005 len=29937534 repl=2 [DatanodeInfoWithStorage[192.168.86.132:50010,DS-ead6ac48-ce41-4133-9552-ec5ca51a6204,DISK], DatanodeInfoWithStorage[192.168.86.134:50010,DS-5059d0f7-4e64-4554-aa92-375a1fe573b8,DISK]]
Status: HEALTHY Total size:29937534 B Total dirs:0 Total files:1 Total symlinks:0 Total blocks (validated):1 (avg. block size 29937534 B) Minimally replicated blocks:1 (100.0 %) Over-replicated blocks:0 (0.0 %) Under-replicated blocks:1 (100.0 %) Mis-replicated blocks:0 (0.0 %) Default replication factor:3 Average block replication:2.0 Corrupt blocks:0 Missing replicas:1 (33.333332 %) Number of data-nodes:2 Number of racks:1FSCK ended at Fri Jun 09 11:14:14 EDT 2017 in 2 milliseconds
The filesystem under path '/scala-2.10.4.tgz' is HEALTHY
然后去datanode 查找具体位置:
[root@slave2 subdir0]# du -sh *224K blk_10737418254.0K blk_1073741825_1001.meta4.0K blk_10737418274.0K blk_1073741827_1003.meta4.0K blk_10737418284.0K blk_1073741828_1004.meta29M blk_1073741829232K blk_1073741829_1005.meta[root@slave2 subdir0]# pwd/opt/hadoop/dfs/data/current/BP-1810807976-192.168.86.133-1496888566245/current/finalized/subdir0/subdir0
可参考:
http://www.myexception.cn/database/1997522.html
相关文章
- 机器学习进阶-光流估计 1.cv2.goodFeaturesToTrack(找出光流估计所需要的角点) 2.cv2.calcOpticalFlowPyrLK(获得光流检测后的角点位置) 3.cv2.add(进行像素点的加和)
- hadoop学习3 查找块的位置
- ubuntu系统下怎样查找opencv3的安装位置,已经安装opencv contrib
- MVC3+EF4.1学习系列(五)----- EF查找导航属性的几种方式
- 机器学习进阶-案例实战-图像全景拼接-图像全景拼接(RANSCA) 1.sift.detectAndComputer(获得sift图像关键点) 2.cv2.findHomography(计算单应性矩阵H) 3.cv2.warpPerspective(获得单应性变化后的图像) 4.cv2.line(对关键点位置进行连线画图)
- 深度学习原理与框架-神经网络-cifar10分类(代码) 1.np.concatenate(进行数据串接) 2.np.hstack(将数据横着排列) 3.hasattr(判断.py文件的函数是否存在) 4.reshape(维度重构) 5.tanspose(维度位置变化) 6.pickle.load(f文件读入) 7.np.argmax(获得最大值索引) 8.np.maximum(阈值比较)
- Ant学习总结3(很多的属性,用的时候方便查找)
- 机器学习进阶-图像金字塔与轮廓检测-模板匹配(单目标匹配和多目标匹配)1.cv2.matchTemplate(进行模板匹配) 2.cv2.minMaxLoc(找出矩阵最大值和最小值的位置(x,y)) 3.cv2.rectangle(在图像上画矩形)
- hadoop中查找某个字符串所在的hdfs位置