（原）torch中显示nn.Sequential()网络的详细情况

转载请注明出处：

http://www.cnblogs.com/darkknightzh/p/6065526.html

本部分多试几次就可以弄得清每一层具体怎么访问了。

step1. 网络定义如下：

require "dpnn"

local net = nn.Sequential()

net:add(nn.SpatialConvolution(, , , , , , , ))

net:add(nn.SpatialBatchNormalization())

net:add(nn.ReLU())

net:add(nn.SpatialMaxPooling(, , , , , ))

net:add(nn.Inception{

     inputSize = ,

     kernelSize = {, },

     kernelStride = {, },

     outputSize = {, },

     reduceSize = {, , , },

     pool = nn.SpatialMaxPooling(, , , , , ),

     batchNorm = true

   })

net:evaluate()

上面的网络，包含conv+BatchNorm+ReLU+Maxpool+Inception层。

step2. 直接通过print(net)便可得到其网络结构：

nn.Sequential {

  [input -> (1) -> (2) -> (3) -> (4) -> (5) -> output]

  (1): nn.SpatialConvolution(3 -> 64, 7x7, 2,2, 3,3)

  (2): nn.SpatialBatchNormalization

  (3): nn.ReLU

  (4): nn.SpatialMaxPooling(3x3, 2,2, 1,1)

  (5): nn.Inception @ nn.DepthConcat {

    input

      |`-> (1): nn.Sequential {

      |      [input -> (1) -> (2) -> (3) -> (4) -> (5) -> (6) -> output]

      |      (1): nn.SpatialConvolution(64 -> 96, 1x1)

      |      (2): nn.SpatialBatchNormalization

      |      (3): nn.ReLU

      |      (4): nn.SpatialConvolution(96 -> 128, 3x3, 1,1, 1,1)

      |      (5): nn.SpatialBatchNormalization

      |      (6): nn.ReLU

      |    }

      |`-> (2): nn.Sequential {

      |      [input -> (1) -> (2) -> (3) -> (4) -> (5) -> (6) -> output]

      |      (1): nn.SpatialConvolution(64 -> 16, 1x1)

      |      (2): nn.SpatialBatchNormalization

      |      (3): nn.ReLU

      |      (4): nn.SpatialConvolution(16 -> 32, 5x5, 1,1, 2,2)

      |      (5): nn.SpatialBatchNormalization

      |      (6): nn.ReLU

      |    }

      |`-> (3): nn.Sequential {

      |      [input -> (1) -> (2) -> (3) -> (4) -> output]

      |      (1): nn.SpatialMaxPooling(3x3, 1,1, 1,1)

      |      (2): nn.SpatialConvolution(64 -> 32, 1x1)

      |      (3): nn.SpatialBatchNormalization

      |      (4): nn.ReLU

      |    }

      |`-> (4): nn.Sequential {

             [input -> (1) -> (2) -> (3) -> output]

             (1): nn.SpatialConvolution(64 -> 64, 1x1)

             (2): nn.SpatialBatchNormalization

             (3): nn.ReLU

           }

       ... -> output

  }

}

但实际上该网络还包括input，output，gradInput等参数。

step3. 使用下面代码便可输出网络比较详细的参数：

for k,curLayer in pairs(net) do

    print(k,curLayer)

end

step4. 输出：

_type	torch.DoubleTensor

output	[torch.DoubleTensor with no dimension]

gradInput	[torch.DoubleTensor with no dimension]

modules	{

  1 :

    {

      dH : 2

      dW : 2

      nInputPlane : 3

      output : DoubleTensor - empty

      kH : 7

      train : false

      gradBias : DoubleTensor - size: 64

      padH : 3

      bias : DoubleTensor - size: 64

      weight : DoubleTensor - size: 64x3x7x7

      _type : "torch.DoubleTensor"

      gradWeight : DoubleTensor - size: 64x3x7x7

      padW : 3

      nOutputPlane : 64

      kW : 7

      gradInput : DoubleTensor - empty

    }

  2 :

    {

      gradBias : DoubleTensor - size: 64

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      running_var : DoubleTensor - size: 64

      momentum : 0.1

      gradWeight : DoubleTensor - size: 64

      eps : 1e-05

      _type : "torch.DoubleTensor"

      affine : true

      running_mean : DoubleTensor - size: 64

      bias : DoubleTensor - size: 64

      weight : DoubleTensor - size: 64

      train : false

    }

  3 :

    {

      inplace : false

      threshold : 0

      _type : "torch.DoubleTensor"

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      train : false

      val : 0

    }

  4 :

    {

      dH : 2

      dW : 2

      kW : 3

      gradInput : DoubleTensor - empty

      indices : DoubleTensor - empty

      train : false

      _type : "torch.DoubleTensor"

      padH : 1

      ceil_mode : false

      output : DoubleTensor - empty

      kH : 3

      padW : 1

    }

  5 :

    {

      outputSize :

        {

          1 : 128

          2 : 32

        }

      inputSize : 64

      gradInput : DoubleTensor - empty

      modules :

        {

          1 :

            {

              train : false

              _type : "torch.DoubleTensor"

              output : DoubleTensor - empty

              gradInput : DoubleTensor - empty

              modules :

                {

                  1 : {...}

                  2 : {...}

                  3 : {...}

                  4 : {...}

                }

              dimension : 2

              size : LongStorage - size: 0

            }

        }

      kernelStride :

        {

          1 : 1

          2 : 1

        }

      _type : "torch.DoubleTensor"

      module :

        {

          train : false

          _type : "torch.DoubleTensor"

          output : DoubleTensor - empty

          gradInput : DoubleTensor - empty

          modules :

            {

              1 :

                {

                  _type : "torch.DoubleTensor"

                  output : DoubleTensor - empty

                  gradInput : DoubleTensor - empty

                  modules : {...}

                  train : false

                }

              2 :

                {

                  _type : "torch.DoubleTensor"

                  output : DoubleTensor - empty

                  gradInput : DoubleTensor - empty

                  modules : {...}

                  train : false

                }

              3 :

                {

                  _type : "torch.DoubleTensor"

                  output : DoubleTensor - empty

                  gradInput : DoubleTensor - empty

                  modules : {...}

                  train : false

                }

              4 :

                {

                  _type : "torch.DoubleTensor"

                  output : DoubleTensor - empty

                  gradInput : DoubleTensor - empty

                  modules : {...}

                  train : false

                }

            }

          dimension : 2

          size : LongStorage - size: 0

        }

      poolStride : 1

      padding : true

      reduceStride : {...}

      transfer :

        {

          inplace : false

          threshold : 0

          _type : "torch.DoubleTensor"

          output : DoubleTensor - empty

          gradInput : DoubleTensor - empty

          val : 0

        }

      batchNorm : true

      train : false

      pool :

        {

          dH : 1

          dW : 1

          kW : 3

          gradInput : DoubleTensor - empty

          indices : DoubleTensor - empty

          train : false

          _type : "torch.DoubleTensor"

          padH : 1

          ceil_mode : false

          output : DoubleTensor - empty

          kH : 3

          padW : 1

        }

      poolSize : 3

      reduceSize :

        {

          1 : 96

          2 : 16

          3 : 32

          4 : 64

        }

      kernelSize :

        {

          1 : 3

          2 : 5

        }

      output : DoubleTensor - empty

    }

}

train	false

上面的modules中，分别为conv、BatchNorm、ReLU、Maxpool、Inception对应的参数。

step5. 可通过net.modules[1]来索引nn.SpatialConvolution。如print(net.modules[1])得到：

nn.SpatialConvolution(3 -> 64, 7x7, 2,2, 3,3)

step6. 如果想更进一步，输出该层的参数，可以使用如下代码（实际上step4中已经输出了）：

for k,curLayer in pairs(net.modules[]) do

    if type(curLayer) ~= 'userdata' then

        print(k,curLayer)

    else

        local strval = ' '

        for i = , curLayer:dim() do

            strval = strval .. curLayer:size(i) .. " "

        end

        print(k .. " " .. type(curLayer) .. " " .. string.format("\27[31m size: %s", strval))

    end

end

step7. 得到的结果为：

dH	2

dW	2

nInputPlane	3

output userdata  size:

kH	7

train	false

gradBias userdata  size:  64

padH	3

bias userdata  size:  64

weight userdata  size:  64 3 7 7

_type	torch.DoubleTensor

gradWeight userdata  size:  64 3 7 7

padW	3

nOutputPlane	64

kW	7

gradInput userdata  size:

step8. 对于Inception层，step4中并没有完全显示出来。按照step5中的方式，使用net.modules[5]来得到Inception层。将step6进行更改，可输出：

outputSize	{

  1 : 128

  2 : 32

}

inputSize	64

gradInput userdata  size:

modules	{

  1 :

    {

      train : false

      _type : "torch.DoubleTensor"

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      modules :

        {

          1 :

            {

              _type : "torch.DoubleTensor"

              output : DoubleTensor - empty

              gradInput : DoubleTensor - empty

              modules :

                {

                  1 : {...}

                  2 : {...}

                  3 : {...}

                  4 : {...}

                  5 : {...}

                  6 : {...}

                }

              train : false

            }

          2 :

            {

              _type : "torch.DoubleTensor"

              output : DoubleTensor - empty

              gradInput : DoubleTensor - empty

              modules :

                {

                  1 : {...}

                  2 : {...}

                  3 : {...}

                  4 : {...}

                  5 : {...}

                  6 : {...}

                }

              train : false

            }

          3 :

            {

              _type : "torch.DoubleTensor"

              output : DoubleTensor - empty

              gradInput : DoubleTensor - empty

              modules :

                {

                  1 : {...}

                  2 : {...}

                  3 : {...}

                  4 : {...}

                }

              train : false

            }

          4 :

            {

              _type : "torch.DoubleTensor"

              output : DoubleTensor - empty

              gradInput : DoubleTensor - empty

              modules :

                {

                  1 : {...}

                  2 : {...}

                  3 : {...}

                }

              train : false

            }

        }

      dimension : 2

      size : LongStorage - size: 0

    }

}

kernelStride	{

  1 : 1

  2 : 1

}

_type	torch.DoubleTensor

module	nn.DepthConcat {

  input

    |`-> (1): nn.Sequential {

    |      [input -> (1) -> (2) -> (3) -> (4) -> (5) -> (6) -> output]

    |      (1): nn.SpatialConvolution(64 -> 96, 1x1)

    |      (2): nn.SpatialBatchNormalization

    |      (3): nn.ReLU

    |      (4): nn.SpatialConvolution(96 -> 128, 3x3, 1,1, 1,1)

    |      (5): nn.SpatialBatchNormalization

    |      (6): nn.ReLU

    |    }

    |`-> (2): nn.Sequential {

    |      [input -> (1) -> (2) -> (3) -> (4) -> (5) -> (6) -> output]

    |      (1): nn.SpatialConvolution(64 -> 16, 1x1)

    |      (2): nn.SpatialBatchNormalization

    |      (3): nn.ReLU

    |      (4): nn.SpatialConvolution(16 -> 32, 5x5, 1,1, 2,2)

    |      (5): nn.SpatialBatchNormalization

    |      (6): nn.ReLU

    |    }

    |`-> (3): nn.Sequential {

    |      [input -> (1) -> (2) -> (3) -> (4) -> output]

    |      (1): nn.SpatialMaxPooling(3x3, 1,1, 1,1)

    |      (2): nn.SpatialConvolution(64 -> 32, 1x1)

    |      (3): nn.SpatialBatchNormalization

    |      (4): nn.ReLU

    |    }

    |`-> (4): nn.Sequential {

           [input -> (1) -> (2) -> (3) -> output]

           (1): nn.SpatialConvolution(64 -> 64, 1x1)

           (2): nn.SpatialBatchNormalization

           (3): nn.ReLU

         }

     ... -> output

}

poolStride	1

padding	true

reduceStride	{}

transfer	nn.ReLU

batchNorm	true

train	false

pool	nn.SpatialMaxPooling(3x3, 1,1, 1,1)

poolSize	3

reduceSize	{

  1 : 96

  2 : 16

  3 : 32

  4 : 64

}

kernelSize	{

  1 : 3

  2 : 5

}

output userdata  size:

step9. 在step8中，modules中为对应的inception各层（3*3卷积，5*5卷积，pooling，1*1reduce）。可通过net.modules[5].module来得到这些层。该层也有train，output，gradInput，modules等变量。可通过print(net.modules[5].module)来输出。

step10. 根据step5中的思路，可通过net.modules[5].module.modules[1]来得到3*3卷基层具体情况：

_type	torch.DoubleTensor

output userdata  size:

gradInput userdata  size:

modules	{

  1 :

    {

      dH : 1

      dW : 1

      nInputPlane : 64

      output : DoubleTensor - empty

      kH : 1

      train : false

      gradBias : DoubleTensor - size: 96

      padH : 0

      bias : DoubleTensor - size: 96

      weight : DoubleTensor - size: 96x64x1x1

      _type : "torch.DoubleTensor"

      gradWeight : DoubleTensor - size: 96x64x1x1

      padW : 0

      nOutputPlane : 96

      kW : 1

      gradInput : DoubleTensor - empty

    }

  2 :

    {

      gradBias : DoubleTensor - size: 96

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      running_var : DoubleTensor - size: 96

      momentum : 0.1

      gradWeight : DoubleTensor - size: 96

      eps : 1e-05

      _type : "torch.DoubleTensor"

      affine : true

      running_mean : DoubleTensor - size: 96

      bias : DoubleTensor - size: 96

      weight : DoubleTensor - size: 96

      train : false

    }

  3 :

    {

      inplace : false

      threshold : 0

      _type : "torch.DoubleTensor"

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      train : false

      val : 0

    }

  4 :

    {

      dH : 1

      dW : 1

      nInputPlane : 96

      output : DoubleTensor - empty

      kH : 3

      train : false

      gradBias : DoubleTensor - size: 128

      padH : 1

      bias : DoubleTensor - size: 128

      weight : DoubleTensor - size: 128x96x3x3

      _type : "torch.DoubleTensor"

      gradWeight : DoubleTensor - size: 128x96x3x3

      padW : 1

      nOutputPlane : 128

      kW : 3

      gradInput : DoubleTensor - empty

    }

  5 :

    {

      gradBias : DoubleTensor - size: 128

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      running_var : DoubleTensor - size: 128

      momentum : 0.1

      gradWeight : DoubleTensor - size: 128

      eps : 1e-05

      _type : "torch.DoubleTensor"

      affine : true

      running_mean : DoubleTensor - size: 128

      bias : DoubleTensor - size: 128

      weight : DoubleTensor - size: 128

      train : false

    }

  6 :

    {

      inplace : false

      threshold : 0

      _type : "torch.DoubleTensor"

      output : DoubleTensor - empty

      gradInput : DoubleTensor - empty

      train : false

      val : 0

    }

}

train	false

注意：此处有一个module和一个modules，具体不太明白。

step11. 可通过net.modules[5].module.modules[1].modules进一步查看该层的情况：

1	nn.SpatialConvolution(64 -> 96, 1x1)

2	nn.SpatialBatchNormalization

3	nn.ReLU

4	nn.SpatialConvolution(96 -> 128, 3x3, 1,1, 1,1)

5	nn.SpatialBatchNormalization

6	nn.ReLU

可见，该层包括1*1conv，BatchNorm，ReLU，3*3conv，BatchNorm，Relu这些。

step12. 若要查看step11中的3*3卷基层信息，可使用如下索引：

net.modules[5].module.modules[1].modules[4]

结果如下：

dH	1

dW	1

nInputPlane	96

output userdata  size:

kH	3

train	false

gradBias userdata  size:  128

padH	1

bias userdata  size:  128

weight userdata  size:  128 96 3 3

_type	torch.DoubleTensor

gradWeight userdata  size:  128 96 3 3

padW	1

nOutputPlane	128

kW	3

gradInput userdata  size:

step13. 到了step12，已经索引到了step1中网络的最深层。网络中每层均有input，output等。

step14. 对于net.modules[5]的Inception层，net.modules[5].output的结果和net.modules[5].module.output的结果是一样的，如（为方便显示，只显示了一小部分。如果输出net.modules[5].output，可能会有很多全为0的）:

local imgBatch = torch.rand(,,,)

local infer = net:forward(imgBatch)

print(net.modules[].output[][][])

print(net.modules[].module.output[][][])

结果为：

0.01 *

 2.7396

 2.9070

 3.1895

 1.5040

 1.9784

 4.0125

 3.2874

 3.3137

 2.1326

 2.3930

 2.8170

 3.5226

 2.3162

 2.7308

 2.8511

 2.5278

 3.3325

 3.0819

 3.2826

 3.5363

 2.5749

 2.8816

 2.2393

 2.4765

 2.4803

 3.2553

 3.0837

 3.1197

 2.4632

 1.5145

 3.7101

 2.1888

[torch.DoubleTensor of size 32]

0.01 *

 2.7396

 2.9070

 3.1895

 1.5040

 1.9784

 4.0125

 3.2874

 3.3137

 2.1326

 2.3930

 2.8170

 3.5226

 2.3162

 2.7308

 2.8511

 2.5278

 3.3325

 3.0819

 3.2826

 3.5363

 2.5749

 2.8816

 2.2393

 2.4765

 2.4803

 3.2553

 3.0837

 3.1197

 2.4632

 1.5145

 3.7101

 2.1888

[torch.DoubleTensor of size 32]

（原）torch中显示nn.Sequential()网络的详细情况的更多相关文章

DB中耗时的存储过程及执行详细情况
SELECT a.object_id, a.database_id, OBJECT_NAME(object_id, database_id) 'proc name', a.cached_time, a ...
小白学习之pytorch框架(2)-动手学深度学习(begin-random&period;shuffle()、torch&period;index&lowbar;select()、nn&period;Module、nn&period;Sequential())
在这向大家推荐一本书-花书-动手学深度学习pytorch版,原书用的深度学习框架是MXNet,这个框架经过Gluon重新再封装,使用风格非常接近pytorch,但是由于pytorch越来越火,个人又比 ...
（原）torch中微调某层参数
转载请注明出处: http://www.cnblogs.com/darkknightzh/p/6221664.html 参考网址: https://github.com/torch/nn/issues ...
（原）ubuntu16在torch中使用caffe训练好的模型
转载请注明出处: http://www.cnblogs.com/darkknightzh/p/5783006.html 之前使用的是torch,由于其他人在caffe上面预训练了inception模型 ...
[pytorch笔记] torch&period;nn vs torch&period;nn&period;functional&semi; model&period;eval() vs torch&period;no&lowbar;grad()&semi; nn&period;Sequential() vs nn&period;moduleList
1. torch.nn与torch.nn.functional之间的区别和联系 https://blog.csdn.net/GZHermit/article/details/78730856 nn和n ...
ios 从网络上获取图片并在UIImageView中显示
ios 从网络上获取图片 -(UIImage *) getImageFromURL:(NSString *)fileURL { NSLog(@"执行图片下载函数"); UIIm ...
torch&period;nn&period;Sequential()详解
参考:官方文档源码官方文档 nn.Sequential A sequential container. Modules will be added to it in the order th ...
Pytorch——torch&period;nn&period;Sequential()详解
参考:官方文档源码官方文档 nn.Sequential A sequential container. Modules will be added to it in the order th ...
（原）torch中threads的addjob函数使用方法
转载请注明出处: http://www.cnblogs.com/darkknightzh/p/6549452.html 参考网址: https://github.com/torch/threads#e ...

随机推荐

UNIX文件的权限之&OpenCurlyDoubleQuote;设置用户ID位”
用stat函数可以获取一个文件的状态信息,原型是这样的: int stat(const char *path, struct stat *buf); 其中结构体stat的结构: struct stat ...
ActivityManagerService是如何启动app
ActivityManagerService是如何启动app 一．上一篇文章app的启动过程,说明了launcher启动app是通过binber,让ActivityManagerServi ...
C&num; 从excel里面复制的1万6千多条记录粘贴到FCKeditor里面，点保存的时候，保存不了，页面没有反应
客户那边添加公告,是直接从excel里面复制的,有1万6千多条记录,excel文件有6M多. 编辑器用的FCKeditor,也能粘贴上,就是点保存的时候,执行了一段时间就没有反映了,保存不了. 想着可 ...
c++实现简单计算器
帮一个同学写的,非计算机类专业,应付交差,也没什么功能,两个数的加减乘除运算,以及三角函数的运算.要求用到模板.运算符重载和异常处理. 一直以来都是用的java,没怎么用过c++,就当是复习了一下c+ ...
Env&colon;zsh和fish安装和使用
zsh优势兼容bash, 方便git管理,但是有时候切换速度较慢,特别遇到git仓库目录 fish优势速度较快,路径提示也不错,但是和bash不兼容 1. zsh 首先,可以通过cat /etc/sh ...
C&num; 4&period;0 新特性之并行运算(Parallel)
介绍C# 4.0 的新特性之并行运算 Parallel.For - for 循环的并行运算 Parallel.ForEach - foreach 循环的并行运算 Parallel.Invoke - 并 ...
细谈JavaScript中的书写规范
当你有一些感触想写下一些东西的时候,总会发现其实你想写的所有文章都有大牛已经给你写好了,而且写的比你好. https://github.com/ecomfe/spec/blob/master/java ...
Eclipse自动生成返回值对象的快捷键是什么？
如下代码 List<ShareholderEntity> shareList = fetch.parseShareHolder(data); 如何自动生成 List<Sharehol ...
AB PLC分类
AB PLC产品更新速度还是挺快的,以前很多产品都停产了,所以分类也跟着调整,就目前而言: 一.主要类型 AB PLC按类型,主要分为三大类: • 小型:MicroLogix控制器 • 中型:Comp ...
JSON&period;stringify与JSON&period;parse
JSON.stringify(value [, replacer] [, space]) 用于将对象 --> JSON 字符串. value:对象.数组.类 replacer: 数组时:v ...