快捷導(dǎo)航

對tensorflow中tf.nn.conv1d和layers.conv1d的區(qū)別詳解

更新時(shí)間：2020年02月11日 14:09:03 作者：孤獨(dú)的猿行客

今天小編就為大家分享一篇對tensorflow中tf.nn.conv1d和layers.conv1d的區(qū)別詳解，具有很好的參考價(jià)值，希望對大家有所幫助。一起跟隨小編過來看看吧

在用tensorflow做一維的卷積神經(jīng)網(wǎng)絡(luò)的時(shí)候會(huì)遇到tf.nn.conv1d和layers.conv1d這兩個(gè)函數(shù)，但是這兩個(gè)函數(shù)有什么區(qū)別呢，通過計(jì)算得到一些規(guī)律。

1.關(guān)于tf.nn.conv1d的解釋，以下是Tensor Flow中關(guān)于tf.nn.conv1d的API注解：

Computes a 1-D convolution given 3-D input and filter tensors.

Given an input tensor of shape [batch, in_width, in_channels] if data_format is "NHWC", or [batch, in_channels, in_width] if data_format is "NCHW", and a filter / kernel tensor of shape [filter_width, in_channels, out_channels], this op reshapes the arguments to pass them to conv2d to perform the equivalent convolution operation.

Internally, this op reshapes the input tensors and invokes `tf.nn.conv2d`. For example, if `data_format` does not start with "NC", a tensor of shape [batch, in_width, in_channels] is reshaped to [batch, 1, in_width, in_channels], and the filter is reshaped to [1, filter_width, in_channels, out_channels]. The result is then reshaped back to [batch, out_width, out_channels] whereoutwidthisafunctionofthestrideandpaddingasinconv2dwhereoutwidthisafunctionofthestrideandpaddingasinconv2d and returned to the caller.

Args: value: A 3D `Tensor`. Must be of type `float32` or `float64`. filters: A 3D `Tensor`. Must have the same type as `input`. stride: An `integer`. The number of entries by which the filter is moved right at each step. padding: 'SAME' or 'VALID' use_cudnn_on_gpu: An optional `bool`. Defaults to `True`. data_format: An optional `string` from `"NHWC", "NCHW"`. Defaults to `"NHWC"`, the data is stored in the order of [batch, in_width, in_channels]. The `"NCHW"` format stores data as [batch, in_channels, in_width]. name: A name for the operation (optional).

Returns:

A `Tensor`. Has the same type as input.

Raises:

ValueError: if `data_format` is invalid.

什么意思呢？就是說conv1d的參數(shù)含義：(以NHWC格式為例，即，通道維在最后)

1、value：在注釋中，value的格式為：[batch, in_width, in_channels]，batch為樣本維，表示多少個(gè)樣本，in_width為寬度維，表示樣本的寬度，in_channels維通道維，表示樣本有多少個(gè)通道。事實(shí)上，也可以把格式看作如下:[batch, 行數(shù), 列數(shù)]，把每一個(gè)樣本看作一個(gè)平鋪開的二維數(shù)組。這樣的話可以方便理解。

2、filters：在注釋中，filters的格式為：[filter_width, in_channels, out_channels]。按照value的第二種看法，filter_width可以看作每次與value進(jìn)行卷積的行數(shù)，in_channels表示value一共有多少列（與value中的in_channels相對應(yīng)）。out_channels表示輸出通道，可以理解為一共有多少個(gè)卷積核，即卷積核的數(shù)目。

3、stride：一個(gè)整數(shù)，表示步長，每次（向下）移動(dòng)的距離（TensorFlow中解釋是向右移動(dòng)的距離，這里可以看作向下移動(dòng)的距離）。

4、padding：同conv2d，value是否需要在下方填補(bǔ)0。

5、name：名稱?？墒÷浴?/p>

首先從參數(shù)列表可以看出value指的輸入的數(shù)據(jù)，stride就是卷積的步長，這里我們最有疑問的就是filters這個(gè)參數(shù)，那么我們對filter進(jìn)行簡單的說明。從上面可以看到filters的格式為：[filter_width, in_channels, out_channels],這是一個(gè)數(shù)組的維度，對應(yīng)的是卷積核的大小，輸入的channel的格式，和卷積核的個(gè)數(shù)，下面我們用例子說明問題：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])
  # filter width, filter channels and out channels(number of kernels)
  cov1 = tf.nn.conv1d(inputs, w, stride=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov1)
    print(out)

其輸出為：

[[[ 5.],
    [ 8.],
    [11.],
    [14.]]]

我們分析一下，輸入的數(shù)據(jù)為[[[1],[2],[3],[4],[5]]],有5個(gè)特征，分別對應(yīng)的數(shù)值為1，2，3，4，5，那么經(jīng)過卷積的結(jié)果為5，8，11，14，那么這個(gè)結(jié)果是怎么來的呢，我們根據(jù)卷積的計(jì)算，可以得到5 = 1*1 + 2*2， 8=2*1+ 3*2， 11 = 3*1+4*2， 14=4*1+5*2，也就是W1=1， W2=2，正好和我們先面filters設(shè)置的數(shù)值相等，

w = np.array([1, 2], dtype=np.float32).reshape([2, 1, 1])

所以可以看到這個(gè)filtes設(shè)置的是是卷積核矩陣的，換句話說，卷積核矩陣我們是可以設(shè)置的。

2. 1.關(guān)于tf.layers.conv1d,函數(shù)的定義如下

tf.layers.conv1d(
 
inputs,
 
filters,
 
kernel_size,
 
strides=1,
 
padding='valid',
 
data_format='channels_last',
 
dilation_rate=1,
 
activation=None,
 
use_bias=True,
 
kernel_initializer=None,
 
bias_initializer=tf.zeros_initializer(),
 
kernel_regularizer=None,
 
bias_regularizer=None,
 
activity_regularizer=None,
 
kernel_constraint=None,
 
bias_constraint=None,
 
trainable=True,
 
name=None,
 
reuse=None
 
)

比較重要的幾個(gè)參數(shù)是inputs, filters, kernel_size，下面分別說明

inputs : 輸入tensor，維度(None, a, b) 是一個(gè)三維的tensor

None ：一般是填充樣本的個(gè)數(shù)，batch_size

a ：句子中的詞數(shù)或者字?jǐn)?shù)

b : 字或者詞的向量維度

filters : 過濾器的個(gè)數(shù)

kernel_size : 卷積核的大小，卷積核其實(shí)應(yīng)該是一個(gè)二維的，這里只需要指定一維，是因?yàn)榫矸e核的第二維與輸入的詞向量維度是一致的，因?yàn)閷τ诰渥佣?，卷積的移動(dòng)方向只能是沿著詞的方向，即只能在列維度移動(dòng)。一個(gè)例子：

import tensorflow as tf
import numpy as np
 
 
if __name__ == '__main__':
  inputs = tf.constant(np.arange(1, 6, dtype=np.float32), shape=[1, 5, 1])
  cov2 = tf.layers.conv1d(inputs, filters=1, kernel_size=2, strides=1, padding='VALID')
  with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    out = sess.run(cov2)
    print(out)

輸出結(jié)果：

[[[-1.9953331]
 [-3.5520997]
 [-5.108866 ]
 [-6.6656327]]]

也許你得到的結(jié)果和我得到的結(jié)果不同，因?yàn)樵谶@個(gè)函數(shù)里面只是設(shè)置了卷積核的尺寸和步長，沒有設(shè)置具體的卷積核矩陣，所以這個(gè)卷積核矩陣是隨機(jī)生成的，就會(huì)出現(xiàn)可能運(yùn)行上面的程序出現(xiàn)不同結(jié)果的情況。

以上這篇對tensorflow中tf.nn.conv1d和layers.conv1d的區(qū)別詳解就是小編分享給大家的全部內(nèi)容了，希望能給大家一個(gè)參考，也希望大家多多支持腳本之家。

您可能感興趣的文章: