快捷導(dǎo)航

tensorflow模型的save與restore,及checkpoint中讀取變量方式

更新時(shí)間：2020年05月26日 09:33:13 作者：J_______ll

這篇文章主要介紹了tensorflow模型的save與restore,及checkpoint中讀取變量方式，具有很好的參考價(jià)值，希望對(duì)大家有所幫助。一起跟隨小編過(guò)來(lái)看看吧

創(chuàng)建一個(gè)NN

import tensorflow as tf
import numpy as np

#fake data
x = np.linspace(-1, 1, 100)[:, np.newaxis] #shape(100,1)
noise = np.random.normal(0, 0.1, size=x.shape)
y = np.power(x, 2) + noise  #shape(100,1) + noise
tf_x = tf.placeholder(tf.float32, x.shape) #input x
tf_y = tf.placeholder(tf.float32, y.shape) #output y
l = tf.layers.dense(tf_x, 10, tf.nn.relu) #hidden layer
o = tf.layers.dense(l, 1)     #output layer
loss = tf.losses.mean_squared_error(tf_y, o ) #compute loss
train_op = tf.train.GradientDescentOptimizer(learning_rate=0.5).minimize(loss)

1.使用save對(duì)模型進(jìn)行保存

sess= tf.Session()
sess.run(tf.global_variables_initializer())  #initialize var in graph
saver = tf.train.Saver() # define a saver for saving and restoring
for step in range(100):   #train
 sess.run(train_op,{tf_x:x, tf_y:y})
saver.save(sess, 'params/params.ckpt', write_meta_graph=False) # mate_graph is not recommend

生成三個(gè)文件，分別是checkpoint，.ckpt.data-00000-of-00001,.ckpt.index

2.使用restore對(duì)提取模型

在提取模型時(shí)，需要將模型結(jié)構(gòu)再定義一遍，再將各參數(shù)加載出來(lái)

#bulid entire net again and restore
tf_x = tf.placeholder(tf.float32, x.shape)
tf_y = tf.placeholder(tf.float32, y.shape)
l_ = tf.layers.dense(tf_x, 10, tf.nn.relu)
o_ = tf.layers.dense(l_, 1)
loss_ = tf.losses.mean_squared_error(tf_y, o_)
 
sess = tf.Session()
# don't need to initialize variables, just restoring trained variables
saver = tf.train.Saver() # define a saver for saving and restoring
saver.restore(sess, './params/params.ckpt')

3.有時(shí)會(huì)報(bào)錯(cuò)Not found：b1 not found in checkpoint

這時(shí)我們想知道我在文件中到底保存了什么內(nèi)容，即需要讀取出checkpoint中的tensor

import os
from tensorflow.python import pywrap_tensorflow
checkpoint_path = os.path.join('params','params.ckpt')
# Read data from checkpoint file
reader = pywrap_tensorflow.NewCheckpointReader(checkpoint_path)
var_to_shape_map = reader.get_variable_to_shape_map()
# Print tensor name and value
f = open('params.txt','w')
for key in var_to_shape_map: # write tensors' names and values in file
 print(key,file=f)
 print(reader.get_tensor(key),file=f)
f.close()

運(yùn)行后生成一個(gè)params.txt文件，在其中可以看到模型的參數(shù)。

補(bǔ)充知識(shí)：TensorFlow按時(shí)間保存檢查點(diǎn)

一實(shí)例

介紹一種更簡(jiǎn)便地保存檢查點(diǎn)功能的方法——tf.train.MonitoredTrainingSession函數(shù)，該函數(shù)可以直接實(shí)現(xiàn)保存及載入檢查點(diǎn)模型的文件。

演示使用MonitoredTrainingSession函數(shù)來(lái)自動(dòng)管理檢查點(diǎn)文件。

二代碼

import tensorflow as tf
tf.reset_default_graph()
global_step = tf.train.get_or_create_global_step()
step = tf.assign_add(global_step, 1)
with tf.train.MonitoredTrainingSession(checkpoint_dir='log/checkpoints',save_checkpoint_secs = 2) as sess:
 print(sess.run([global_step]))
 while not sess.should_stop():
  i = sess.run( step)
  print( i)

三運(yùn)行結(jié)果

1 第一次運(yùn)行后，會(huì)發(fā)現(xiàn)log文件夾下產(chǎn)生如下文件

2 第二次運(yùn)行后，結(jié)果如下：

INFO:tensorflow:Create CheckpointSaverHook.
INFO:tensorflow:Restoring parameters from log/checkpoints\model.ckpt-15147
INFO:tensorflow:Saving checkpoints for 15147 into log/checkpoints\model.ckpt.
[15147]
15148
15149
15150
15151
15152
15153
15154
15155
15156
15157
15158
15159