【教程】Tensorflow vs PyTorch —— 基本操作

请设置文章作者

发布于：Feb 13, 2020

次浏览

Tensorflow 和 PyTorch 的基本操作非常类似，大多数方法都是相通的，如果有 numpy 的基础，掌握起来就更加容易了。基本操作涉及到张量的初始化，切片索引，维度变换和合并拆分等。

1. 初始化

Tensorflow 和 PyTorch 的导入，查看版本，是否有GPU支持等非常简单代码如下：

2. 创建张量

在数学和物理概念中，有标量 Scaler，向量 Vector, 矩阵 Matrix 和张量 Tensor 之分。但我们这里为了简单和统一，将数据均称为张量。

Tensorflow 和 PyTorch 张量初始化可以直接分别调用： tf.constent 方法，调用 torch.tensor 方法，填入张量数值即可。

# ------------------------Tensorflow -----------------------------
#  dim = 0 
a = tf.constant(2.4)
#  dim = 1 
a = tf.constant([2.4])
# dim = 2
b = tf.constant([[1,2],[3,4]])

# ------------------------PyTorch ------------------------------------
#  dim = 0 
a = torch.tensor(2.4)
# dim = 1 
a = torch.tensor([2.4])
# dim = 2
b = torch.tensor([[1,2],[3,4]])

也可分别创建 tf.Variable, torch.Tensor 对象，实现张量初始化。但是这里需要注意两点:

tf.Variable 创建的是Variable对象，不是Tensor对象，前者可以跟踪求梯度，后者不能直接求梯度。

torch.tensor 和 torch.Tensor 均创建的是 Tensor 对象，但前者输入具体数值，后者输入 Tensor shape(size)，数值不可控，不推荐

推荐初始化方法

直接从 numpy.arrary 转换成 Tensor

使用，zeros, ones 和 eye 分别创建 0， 1 和对角张量。

随机初始化 tf.random.uniform, tf.random.normall; torch.rand

Tensorflow 和 PyTorch 的张量均可以通过 .numpy() 转换成 numpy.arrary，PyTorch 还可以使用.tolist() 转换成list

# ------------------------Tensorflow ------------------------
a = np.array([1,2,3])
aa = tf.Variable(a)

aa.numpy()
aa.numpy().tolist()

b = tf.ones(shape=[3,3])
c = tf.zeros(shape=[1,2,3])
d = tf.eye(4)


e = tf.random.uniform([3,3],
                     minval=-1,
                     maxval= 1)
f = tf.random.normal([3,3])

g = tf.fill(dims=[3,3], value=3)

# ------------------------ PyTorch ------------------------
a = np.array([1,2,3])
aa = torch.from_numpy(a)

aa.numpy()
aa.tolist()
aa[0].item()

b = torch.ones([3,3])
c = torch.zeros([1,2,3])
d = torch.eye(4)

e = torch.rand([3,3]) *2 -1
f = torch.tensor(np.random.normal( size= [3,3]))

h = torch.full([3,3],fill_value = 3)

3. 切片和索引

Tensorflow 和 PyTorch 切片和索引与 numpy.array 非常类似这里就不赘述了。注意掌握以下两个原则：

索引格式 start : end : step 其中 end 不包括在内。

中间若要省略过多冒号，可用 … 代替

4. 维度变换

4.1 Reshape

Tensorflow 和 PyTorch 均可用 reshape 方法变换维度。PyTorch 还有一个 view 方法，与reshape 方法效果一样。

需要注意的是

Tensorflow 是调用 tf.reshape, PyTorch 可以直接对 Tensor 本身进行操作。

Reshape 操作不会改变数据的存储顺序。

另外可以 Tensorfow 和 PyTorch 可以分别使用 tf.expand_dim 和 .unsqueeze 方法增加维度。分别使用 tf.squeeze 和 .squeeze 方法删除维度。

注意增加和删除的维度在该维度 (Aix) 上数值为 1，表现在数据上就是增加和删除了括号而已，并不会对数据的存储顺序造成影响。

# ------------------------Tensorflow ----------------------
x = tf.random.uniform([5,28,28,3])

print(tf.reshape(x, [5, 28*28, 3]).shape)
print(tf.reshape(x, [5, 14, 56, 3]).shape)
print(tf.expand_dims(x, axis=0).shape)

y = tf.expand_dims(x, axis=2)
print(y.shape)
print(tf.squeeze(y, axis=2).shape)

# ------------------------ PyTorch ------------------------
x = torch.rand([5,3,28,28])

print(x.view(5,3,28*28).shape)
print(x.reshape(5,3,28*28).shape)
print(torch.reshape(x,[5,3,28*28]).shape)

y = x.unsqueeze(2)
print(y.shape)
print(y.squeeze(2).shape)

4.2 交换维度

交换维度是非常常见的操作，比如 Tensorflow 中图片的存储方式通常是 [b, w, h, c] 而 PyTorch 则为 [b, c, w, h]

b, w, h, c 分别代表 batch, width, height, channel. （批，宽，高，通道）

Tensorflow 和 PyTorch 均可使用 transpose 的方法交换维度，但是实现方法有所差异，PyTorch 推荐使用 permute 方法，该方法与 tf.transpose 方法一致。

交换维度会改变数据存储顺序，谨慎使用。

该部分参考代码如下：

# ------------------------Tensorflow ----------------------
# Change dim [b,w,h,c] -> [b,c,w,h]

z = tf.transpose(x,perm=[0,3,1,2])
print('z shape', z.shape)
# ------------------------ PyTorch ------------------------

# exchange dim [b, c, w, h] -> [b, w, h, c]
z = x.transpose(1, 2).transpose(2, 3)
print(x.permute(0,3,1,2).shape)

5. Broadcasting

Broadcasting 是一个自动复制张量的手段，在深度学习中，通常我们需要将标量的偏置 (Bias) Broadcast 到与权重 (Weight) 一致。

Tensorflow 和 PyTorch 均可自动完成这个操作。当然我们也可以分别使用 tf.broadcast_to,.expand 方法手动实现。

注意 Broadcasting 只能将维度(轴)上数值为1的地方复制 n 遍。

6. 合并拆分

合并和拆分是非常常见的张量操作，对应的是 concat 和 split。Tensorflow 对应 tf.concat和tf.split PyTorch对应 .cat 和.split，这两个操作都不会改变张量的维度。

举一个例子，一个3维张量对应班级，学生，各科成绩 [classes, students, scores] ，我们可以在三个维度上对这个张量进行合并和拆分。

如果在 dim/axis=0 上合并，既合并班级，此时 students, scores 维度上数值必须保持一致，既学生数量和科目数量必须保持一致。

如果在 dim/axis=1 上合并，既合并学生，此时 classes, scores 维度上数值必须保持一致，既班级数量和科目数量必须保持一致。

如果在 dim/axis=0 上合并，既合并成绩，此时 classes, student 维度上数值必须保持一致，既班级数量和学生数量必须保持一致。

拆分也是同样的道理，在传入参数的 num_or_size_splits 如果传入一个数字，既表示在该维度下每份的大小，如果传入的是一个 list, 既表示按照 list 中的大小进行拆分。

# ------------------------Tensorflow -----------------------------
# Example [classes, students, scores] 
# Concat dim = 0, 
a = tf.random.uniform([4,32,8])
b = tf.random.uniform([5,32,8])
print(tf.concat([a,b], axis=0).shape)

# Concat dim = 1, Same classes different students
a = tf.random.uniform([4,10,8])
b = tf.random.uniform([4,22,8])
print(tf.concat([a,b], axis=1).shape)

#Concat dim = 2, Same classes, students, different subjects
a = tf.random.uniform([4,32,5])
b = tf.random.uniform([4,32,3])
print(tf.concat([a,b], axis=2).shape)

c = tf.random.uniform([9,32,8])

c1, c2 = tf.split(c,[3,6],axis=0)
print(c1.shape, c2.shape)

c1, c2, c3 = tf.split(c,3,axis=0)
print(c1.shape, c2.shape, c3.shape)

# ------------------------ PyTorch ------------------------
# Example [classes, students, scores] 
# Concat dim = 0, 
a = torch.rand(4,32,8)
b = torch.rand(5,32,8)
print(torch.cat([a,b], dim=0).shape)

# Concat dim = 1, Same classes different students
a = torch.rand(4,10,8)
b = torch.rand(4,22,8)
print(torch.cat([a,b], dim=1).shape)

#Concat dim = 2, Same classes, students, different subjects
a = torch.rand(4,32,5)
b = torch.rand(4,32,3)
print(torch.cat([a,b], dim=2).shape)

c = torch.rand(9,32,8)
# c1, c2 = torch.split(c,[3,6],dim=0)
c1, c2 = c.split([3,6],dim=0)
print(c1.shape, c2.shape)

# c1, c2, c3 = torch.split(c,3,dim=0)
c1, c2, c3 = c.split(3,dim=0)
print(c1.shape, c2.shape, c3.shape)

另外还有一个概念需要区分就是 stack 和 unstack

stack 会在相应维度前面新加一个维度，所以被 stack 的张量在每个维度上的大小必须一致。

tf.unstack 会将该维度完全拆散，既[3,32,8] -> axis=0 下 ->unstack 为三个[32,8]的矩阵。

torch.chunk 与split 概念一致，是指在该维度下均分为多少份

# ------------------------Tensorflow -----------------------------
d1, d2, d3 = tf.unstack(c1,axis=0)
print(d1.shape, d2.shape, d3.shape)

print(tf.stack([d1,d2,d3], axis=0).shape)
print(tf.stack([c1,c2,c3], axis=0).shape)

# Example [classes, students, scores] -> [grades, classes, students, scores]
a = tf.random.uniform([4,32,8])
b = tf.random.uniform([4,32,8])
print(tf.stack([a,b], axis=0).shape)

# ------------------------PyTorch -----------------------------
# c1, c2, c3 = torch.chunk(c,3,dim=0)
c1, c2, c3 = c.chunk(3,dim=0)
print(c1.shape, c2.shape, c3.shape)

# Example [classes, students, scores] -> [grades, classes, students, scores]
a = torch.rand(4,32,8)
b = torch.rand(4,32,8)
print(torch.stack([a,b],dim=0).shape)

更新于：Oct 14, 2020

tensorflow

tutorial

pytorch

Tensorflow2.0--可视化工具：Tensorboard

image from unsplash.com by Tara Hegerty 之前的文章介绍过如何在 Keras 中快速调用 Tensorboard 这个可视化工具，这篇文章我们再深入探讨一...

强化学习—— Q-Learning 玩 MountainCar 爬坡上山

之前的文章结合理论和实践熟悉了 Q-Learning 的经典算法，这篇文章我们基于 Open AI 的经典 MountainCar 环境。用 python 代码实现 Q-Learning 算法...