transforms.py 工具箱
- totensor
- resize
|
使用工具创建具体的工具
transforms.ToTensor()
使用工具:
输入:图片
【pytorch学习|深度学习pytorch代码: transforms结构及用法 常见的transforms】输出:result = tool(图片)
from PIL import Image as imim
from tensorboardX import SummaryWriter
from torchvision import transforms
# python的用法 -》 tensor的数据类型
# 通过transform.ToTensor解决两个问题
# 1.transforms该如何让使用(python)
# 2.为什么需要ToTensor类型img_path = r"C:\Users\123\Desktop\python4.7\hymenoptera_data\train\ants\6240338_93729615ec.jpg"
img = imim.open(img_path)
# print(img)writer = SummaryWriter ("logs")
tensor_trans = transforms.ToTensor()
tensor_img = tensor_trans(img)# ctrl+p查看需要填入的类型
# print(tensor_img)
writer.add_image("Tensor_img", tensor_img)
writer.close()
常见的Transforms
* PIL* Image.open()
* tensor* ToTensor()
* narray* cv.imread()
注意事项:
- 关注输入和输出类型
- 多看官方文档
- 关注方法需要什么参数
- 不知道返回值的时候print / print(type()) / debug
from PIL import Image
from tensorboardX import SummaryWriter
from torchvision import transformswriter = SummaryWriter("logs")
img = Image.open(r"C:\Users\123\Desktop\python4.7\flower.jpg")
print(img)# ToTensor的使用
trans_totensor = transforms.ToTensor()# 创建对象
img_tensor = trans_totensor(img)# 调用方法
writer.add_image("ToTensor", img_tensor)# Normalize
# output[channel] = (input[channel] - mean[channel]) / std[channel]均值和方差
print(img_tensor[0][0][0])# 像素的第一层第一行第一列
trans_norm = transforms.Normalize([0.5, 0.5, 0.5], [0.5, 0.5, 0.5])
img_norm = trans_norm(img_tensor)
print(img_norm[0][0][0])
writer.add_image("Normalize", img_norm)# 绘制图片# resize - 1
print(img.size)
trans_resize = transforms.Resize(512, 512)
# img PIL -> resize -> img_resize PIL
img_resize = trans_resize(img)
# img_resize PIL -> totensor -> img_resize tensor
image_resize = trans_totensor(img_resize)
print(img_resize)
writer.add_image("Resize-1", img_resize, 0)
print(img_resize)# Compose - resize - 2
trans_resize_2 = transforms.Resize(512)
# PIL -> PIL -> tensor
trans_compose = transforms.Compose(
[trans_resize_2, trans_totensor]
)
img_resize_2 = trans_compose(img)
writer.add_image("Resize-2", img_resize_2, 1)# RandomCrop
trans_random = transforms.RandomCrop(512)
trans_compose_2 = transforms.Compose([trans_random, trans_totensor])
for i in range(10):# 裁剪10个
img_crop = trans_compose_2(img)
writer.add_image("RandomCrop", img_crop, i)writer.close()
推荐阅读
- 机器学习算法|机器学习算法(二十九)(高斯混合模型(Gaussian Mixed Model,GMM))
- nlp|Onehot_encode与Word2vec词向量训练
- Python OpenCV cv2.imread()函数用法指南
- 在Python中使用JSON数据经典指南
- NumPy索引 – 索引编制介绍代码示例
- Python中的图像处理(缩放,旋转,移位和边缘检测)
- 一文了解垃圾回收算法中的引用计数算法
- 人工智能|Pytorch(colab端)——基于VGG模型迁移学习的猫狗大战
- 神经网络|完整的动手指南,可在Google Colab GPU上训练你的神经网络模型