深度学习系列66：试穿模型IDM-VTON上手

2024-04-28 06:30:02
开发
32

1. 模型概述

在这里插入图片描述
如图，总体流程为：

输入为：衣服的编码xg；人物+noise的编码xt；人物身上衣物的mask和人体pose分割(densepose)；
衣服部分经过两部分网络：1）高级语义网络IP-Adapter：是一个图像编码器，比如CLIP模型；2）低级语义网络：称为GarmentNet，是一个UNet，用来提取图像低级细节特征，例如纹理，图案等等。
人体部分经过TryonNet，也是一个UNet。其输入和GarmentNet同层进行拼接后，输入自注意力层，然后取左半部分，与IPAdaper的结果，以及文本编码结果进行交叉注意力计算。

官网为：https://idm-vton.github.io/
不同模型的效果对比图如下：
在这里插入图片描述

2. 快速上手

可以在huggingface的demo上进行尝试：https://hf-mirror.com/spaces/yisol/IDM-VTON
参考https://github.com/camenduru/IDM-VTON-jupyter/blob/main/IDM_VTON_jupyter.ipynb，执行代码：

git clone  https://hub.nuaa.cf/camenduru/IDM-VTON-hf
cd IDM-VTON-hf
apt -y install -qq aria2
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/densepose/model_final_162be9.pkl -d /content/IDM-VTON-hf/ckpt/densepose -o model_final_162be9.pkl
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/humanparsing/parsing_atr.onnx -d /content/IDM-VTON-hf/ckpt/humanparsing -o parsing_atr.onnx
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/humanparsing/parsing_lip.onnx -d /content/IDM-VTON-hf/ckpt/humanparsing -o parsing_lip.onnx
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/openpose/ckpts/body_pose_model.pth -d /content/IDM-VTON-hf/ckpt/openpose/ckpts -o body_pose_model.pth
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/IDM-VTON-DC/unet/diffusion_pytorch_model.bin -d /content/IDM-VTON-hf/ckpt/openpose/ckpts/unet -o diffusion_pytorch_model.bin
aria2c --console-log-level=error -c -x 16 -s 16 -k 1M https://hf-mirror.com/camenduru/IDM-VTON/resolve/main/IDM-VTON-DC/unet/config.json -d /content/IDM-VTON-hf/ckpt/openpose/ckpts/unet -o config.json

pip install -q diffusers==0.25.0 accelerate==0.26.1 einops==0.7.0 onnxruntime==1.16.2 cloudpickle omegaconf gradio==4.24.0 fvcore av config spaces -i https://pypi.tuna.tsinghua.edu.cn/simple

然后执行python app.py启动应用即可
另外下载的模型也可以替换为F16的版本，参考：https://hf-mirror.com/camenduru/IDM-VTON-F16/tree/main

原文地址:https://blog.csdn.net/kittyzc/article/details/138205119 本文来自互联网用户投稿，该文观点仅代表作者本人，不代表本站立场。本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如若转载，请注明出处：https://www.suanlizi.com/kf/1784349312482938880.html 如若内容造成侵权/违法违规/事实不符，请联系《酸梨子》网邮箱：1419361763@qq.com进行投诉反馈，一经查实，立即删除！

阅读全部

深度学习系列66：试穿模型IDM-VTON上手

1. 模型概述

2. 快速上手

相关推荐

最近更新

热门阅读