Paper Close Reading - CVPR 2017 "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"
2022-08-11 03:12:00 【clarkjs】
Overview
As in the previous post, this paper builds on the pioneering work "Context Encoders: Feature Learning by Inpainting" (CE) and uses an encoder-decoder structure for image generation. The earlier paper has clear flaws, though. The most obvious is that the completed region is rather blurry, with poor texture detail. In addition, the input size is fixed at 128*128, so it cannot handle high-resolution images directly (note that for context encoders, high-resolution results are obtained by simply upsampling the low-resolution output). To address this, the paper proposes multi-scale neural patch synthesis, which optimizes content and texture together and yields results that are good in both respects.
Note: the idea is similar to "Globally and Locally Consistent Image Completion", published at a top ACM venue in the same year. The core innovation of that paper is to pair a CE-style generator with two discriminators, one global and one local. The global discriminator judges whether the filled content is correct in the context of the whole image, while the local one (the hole plus a small surrounding area) judges texture, which can be understood as how well the details fit.
I. Method details
1. Network Structure
The network has two parts: (1) a content generation network (the missing square hole in the center of the image is filled with the mean pixel color before the image is fed to the network); (2) a texture generation network. The content generation network adopts the generator from the pioneering CE work, and the texture generation network is VGG-19 pre-trained on ImageNet.
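The input preparation for the content network can be sketched as follows. This is a minimal NumPy sketch, not the authors' code: the function name `fill_center_hole`, the 64*64 hole size, and the choice to average over the known pixels of the same image (rather than a dataset mean) are all assumptions made for illustration.

```python
import numpy as np

def fill_center_hole(image, hole_size):
    """Fill a centered square hole with the mean color; return (filled, mask).

    image: float array of shape (H, W, 3).
    hole_size: side length of the square hole (hypothetical parameter).
    """
    h, w, _ = image.shape
    top = (h - hole_size) // 2
    left = (w - hole_size) // 2

    # Boolean mask: True inside the missing region.
    mask = np.zeros((h, w), dtype=bool)
    mask[top:top + hole_size, left:left + hole_size] = True

    # Assumption: the mean is taken over the known pixels of this image.
    mean_color = image[~mask].mean(axis=0)

    filled = image.astype(np.float32).copy()
    filled[mask] = mean_color
    return filled, mask

# Example with a random 128*128 image and an assumed 64*64 hole.
rng = np.random.default_rng(0)
img = rng.random((128, 128, 3)).astype(np.float32)
filled, mask = fill_center_hole(img, 64)
```

The returned mask is kept alongside the filled image because later stages need to know which pixels are to be synthesized and which are known context.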
The texture generation network uses the relu3_1 and relu4_1 layers of VGG-19 to compute the texture term.
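To make the layer names concrete, the sketch below derives where relu3_1 and relu4_1 sit in VGG-19, using only the standard VGG-19 configuration (16 convolutions and 5 max-pools). The helper `vgg19_layer_table` is hypothetical. The sequential index assumes a plain conv/ReLU/pool layout without batch normalization, as commonly used for the convolutional part of VGG-19; check it against the framework you actually use.

```python
# Standard VGG-19 configuration: numbers are conv output channels, "M" is a max-pool.
VGG19_CFG = [64, 64, "M", 128, 128, "M", 256, 256, 256, 256, "M",
             512, 512, 512, 512, "M", 512, 512, 512, 512, "M"]

def vgg19_layer_table(cfg=VGG19_CFG):
    """Map layer names to (sequential index, channels, cumulative stride)."""
    table = {}
    index, block, conv_in_block, stride, channels = 0, 1, 0, 1, 3
    for item in cfg:
        if item == "M":
            stride *= 2  # each max-pool halves the spatial resolution
            table[f"pool{block}"] = (index, channels, stride)
            index += 1
            block += 1
            conv_in_block = 0
        else:
            conv_in_block += 1
            channels = item
            table[f"conv{block}_{conv_in_block}"] = (index, channels, stride)
            table[f"relu{block}_{conv_in_block}"] = (index + 1, channels, stride)
            index += 2
    return table

table = vgg19_layer_table()
print(table["relu3_1"])  # (11, 256, 4)
print(table["relu4_1"])  # (20, 512, 8)
```

Under these assumptions, relu3_1 features have 256 channels at 1/4 of the input resolution and relu4_1 features have 512 channels at 1/8, so the texture term is computed on feature maps considerably coarser than the image itself.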