当前位置:网站首页>Paper Accuracy - 2017 CVPR "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"
Paper Accuracy - 2017 CVPR "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"
2022-08-11 03:12:00 【clarkjs】
Overview
Like the previous blog, this paper largely draws on the idea of the pioneering work "Context Encoders: Feature Learning by Inpainting", using the encoder-decoder structure for image generation, but the previous oneThe paper has big flaws, the most obvious of which is that the result of the completion is rather blurry (with poor texture technology), and the scale of the input image is fixed at 128*128, and it cannot handle high-resolution images (please note that, for context encoders, high-resolution results are obtained by direct upsampling from low-resolution outputs.).In view of this, this paper innovatively proposes a multi-scale neural patch, which performs both content learning and texture learning, and finally forms a model with excellent content and texture.Note: This idea is similar to the "Globally and Locally Consistent Image Completion" published at the ACM summit in the same year. The core innovation of this paper is to use the CE generator to innovatively propose twoA discriminator, global and local, is actually a global consideration of the correctness of content filling, and a local (blank area and a small part of the surrounding) considering the texture, which can be understood as the fit of the details.
I. Method details
1. Network Structure
There are two parts of network classification: (1) content generation network (the missing square mask in the center of the image is filled with the average pixel color and then input into the network); (2) texture generation network, the content generation network adopts pioneering workThe CE generator method, the texture generation network adopts VGG-19 pre-trained using ImageNet.
The relu3_1 and relu4_1 layers are used in the texture generation network to calculate the texture.
边栏推荐
猜你喜欢
![[BX]和loop](/img/3c/1be08db6898613c3a1c0bcd39e73be.png)
[BX]和loop

google搜索技巧——程序员推荐

Realization of vending machine function based on FPGA state machine

一次简单的 JVM 调优,学会拿去写到简历里

C语言之自定义类型------结构体

MongoDB 基础了解(二)

代码 Revert 后再次 Merge 会丢失的问题,已解决

Environment configuration of ESP32 (arduino arduino2.0 VScode platform which is easy to use?)

按摩椅控制板的开发让按摩椅变得简约智能

Some work experience after joining the digital ic design
随机推荐
索引的创建、查看、删除
Ten Advanced Concepts of SQL Development
Logstash日志数据写入异常排查问题总结
(Nips-2015) Spatial Transformer Network
ES进阶 数组功能语法新特性详解
按摩椅控制板的开发让按摩椅变得简约智能
【LeetCode】Day112-重复的DNA序列
leetcode: 358. Reorder strings at K distance intervals
What does the sanction of the mixer Tornado mean for the DeFi market?
ROS源代码阅读(1)
输入起始位置,终止位置截取链表
CC0 与商业 IP:哪种模式更适合 NFT?
[4G/5G/6G专题基础-154]: 5G无线准入控制RAC(Radio Admission Control)
字体反扒
[Pdf generated automatically bookmarks]
OpenCV founder: Open source must not be completely free!
面试常考的7种排序算法
shell脚本入门
The most unlucky and the luckiest
【Pdf自动生成书签】