当前位置:网站首页>Paper Accuracy - 2017 CVPR "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"
Paper Accuracy - 2017 CVPR "High-Resolution Image Inpainting using Multi-Scale Neural Patch Synthesis"
2022-08-11 03:12:00 【clarkjs】
Overview
Like the previous blog, this paper largely draws on the idea of the pioneering work "Context Encoders: Feature Learning by Inpainting", using the encoder-decoder structure for image generation, but the previous oneThe paper has big flaws, the most obvious of which is that the result of the completion is rather blurry (with poor texture technology), and the scale of the input image is fixed at 128*128, and it cannot handle high-resolution images (please note that, for context encoders, high-resolution results are obtained by direct upsampling from low-resolution outputs.).In view of this, this paper innovatively proposes a multi-scale neural patch, which performs both content learning and texture learning, and finally forms a model with excellent content and texture.Note: This idea is similar to the "Globally and Locally Consistent Image Completion" published at the ACM summit in the same year. The core innovation of this paper is to use the CE generator to innovatively propose twoA discriminator, global and local, is actually a global consideration of the correctness of content filling, and a local (blank area and a small part of the surrounding) considering the texture, which can be understood as the fit of the details.
I. Method details
1. Network Structure
There are two parts of network classification: (1) content generation network (the missing square mask in the center of the image is filled with the average pixel color and then input into the network); (2) texture generation network, the content generation network adopts pioneering workThe CE generator method, the texture generation network adopts VGG-19 pre-trained using ImageNet.
The relu3_1 and relu4_1 layers are used in the texture generation network to calculate the texture.
边栏推荐
猜你喜欢
Some work experience after joining the digital ic design
Detailed explanation of new features of ES advanced array function syntax
DOM-DOM树,一个DOM树有三种类型的节点
AI+Medical: Using Neural Networks for Medical Image Recognition and Analysis
一次简单的 JVM 调优,学会拿去写到简历里
qtcreator调试webkit
按摩椅控制板的开发让按摩椅变得简约智能
添加用户报错useradd: cannot open /etc/passwd
flink The object probably contains or references non serializable fields.
浮点数在内存中的存储方式
随机推荐
解决vim与外界的复制粘贴(不用安装插件)
多商户商城系统功能拆解26讲-平台端分销设置
leetcode:358. K 距离间隔重排字符串
alibaba数据同步组件canal的实践整理
Briefly, talk about the use of @Transactional in the project
The most unlucky and the luckiest
言简意赅,说说 @Transactional 在项目中的使用
Google search skills - programmer is recommended
对加密世界的经济误解:现金是储蓄?稀缺性创造价值?
【LeetCode】Day112-repetitive DNA sequence
(Nips-2015)空间变换器网络
SQL 开发的十个高级概念
DOM-DOM树,一个DOM树有三种类型的节点
"Beijing-Taiwan high-speed rail" debuted on Baidu map, can it really be built in 2035?
CTO说MySQL单表行数不要超过2000w,为啥?
MongoDB 基础了解(二)
[Pdf generated automatically bookmarks]
redis学习五redis的持久化RDB,fork,copyonwrite,AOF,RDB&AOF混合使用
(CVPR - 2017) in depth and potential body learning context awareness feature for pedestrian recognition
最倒霉与最幸运