当前位置:网站首页>MIT: label every pixel in the world with unsupervised! Humans: no more 800 hours for an hour of video
MIT: label every pixel in the world with unsupervised! Humans: no more 800 hours for an hour of video
2022-04-23 11:10:00 【Zhiyuan community】
Taking the advantage of ICLR 2022 On the occasion of the award ,MIT、 Cornell 、 Google and Microsoft 「 To show off 」 A new SOTA—— Label every pixel in the world , And there is no need for manual work !
Address of thesis :https://arxiv.org/abs/2203.08414
From the effect of the comparison picture , This method is sometimes even more detailed than manual work , Even the shadows are marked .
But unfortunately , Although it looks very cool , But there was no shortlist ( Including nominations ).
Say back to CV field , Actually , The problem of labeling data has plagued the academic circles for a long time .
For humans , Whether it's avocado or mashed potatoes , Even 「 Alien Mothership 」, Just take a look at , You can recognize .
But for machines , It's not that simple .
Make a data set for training , You need to frame the specific content in the image , At present, this matter can only be carried out manually .
such as , A dog sitting on the grass , Then you need to circle the dog first , And note ——「 Dog 」, And then put a note on the back piece of land 「 The grass 」.
Based on this , The trained model can make 「 Dog 」 and 「 The grass 」 Differentiate .
and , This matter is very troublesome .
You don't do it , It's hard for the model to recognize objects 、 Human or other important image features .
Do it , And very troublesome .
For human taggers , Segmented images cost about... More than classification or target detection 100 Times the energy .
Just labels 1 An hour of data takes 800 Hours .
The data indicates the worker : I'm going to graduate, too ?
In order that human beings no longer have to endure 「 mark 」 The torture of ( Of course, it is mainly to promote the progress of Technology ), The group of scientists just mentioned proposed a new method based on Transformer Methods 「STEGO」, Thus, the task of image semantic segmentation can be completed without supervision .
The purpose of unsupervised semantic segmentation is to find and locate semantic categories in image corpus , Without any form of annotation .
To solve this problem ,STEGO The algorithm must generate significant and compact enough features for each pixel , To form different clusters .
Different from the previous end-to-end model ,STEGO A method of separating feature learning from clustering is proposed , Will look for similar images that appear in the entire dataset , then , It associates these similar objects , To achieve pixel level label prediction .
stay CocoStuff On dataset ,27 Category specific unsupervised semantic segmentation tasks ( Including the ground 、 sky 、 Architecture 、 lawn 、 Vehicle 、 people 、 Animal, etc. ).
Baseline method comparison Cho wait forsomeone 2021 Put forward in PiCIE Method , The picture results show ,STEGO The semantic segmentation prediction results do not ignore the key objects at the same time , Retain local details .
版权声明
本文为[Zhiyuan community]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204231101108934.html
边栏推荐
- Understand the key points of complement
- Excel·VBA数组冒泡排序函数
- 学习网站资料
- Ueditor -- limitation of 4m size of image upload component
- vm设置静态虚拟机
- remote: Support for password authentication was removed on August 13, 2021.
- MBA-day6 逻辑学-假言推理练习题
- 关于JUC三大常用辅助类
- Analysis on the characteristics of the official game economic model launched by platoffarm
- STM32接电机驱动,杜邦线供电,然后反烧问题
猜你喜欢
关于JUC三大常用辅助类
进程间通信 -- 消息队列
Usage of rename in cygwin
STM32接电机驱动,杜邦线供电,然后反烧问题
Cumcm 2021 - B: préparation d'oléfines C4 par couplage éthanol (2)
Upgrade the functions available for cpolar intranet penetration
Solutions to common problems in visualization (IX) background color
More reliable model art than deep learning
《Neo4j权威指南》简介,求伯君、周鸿袆、胡晓峰、周涛等大咖隆重推荐
Detailed explanation of typora Grammar (I)
随机推荐
Constraintlayout layout
26. 删除有序数组中的重复项
An interesting interview question
Promise details
Mysql系列SQL查询语句书写顺序及执行顺序详解
妊娠箱和分娩箱的区别
MySQL分区表实现按月份归类
MySQL面试题讲解之如何设置Hash索引
Implementation of inserting millions of data into MySQL database in 10 seconds
Mysql排序的特性详情
MySQL8. 0 upgraded stepping on the pit Adventure
Detailed explanation of writing sequence and execution sequence of MySQL series SQL query statements
MySQL对数据表已有表进行分区表的实现
学习 Go 语言 0x07:《Go 语言之旅》中 Stringer 练习题代码
Visualization Road (10) detailed explanation of segmentation canvas function
软件测试人员,如何优秀的提Bug?
Excel·VBA自定义函数获取单元格多数值
SWAT - Introduction to Samba web management tool
How to bind a process to a specified CPU
Mba-day6 logic - hypothetical reasoning exercises