MIT: Labeling every pixel in the world without supervision! Humans: no more spending 800 hours on one hour of video
2022-04-23 11:10:00 【Zhiyuan community】
Timed to coincide with the ICLR 2022 awards, MIT, Cornell, Google, and Microsoft "showed off" a new SOTA: labeling every pixel in the world, with no manual work required!

Paper: https://arxiv.org/abs/2203.08414
Judging from the comparison figures, this method is sometimes even more detailed than manual annotation; even the shadows are labeled.

Unfortunately, cool as it looks, the paper did not make the award shortlist (nominations included).
Back to the CV field: the problem of labeling data has in fact plagued academia for a long time.
For humans, whether it is an avocado, mashed potatoes, or even an "alien mothership", one glance is enough to recognize it.
For machines, it is not that simple.
To build a dataset for training, you need to outline the specific content in each image, and at present this can only be done by hand.
For example, given a dog sitting on grass, you first have to outline the dog and label it "dog", and then label the ground behind it "grass".
A model trained on such labels can then tell "dog" and "grass" apart.
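To make the idea of pixel-level annotation concrete, here is a minimal sketch of what such a label could look like in code. The tiny 4x6 mask, the class ids, and the helper `pixels_per_class` are all hypothetical and chosen only for illustration; real datasets use much larger masks and their own label sets.

```python
from collections import Counter

# Hypothetical label set for the dog-on-grass example.
CLASS_NAMES = {0: "grass", 1: "dog"}

# Hypothetical 4x6 annotation: one class id per pixel.
label_mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0, 0],
    [0, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0],
]


def pixels_per_class(mask, class_names):
    """Count how many pixels the annotator assigned to each class."""
    counts = Counter(class_id for row in mask for class_id in row)
    return {class_names[class_id]: n for class_id, n in counts.items()}


print(pixels_per_class(label_mask, CLASS_NAMES))
```

Every pixel needs its own class id, which is one way to see why this kind of annotation costs so much more than attaching a single label to a whole image.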

Moreover, this work is very tedious.
If you skip it, the model struggles to recognize objects, people, or other important image features.
If you do it, it is extremely laborious.
For human annotators, segmenting an image takes roughly 100 times the effort of classification or object detection.
Labeling just 1 hour of data takes 800 hours.
Data annotators: "So am I about to 'graduate' too?"
To spare humans the torture of annotation (and, of course, mainly to advance the technology), the group of scientists mentioned above proposed STEGO, a new Transformer-based method that completes the image semantic segmentation task without supervision.
The goal of unsupervised semantic segmentation is to discover and localize semantic categories within an image corpus without any form of annotation.
To solve this problem, the STEGO algorithm must produce features for every pixel that are both semantically meaningful and compact enough to form distinct clusters.
Unlike previous end-to-end models, STEGO separates feature learning from clustering: it looks for similar images that appear across the entire dataset, then associates these similar objects to predict pixel-level labels.
On the CocoStuff dataset, the task is unsupervised semantic segmentation over 27 categories (including ground, sky, buildings, lawn, vehicles, people, animals, and so on).
The baseline for comparison is the PiCIE method proposed by Cho et al. in 2021. The figures show that STEGO's semantic segmentation predictions preserve local detail without overlooking key objects.
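The clustering half of this idea can be sketched in a few lines. The example below is not the STEGO implementation: it assumes per-pixel feature vectors are already available (here they would be hand-made toy vectors rather than the output of a Transformer backbone) and uses plain cosine-similarity k-means as a stand-in for the clustering step. The function name `cluster_pixel_features` and its parameters are hypothetical.

```python
import math
import random


def normalize(vec):
    """Scale a vector to unit length so dot products equal cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cluster_pixel_features(features, num_clusters, num_iters=20, seed=0):
    """Group pixels with cosine-similarity k-means (a stand-in, not STEGO itself).

    features: H x W nested list; each entry is that pixel's feature vector.
    Returns an H x W nested list of cluster ids.
    """
    height, width = len(features), len(features[0])
    flat = [normalize(features[r][c]) for r in range(height) for c in range(width)]

    # Initialize cluster centers from randomly chosen pixels.
    rng = random.Random(seed)
    centers = [list(v) for v in rng.sample(flat, num_clusters)]

    def assign():
        # Each pixel goes to the center with the highest cosine similarity.
        return [max(range(num_clusters), key=lambda k: dot(v, centers[k]))
                for v in flat]

    for _ in range(num_iters):
        labels = assign()
        for k in range(num_clusters):
            members = [v for v, lab in zip(flat, labels) if lab == k]
            if members:  # keep the old center if a cluster is empty
                mean = [sum(col) / len(members) for col in zip(*members)]
                centers[k] = normalize(mean)

    labels = assign()
    return [labels[r * width:(r + 1) * width] for r in range(height)]
```

Calling `cluster_pixel_features(features, num_clusters=27)` on a feature map would yield one cluster id per pixel. The ids are arbitrary: no annotation is used, so mapping a cluster to a name such as "grass" is a separate step.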

Copyright notice
This article was created by [Zhiyuan community]. Please include the original link when reposting. Thank you.
https://yzsam.com/2022/04/202204231101108934.html