当前位置:网站首页>MIT: label every pixel in the world with unsupervised! Humans: no more 800 hours for an hour of video
MIT: label every pixel in the world with unsupervised! Humans: no more 800 hours for an hour of video
2022-04-23 11:10:00 【Zhiyuan community】
Taking the advantage of ICLR 2022 On the occasion of the award ,MIT、 Cornell 、 Google and Microsoft 「 To show off 」 A new SOTA—— Label every pixel in the world , And there is no need for manual work !
Address of thesis :https://arxiv.org/abs/2203.08414
From the effect of the comparison picture , This method is sometimes even more detailed than manual work , Even the shadows are marked .
But unfortunately , Although it looks very cool , But there was no shortlist ( Including nominations ).
Say back to CV field , Actually , The problem of labeling data has plagued the academic circles for a long time .
For humans , Whether it's avocado or mashed potatoes , Even 「 Alien Mothership 」, Just take a look at , You can recognize .
But for machines , It's not that simple .
Make a data set for training , You need to frame the specific content in the image , At present, this matter can only be carried out manually .
such as , A dog sitting on the grass , Then you need to circle the dog first , And note ——「 Dog 」, And then put a note on the back piece of land 「 The grass 」.
Based on this , The trained model can make 「 Dog 」 and 「 The grass 」 Differentiate .
and , This matter is very troublesome .
You don't do it , It's hard for the model to recognize objects 、 Human or other important image features .
Do it , And very troublesome .
For human taggers , Segmented images cost about... More than classification or target detection 100 Times the energy .
Just labels 1 An hour of data takes 800 Hours .
The data indicates the worker : I'm going to graduate, too ?
In order that human beings no longer have to endure 「 mark 」 The torture of ( Of course, it is mainly to promote the progress of Technology ), The group of scientists just mentioned proposed a new method based on Transformer Methods 「STEGO」, Thus, the task of image semantic segmentation can be completed without supervision .
The purpose of unsupervised semantic segmentation is to find and locate semantic categories in image corpus , Without any form of annotation .
To solve this problem ,STEGO The algorithm must generate significant and compact enough features for each pixel , To form different clusters .
Different from the previous end-to-end model ,STEGO A method of separating feature learning from clustering is proposed , Will look for similar images that appear in the entire dataset , then , It associates these similar objects , To achieve pixel level label prediction .
stay CocoStuff On dataset ,27 Category specific unsupervised semantic segmentation tasks ( Including the ground 、 sky 、 Architecture 、 lawn 、 Vehicle 、 people 、 Animal, etc. ).
Baseline method comparison Cho wait forsomeone 2021 Put forward in PiCIE Method , The picture results show ,STEGO The semantic segmentation prediction results do not ignore the key objects at the same time , Retain local details .
版权声明
本文为[Zhiyuan community]所创,转载请带上原文链接,感谢
https://yzsam.com/2022/04/202204231101108934.html
边栏推荐
- Excel·VBA自定义函数获取单元格多数值
- Usage Summary of datetime and timestamp in MySQL
- Oracle连通性测试小工具
- Introduction to neo4j authoritative guide, recommended by Qiu Bojun, Zhou Hongxiang, Hu Xiaofeng, Zhou Tao and other celebrities
- Code implementation of general bubbling, selection, insertion, hill and quick sorting
- Microsoft Access database using PHP PDO ODBC sample
- 学习 Go 语言 0x03:理解变量之间的依赖以及初始化顺序
- Use of SVN:
- 使用 PHP PDO ODBC 示例的 Microsoft Access 数据库
- mysql中整数数据类型tinyint详解
猜你喜欢
数据库管理软件SQLPro for SQLite for Mac 2022.30
采用百度飞桨EasyDL完成指定目标识别
After the MySQL router is reinstalled, it reconnects to the cluster for boot - a problem that has been configured in this host before
An interesting interview question
Microsoft Access database using PHP PDO ODBC sample
Google Earth Engine(GEE)——将原始影像进行升尺度计算(以海南市为例)
Detailed explanation of typora Grammar (I)
Jupyter lab top ten high productivity plug-ins
进程间通信 -- 消息队列
语雀文档编辑器将开源:始于但不止于Markdown
随机推荐
Jupyter Lab 十大高生产力插件
Prevent SQL injection in web projects
C语言之结构体(进阶篇)
VM set up static virtual machine
Differences among restful, soap, RPC, SOA and microservices
期货开户哪个公司好?安全靠谱的期货公司谁能推荐几家?
After the MySQL router is reinstalled, it reconnects to the cluster for boot - a problem that has been configured in this host before
Mysql排序的特性详情
Three web components (servlet, filter, listener)
SVN的使用:
MySQL interview questions explain how to set hash index
Mysql中一千万条数据怎么快速查询
Visual solutions to common problems (VIII) mathematical formulas
Code implementation of general bubbling, selection, insertion, hill and quick sorting
Usage of rename in cygwin
妊娠箱和分娩箱的区别
Promise details
VScode
Mysql系列SQL查询语句书写顺序及执行顺序详解
About the three commonly used auxiliary classes of JUC