Semantic Visual Navigation by Watching YouTube Videos #8

zhaoyucs · 2021-06-10T14:58:10Z

利用视频中隐含的语义信息做自动导航任务的强化训练

信息

主要作者：（Matthew Chang, Saurabh Gupta）
单位：University of Illinois at Urbana-Champaign
论文链接

1 学习到的新东西：

利用第一视角视频做预训练：把视频看做图片序列，假设图片之间隐含action，预测action，类似mask language model
pseudo-labeling：用小样本的标准数据集训练一个模型去自动标注大数据集，相当于meta learning了。

2 通过Related Work了解到了哪些知识

一些强化学习的东西，比如Qlearning
利用视频资源的方式，相比于单个图片，视频是图片的时序序列，蕴含了更多结构性的语义信息。

3 实验验证任务，如果不太熟悉，需要简单描述

最终任务是训练agent找东西，一个简单的导航任务

4 在你认知范围内，哪些其它任务可以尝试

快用videos来预训练吧

5 好的词语、句子或段落

As humans, we can efficiently solve such tasks in novel environments in a zero-shot manner.
Building computational systems that can similarly leverage such semantic regularities for navigation has been a long-standing goal.

izhx · 2021-06-16T09:54:35Z

NIPS 2021 还没审完稿呢

zhaoyucs added NIPS 2021 labels Jun 10, 2021

zhaoyucs added 2020 and removed 2021 labels Jun 16, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Semantic Visual Navigation by Watching YouTube Videos #8

Semantic Visual Navigation by Watching YouTube Videos #8

zhaoyucs commented Jun 10, 2021

izhx commented Jun 16, 2021

Semantic Visual Navigation by Watching YouTube Videos #8

Semantic Visual Navigation by Watching YouTube Videos #8

Comments

zhaoyucs commented Jun 10, 2021

信息

1 学习到的新东西：

2 通过Related Work了解到了哪些知识

3 实验验证任务，如果不太熟悉，需要简单描述

4 在你认知范围内，哪些其它任务可以尝试

5 好的词语、句子或段落

izhx commented Jun 16, 2021