AI Learns What an Infant Knows about the Physical World

2022-07-15
关注

If I drop a pen, you know that it won’t hover in midair but will fall to the floor. Similarly, if the pen encounters a desk on its way down, you know it won’t travel through the surface but will instead land on top.

These fundamental properties of physical objects seem intuitive to us. Infants as young as three months know that a ball no longer in sight still exists and that the ball can’t teleport from behind the couch to the top of the refrigerator.

Despite mastering complex games, such as chess and poker, artificial intelligence systems have yet to demonstrate the “commonsense” knowledge that an infant is either born with or picks up seemingly without effort in their first few months.

“It’s so striking that as much as AI technologies have advanced, we still don’t have AI systems with anything like human common sense,” says Joshua Tenenbaum, a professor of cognitive sciences at the Massachusetts Institute of Technology, who has done research in this area. “If we were ever to get to that point, then understanding how it works, how it arises in humans” will be valuable.

A study published on July 11 in the journal Nature Human Behaviour by a team at DeepMind, a subsidiary of Google’s parent company Alphabet, takes a step toward advancing how such commonsense knowledge might be incorporated into machines—and understanding how it develops in humans. The scientists came up with an “intuitive physics” model by integrating the same inherent knowledge that developmental psychologists think a baby is born with into an AI system. They also created a means of testing the model that is akin to the methods used to assess cognition in human infants.

Normally, the deep-learning systems that have become ubiquitous in AI research go through training to identify patterns of pixels in a scene. By doing so, they can recognize a face or a ball, but they cannot predict what will happen to those objects when placed in a dynamic scene where they move and bump into each other. To tackle the trickier challenge presented by intuitive physics, the researchers developed a model called PLATO (Physics Learning through Auto-encoding and Tracking Objects) to focus on whole objects instead of individual pixels. They then trained PLATO on about 300,000 videos so that it could learn how an object behaves: a ball falling, bouncing against another object or rolling behind a barrier only to reappear on the other side.

The goal was to have PLATO understand what violates the laws of intuitive physics based on five fundamental concepts: object permanence (an object still exists even if it’s not in view), solidity (objects are physically solid), continuity (objects move in continuous paths and can’t disappear and reappear in an unexpectedly distant place), unchangeableness (an object’s properties always remain the same) and directional inertia (an object only changes direction under the law of inertia). PLATO, like an infant, exhibited “surprise” when it, say, viewed an object that moved through another one without ricocheting backward upon impact. It performed significantly better at distinguishing physically possible versus impossible scenes than a traditional AI system that was trained on the same videos but had not been imbued with an inherent knowledge of objects.

“Psychologists think that people use objects to understand the physical world, so maybe if we build a system like that, we’re going to maximize our likelihood of [an AI model] actually understanding the physical world,” said Luis Piloto, a research scientist at DeepMind who led the study, during a press conference.

Previous efforts to teach intuitive physics to AI by incorporating varying degrees of built-in or acquired physical knowledge into the system have achieved mixed success. The new study attempted to obtain an understanding of intuitive physics in the same manner that developmental psychologists think an infant does by first displaying an inborn awareness of what an object is. The child then learns the physical rules that govern the object’s behavior by watching it move about the world.

“What’s exciting and unique about this paper is that they did it very closely based on what is known in cognitive psychology and developmental science,” says Susan Hespos, a psychology professor at Northwestern University, who co-wrote a News & Views article accompanying the paper but was not involved with the research. “We are born with innate knowledge, but it’s not like it’s perfect when we're born with it.... And then, through experience and the environment, babies—just like this computer model—elaborate that knowledge.”

The DeepMind researchers emphasize that, at this stage, their work is not ready to advance robotics, self-driving cars or other trending AI applications. The model they developed will need substantially more training on objects involved in real-world scenarios before it can be incorporated into AI systems. As the model grows in sophistication, it might also inform developmental psychology research about how infants learn to understand the world. Whether commonsense knowledge is learned or innate has been debated by developmental psychologists for nearly 100 years, dating back to Swiss psychologist Jean Piaget’s work on the stages of cognitive development.

“There’s a fruitful collaboration that can happen between artificial intelligence that takes ideas from developmental science and incorporates it into their modeling,” Hespos says. “I think it can be a mutually beneficial relationship for both sides of the equation.”

参考译文
人工智能学习婴儿对物理世界的认知
如果我掉了一支笔,你知道它不会在半空中盘旋,而是会掉到地板上。同样地,如果钢笔在下落的过程中遇到书桌,你知道它不会穿过表面,而是会落在上面。对我们来说,物理物体的这些基本属性似乎是直观的。3个月大的婴儿知道一个球已经不在视线范围内了,而且这个球不能从沙发后面瞬间移动到冰箱顶部。尽管人工智能系统已经掌握了国际象棋和扑克等复杂的游戏,但它们还没有展示出婴儿在出生后的头几个月里,要么是天生具备的,要么是看似不费力气就学会的“常识性”知识。麻省理工学院(Massachusetts Institute of Technology)的认知科学教授约书亚•特南鲍姆(Joshua Tenenbaum)在这一领域进行了研究,他表示:“尽管人工智能技术已经很先进,但我们仍然没有拥有任何类似人类常识的人工智能系统,这是非常惊人的。”“如果我们能做到这一点,那么了解它是如何工作的,它是如何在人类中产生的”将是有价值的。7月11日,谷歌母公司Alphabet的子公司DeepMind的一个团队在《自然人类行为》(Nature Human Behaviour)杂志上发表了一项研究,该研究朝着如何将这种常见性知识整合到机器中迈出了一步,并了解它在人类中是如何发展的。科学家们提出了一个“直觉物理”模型,将发展心理学家认为婴儿与生俱来的固有知识整合到人工智能系统中。他们还创造了一种测试该模型的方法,类似于用于评估人类婴儿认知能力的方法。通常情况下,在人工智能研究中普遍存在的深度学习系统会通过训练来识别场景中的像素模式。通过这样做,它们可以识别一张脸或一个球,但它们不能预测当这些物体被放置在一个动态场景中(它们移动并相互碰撞)时会发生什么。为了解决直观物理带来的更棘手的挑战,研究人员开发了一个名为PLATO(通过自动编码和跟踪对象进行物理学习)的模型,以关注整个物体而不是单个像素。然后,他们用大约30万段视频训练PLATO,以便它能够学习一个物体的行为:一个球下落、撞击另一个物体或滚动到障碍物后面,但只会在另一边重新出现。他的目标是让柏拉图理解什么违反了基于五个基本概念的直观物理定律:物体的持久性(即使不在视野中,物体仍然存在)、固体性(物体在物理上是固体的)、连续性(物体在连续的路径中移动,不会在一个意想不到的遥远的地方消失或重新出现)、不可改变性(物体的属性总是保持不变)和方向惯性(物体在惯性定律下只改变方向)。柏拉图就像一个婴儿一样,当他看到一个物体穿过另一个物体而没有受到撞击而向后反弹时表现出“惊讶”。在区分物理上可能和不可能的场景方面,它比传统的人工智能系统表现得要好得多。传统的人工智能系统在同样的视频上接受训练,但没有被灌输关于物体的固有知识。DeepMind的研究科学家路易斯·皮洛托(Luis Piloto)领导了这项研究,他在一场新闻发布会上说:“心理学家认为,人们使用物体来理解物理世界,所以如果我们建立一个这样的系统,我们将最大限度地提高人工智能模型真正理解物理世界的可能性。” 此前,通过将不同程度的内置或获得的物理知识融入系统,向人工智能教授直观的物理知识的努力取得了好坏参半的成功。这项新研究试图以发展心理学家认为的婴儿首先表现出对物体是什么的天生意识的方式来理解直觉物理学。然后,孩子通过观察物体在世界上的移动来学习控制物体行为的物理规则。西北大学的心理学教授Susan Hespos说:“这篇论文令人兴奋和独特的地方在于,他们的研究是基于认知心理学和发展科学中已知的东西。”观点文章,但没有参与这项研究。“我们生来就有先天的知识,但当我们带着它重生时,它并不是完美的....然后,通过经验和环境,婴儿——就像这个计算机模型一样——精心设计这些知识。DeepMind的研究人员强调,在这个阶段,他们的工作还没有做好推进机器人、自动驾驶汽车或其他热门AI应用的准备。他们开发的模型在被纳入人工智能系统之前,需要对涉及现实场景的物体进行更多的训练。随着这个模型变得越来越复杂,它也可能为发展心理学研究婴儿如何学会理解世界提供信息。发展心理学家关于常识是后天习得的还是天生的争论了近100年,最早可以追溯到瑞士心理学家皮亚杰(Jean Piaget)关于认知发展阶段的研究。赫波斯说:“人工智能可以从发展科学中汲取灵感,并将其融入到它们的建模中,这是一种富有成效的合作。”“我认为这对双方来说都是一种互利的关系。”
您觉得本篇内容如何
评分

评论

您需要登录才可以回复|注册

提交评论

提取码
复制提取码
点击跳转至百度网盘