How This AI Image Won a Major Photography Competition

2023-05-04 10:51:02
关注

In March the Sony World Photography Awards announced the winning entry in their creative photo category: a black-and-white image of an older woman embracing a younger one, entitled PSEUDOMNESIA: The Electrician. The press release announcing the win describes the photograph as “haunting” and “reminiscent of the visual language of 1940s family portraits.”

But the artist, Berlin-based Boris Eldagsen, turned down the award. His photograph was not a photograph at all, he announced: he had crafted it through creative prompting of DALL-E 2, an artificial intelligence image generator.

“I applied as a cheeky monkey, to find out if the [competitions] are prepared for AI images to enter. They are not,” Eldagsen explained on his website. His stunt has sparked controversy and conversation about when AI-generated or assisted images should be considered art.

Scientific American spoke with Eldagsen about the image’s creation and the future of AI-aided “promptography.”

[An edited transcript of the interview follows.]

How did you get started with AI art?

I started with photography because drawing was a lonely job. I was always experimenting. So when AI generators started, I was hooked from the very beginning. For me, as an artist, AI generators are absolute freedom. It’s like the tool I have always wanted. I was always working from my imagination as a photographer, and now the material I work with is knowledge. And if you are older, it’s a plus, because you can put all your knowledge into prompting and creating images. If I were 15, I would have probably just generated Batman.

Where did the inspiration for The Electrician come from?

I did it for myself as an exercise, and I just love the result. It sparked off from a project that started years back. My father was born in 1924. So he went to war when he was 17 but, like most of that generation in Germany, never talked about it. After his death, I found some images from the forties my mom and I hadn’t seen before. I learned a lot about their time just looking at these images, and I started to collect images from the forties at flea markets, and also on eBay, but didn’t know what to do with them.

So my first experiment was: Can I re-create images of that time using AI? And then “The Electrician” just grew. The best images are those you didn’t have in your mind before. They came out of the process. You start, and it leads you somewhere—with AI it’s the same. You start somewhere and then you make many different decisions. You delete elements, you add frames. Sometimes the AI has very good suggestions. Sometimes it’s just crap. That takes time and patience, so it’s not finished in 20 seconds. It can take days.

So how did you actually make this image?

I used DALL-E 2, and it was all done by text prompts and inpainting and outpainting. For inpainting, you could say, “I don’t like his tie,” and you erase it and write, “I want him to have a white tie.” Then you get suggestions. And if you don’t like any of those suggestions, you start again. Outpainting [is what] you do when the frame is not large enough. You put in an additional frame so you can see his whole tie, his pants, the chair, the floor. It’s endless.

Why did you decide to submit it to a photography competition?

I’ve been very involved in AI and photography—I’ve become one of the experts in Germany—so it’s not just me poking fun. I wanted to test if a competition has taken into account that AI-generated images can be sent in. I applied to three different competitions, and the image always was a finalist. There’s something about the image. When I apply, I don’t say it’s AI-generated. I keep the information very short: just the image and the title. Then when it was selected, I said the art is AI-generated.

What I was hoping for has happened: the conversation has started, and it was basically with the help of the community. I did not expect it to be that big; I thought it would have been a conversation for a week in the European photo scene.

Do you think someone could have figured out that it was AI-generated just by looking at the image?

Of course. There’s a difference in color that comes through outpainting. On the left side it’s too yellow, and on the right side it changes into black and white. And then maybe the fingers, but also part of the arm on the right side, you can tell. If you work with it on a daily basis, like I do, you can tell.

Have you ever been fooled by an AI image?

There’s a German magazine called GEO; it’s something like National Geographic in Germany. They had an online test with images, asking “Is it generated or authentic?” I failed once.

I think with The Electrician, it’s very easy to tell because it’s an old image from early last September. But I think by the end of this year, we won’t be able to tell.

Does that alarm you?

As an artist, I just love it. As a citizen, I’m deeply concerned. Most kinds of photography can be augmented by AI but not the photojournalism part. The press needs to come up with a system to make it clear what is authentic, manipulated or generated. The Pope Balenciaga AI-generated photo should have always been notated. If you don’t do that, democracy will be manipulated and misinformed by anyone who can write five words.

But to fact-check is a lot of work. That takes time. So for the press to do that, to pay all the staff and to also have AI technology to help—who is going to pay for this? Now as the citizens, I say, we cannot let the press work alone. It’s very important for a democratic society [to be able to distinguish real photos from fakes]. So we have to think about the structure to co-finance that [fact-checking] as citizens, as a democratic state. But how can we co-finance it and still maintain the freedom of press? This is something we need to think about.

So the future of democracy and journalism aside—how will AI fit in the art world?

One thing I propose is to clean up the terminology and not call realistic AI art “AI photography” anymore, because it’s not photography. And one suggestion that came out of the community was “promptography,” and I just love it. It is large enough to encompass that the result can look like a drawing, like a painting, like a photo.

The next step would be to talk about the relationship between promptography and photography. Do they belong into one basket—one museum, festival, gallery, competition? It’s very complex. And I don’t have any answer for that. The only thing I can say is that the easy answers on both sides—those who want to go back to analog times and those who say promptography is photography—are nonsense. We need to think deeper than that.

参考译文
这幅由人工智能生成的图像如何赢得了一项重要摄影比赛
三月份,索尼世界摄影奖公布了他们创意摄影类别中的获奖作品:一幅黑白照片,展现了一位年长妇女拥抱一位年轻妇女的画面,标题为《PSEUDOMNESIA:电工》。宣布获奖的新闻稿将这张照片描述为“令人不安”并“让人联想起1940年代家庭肖像的语言风格”。但这位艺术家、柏林的博里斯·埃尔达根拒绝了该奖项。他宣布,这张照片根本不是摄影,而是一张他通过DALL-E 2(一种人工智能图像生成器)创意提示制作出的图像。“我以一种调皮的方式申请了比赛,想看看它们是否已经准备好接受AI图像的参赛。它们还没有准备好。”埃尔达根在其网站上解释道。他的这一行为引发了争议和讨论,也激发了人们对于人工智能生成或辅助图像是否应被视为艺术的思考。科学美国人杂志采访了埃尔达根,谈及这幅图像的创作过程以及未来人工智能辅助的“提示摄影”(promptography)。[以下是编辑后的访谈记录。]你是怎么开始接触人工智能艺术的? 我最初接触的是摄影,因为绘画是一种孤独的工作。我一直在做各种实验。当AI生成器出现时,我立刻被它吸引了。对我而言,作为一名艺术家,AI生成器代表着绝对的自由,它就像我一直梦想的工具。作为摄影师,我总是从自己的想象出发进行创作,而现在,我所处理的材料是知识。如果你年纪比较大,这反而是个优势,因为你可以将自己的全部知识用于提示和创作图像。如果我15岁才开始,我可能会生成蝙蝠侠的图像。《电工》的灵感来自哪里? 我创作它是为了自己练习,而且我非常喜欢结果。这次创作来源于几年前的一个项目。我的父亲出生于1924年,17岁时就参军上了战场。但和德国那一代人中的大多数人一样,他从未谈论过那段经历。在他去世后,我发现了一些我妈和我从未见过的四十年代的照片。通过观察这些照片,我了解到了他们那个年代的很多情况,于是我开始在跳蚤市场和eBay上收集四十年代的照片,但当时不知道如何处理它们。于是我的第一次实验是:我能否用AI重现那个时代的图像?然后“电工”就这样诞生了。最好的图片往往是那些你事先没有想到的,它们是在过程中产生的。你开始之后,就会被引导到某个方向,人工智能也是如此。你从某个点出发,然后做出许多不同的决定。你删除某些元素,添加一些边框。有时候AI的建议非常棒,有时候则一塌糊涂。这需要时间和耐心,不是20秒就能完成的,有时可能需要几天。你是如何制作这幅图像的? 我使用了DALL-E 2,整个过程都是通过文本提示、内绘(inpainting)和外绘(outpainting)完成的。对于内绘,你可以说:“我不喜欢他的领带”,然后你擦掉它并写下:“我想要一条白色领带。”之后你会得到一些建议。如果你不喜欢这些建议,就重新开始。外绘则是当画面的尺寸不够大时进行的。你可以添加一个额外的边框,这样你就能看到他的整条领带、裤子、椅子和地板。这几乎是无限延伸的。你为什么决定将它提交给摄影比赛? 我一直非常关注AI和摄影的结合,我已经成为德国的AI摄影专家之一,所以我并不是在开玩笑。我想测试一下,摄影比赛是否已经考虑到AI生成图像的参赛可能性。我向三个不同的比赛提交了这幅作品,它每次都进入了决赛阶段。这幅图像有某种吸引力。当我提交作品时,我并没有说明它是AI生成的,我只是提供图像和标题,信息非常简短。当它被选中后,我才说明它是AI生成的。我期望发生的事情已经发生了:人们开始讨论了,而且讨论是在社区的帮助下进行的。我没有预料到它会引起这么大的反响,我以为只是在欧洲摄影圈内讨论一周。你认为有人仅凭观看图像就能判断出它是AI生成的吗? 当然可以。通过外绘,颜色会有些差异。左边太黄了,而右边就变成黑白的了。此外,右边的手指和部分手臂你也能看出来。如果你像我一样每天都在使用AI,你就能分辨出来。你有没有被AI图像骗到过? 德国有一本叫GEO的杂志,有点像德国的国家地理杂志。他们曾在线上举行过一个测试,展示图像并提问“这是生成的还是真实的?”,我有一次没通过。我认为《电工》很容易被识别,因为它是去年九月初生成的旧图像。但我认为到今年年底,我们就无法分辨了。这让你感到担忧吗? 作为一名艺术家,我非常喜欢。但作为一名公民,我非常担心。大多数类型的摄影都可以借助AI进行增强,但新闻摄影除外。新闻媒体需要建立一个系统来明确哪些是真实的、经过篡改的或生成的。比如巴尔巴奇亚(Balenciaga)的教皇AI生成图片,应该一直被标注出来。如果你不这样做,那么民主社会就会被任何能写出五个单词的人所操控和误导。但事实核查是一项非常繁重的工作,需要大量时间和人力。新闻媒体要承担这项工作、支付所有员工的工资,并配备AI技术进行辅助——谁来为这些买单?作为公民,我认为我们不能让媒体独自承担。在民主社会中,能够分辨真实照片和假照片是非常重要的。因此,我们需要思考如何共同出资来支持这种事实核查,作为公民、作为民主国家。但如何共同出资又不损害新闻自由?这是一个需要思考的问题。那么,撇开民主和新闻业的未来不说,人工智能将在艺术界扮演什么角色? 我建议的一件事是,要清理术语,不要再把写实风格的AI艺术称为“AI摄影”,因为它不是摄影。社区中有人提出了“提示摄影”(promptography)这个概念,我非常喜欢。它足够广泛,涵盖的结果可以看起来像素描、像绘画,也可以像照片。下一步将是讨论提示摄影和摄影之间的关系。它们是否应该归入同一个类别——同一个博物馆、摄影节、画廊或比赛?这个问题非常复杂,而我还没有任何答案。我能说的唯一一点是,两边的简单答案——那些想回到模拟时代的人,以及那些认为提示摄影就是摄影的人——都是毫无意义的。我们需要有更深的思考。
您觉得本篇内容如何
评分

评论

您需要登录才可以回复|注册

提交评论

广告

scientific

这家伙很懒,什么描述也没留下

关注

点击进入下一篇

能用ChatGPT写论文吗?

提取码
复制提取码
点击跳转至百度网盘