AI’s creative abilities are outstripping its driving skills. While self-driving car technology is going nowhere, there has been an explosion in research around generative models, or artificial intelligence (AI) systems that can create images from simple text. In the span of a week, AI researchers from Meta and Alphabet’s Google have leapt forward, developing systems that can generate videos with just about any text prompt one can imagine.
The videos from Meta look like trippy dream sequences: a teddy bear painting flowers or a horse galloping over a field. They last about one or two seconds and have a glitchy quality that betrays their source, but they’re still remarkable. The videos generated by Google, of coffee being poured in a cup or a flight over a snowy mountain, look especially realistic.
Google has also built an even more impressive second system called Phenaki that can create longer videos, two minutes or more. Here’s a prompt Google used for one: “Lots of traffic in futuristic city. An alien spaceship arrives to the futuristic city. The camera gets inside the alien spaceship. The camera moves forward until showing an astronaut in the blue room. The astronaut is typing in the keyboard. The camera moves away from the astronaut. The astronaut leaves the keyboard and walks to the left…” That’s less than a third of the entire prompt, which reads like a movie script, and the resulting clip, posted on Twitter by Dumitru Erhan, one of Phenaki’s creators at Google Brain, is remarkable. You may be thinking this is the end of Hollywood as we know it or that anyone with a few brain cells and a computer will soon be making films. That’s actually along the lines of Google’s idea, which Erhan tweeted was to empower people to “create their own visual stories… [to] make creativity for people easier.”
It’s hard to see AI-generated videos at your local movie theatre anytime soon, but they’ll be posted on social media feeds, particularly TikTok, Insta Reels and YouTube.
TikTok users love adding stickers, text and green screens to posts and the platform has new tech for it. In August, it added an AI image generator to its app for stylized green screens. Type in a prompt like “Boris Johnson” and TikTok will bring up an abstract image reminiscent of the UK politician.
What happens when machines not only recommend videos that keep us scrolling, but have a greater hand in creating them too? Many of us love to watch footage of cute cats and people tripping over themselves, so an algorithm that could produce fake montages of awkward stumbles or frisky kittens would attract viral hits with little work, so long as they appear real.
Content creators on TikTok, and also the platforms themselves, have every incentive to exploit a tool that can spew out videos at scale, cheap and easy. For the rest of us, the result would be social media feeds that are more machine-driven than ever.
Already powered by AI and recommendation algorithms, AI videos would add to the self-reinforced feedback loops that scratch our cognitive itches.
The other worry is misinformation... but social media platforms have been trying to weed out fake content, and both Google and Facebook are refusing to release their video making tools because of the potential for misuse. Google said that its own system had generated videos that were biased against women even when it tried to filter out such stereotyped results. The model or its source code won’t be released until the problem is fixed, Google researchers said.
Of course, soon enough you’ll be able to use these tools with few restrictions anyway, thanks to organizations like Stability AI. The British startup released an image-generation tool last August that allowed anyone to generate cool art, as well as fake photos of celebrities, politicians and war zones, something larger AI companies have banned. I tried the tool and in seconds was able to cook up photos of former President Donald Trump playing golf with North Korean leader Kim Jong Un. Stability is working on a video-generation tool that it plans to release publicly when it’s ready.
But while greater access to such tools will lead to more fake content, it will also mean more people are aware that the tools exist. They’re more likely to suspect that ‘photo’ of President Joe Biden punching an old lady is AI-generated. That’s the hope, anyway.
Just as concerning is what these tools will do to people’s daily diets of content. Google’s researchers contend that their tools will augment human creativity. But when it becomes so easy to make video that you barely have to think about it, is that really harnessing our imagination? Maybe not in every case. Coupled with recommendation engines that drive so much of what we see online, geared toward generating clicks, that makes our future look much more machine-directed—and, arguably, not very creative.
Catch all the Business News, Market News, Breaking News Events and Latest News Updates on Live Mint. Download The Mint News App to get Daily Market Updates.
MoreLess