Microsoft develops artificial intelligence enabled ‘bot artist’
The Microsoft technology, using artificial intelligence, can generate images of everything from ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus
Washington: Microsoft researchers are developing an artificial intelligence (AI) enabled ‘drawing bot’ that can create images from text descriptions of an object. The technology can generate images of everything from ordinary pastoral scenes, such as grazing livestock, to the absurd, such as a floating double-decker bus, Microsoft said in a blog post.
Each image contains details that are absent from the text descriptions, indicating that this artificial intelligence contains an artificial imagination, it said. The technology under development in Microsoft’s research labs is programmed to pay close attention to individual words when generating images from caption-like text descriptions, the company said.
This deliberate focus produces a nearly three-fold boost in image quality compared to the previous state-of-the-art technique for text-to-image generation, according to results on an industry standard test reported in a research paper posted on arXiv.org.
“If you go to Bing and you search for a bird, you get a bird picture. But here, the pictures are created by the computer, pixel by pixel, from scratch,” said Xiaodong He, a principal researcher at Microsoft’s research lab in Washington.
He and colleagues started with technology that automatically writes photo captions—the CaptionBot— and then moved to the one that answers questions humans ask about images, such as the location or attributes of objects, which can be especially helpful for blind people.
“Now we want to use the text to generate the image,” said Qiuyuan Huang, a postdoctoral researcher in He’s group. Text-to-image generation technology could find practical applications acting as a sort of sketch assistant to painters and interior designers, or as a tool for voice-activated photo refinement, the researchers said.
At the core of Microsoft’s drawing bot is a technology known as a generative adversarial network, or GAN. The network consists of two machine learning models, one that generates images from text descriptions and another, known as a discriminator, that uses text descriptions to judge the authenticity of generated images.
Editor's Picks »
- Mukesh Ambani vs Jeff Bezos set to begin from Gujarat
- Marco Pierre White: ‘Chefs are not geniuses or artists, they are just workers’
- RBI will take steps to help sustain growth: Shaktikanta Das
- India is at par with China in space race: Isro’s K. Sivan
- AAP rules out alliance with Congress for Elections 2019
- What to expect from Q3 results of IndiGo, SpiceJet, Jet Airways
- Forget privatisation, govt has hugged its banks tighter
- Flat profit, rising debt are growing worries for Reliance
- Q3 results: HUL growth off a high base shows it’s on a roll
- DCB Bank Q3 results: Small loans give big pain as farm, mortgages lift delinquencies