Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs

Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs (venturebeat.com) 31

Posted by msmash on Wednesday February 14, 2024 @02:01PM from the moving-forward dept.

Apple researchers have unveiled a new AI tool called "Keyframer," (PDF) which harnesses the power of large language models (LLMs) to animate static images through natural language prompts. From a report: This novel application, detailed in a new research paper published on arxiv.org, represents a giant leap in the integration of artificial intelligence into the creative process -- and it may also hint at what's to come in newer generations of Apple products such as the iPad Pro and Vision Pro. The research paper, titled "Keyframer: Empowering Animation Design using Large Language Models," explores uncharted territory in the application of LLMs to the animation industry, presenting unique challenges such as how to effectively describe motion in natural language.

Imagine this: You're an animator with an idea that you want to explore. You've got static images and a story to tell, but the thought of countless hours bending over an iPad to breathe life into your creations is, well, exhausting. Enter Keyframer. With just a few sentences, those images can begin to dance across the screen, as if they've read your mind. Or rather, as if Apple's large language models (LLMs) have. Keyframer is powered by a large language model (in the study, they use GPT-4) that can generate CSS animation code from a static SVG image and prompt. "Large language models have the potential to impact a wide range of creative domains, but the application of LLMs to animation is under-explored and presents novel challenges such as how users might effectively describe motion in natural language," the researchers explain.

Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs

This discussion has been archived. No new comments can be posted.

Load All Comments

Search 31 Comments Log In/Create an Account

Comments Filter:

reading is the key (Score:2)

by zlives ( 2009072 ) writes:

i read it as keyfarmer...
- - - - Re: (Score:2)
        
        by iMadeGhostzilla ( 1851560 ) writes:
        
        Please leave the poor old man with diminished mental faculties alone.
Maybe some day... (Score:1)

by MpVpRb ( 1423381 ) writes:

AI researchers will invent something useful, like tools for finding tricky bugs in software. Even more useful would be a simulation that accurately predicted the real world results of proposed laws
Instead, they work on tools that allow people to make more crap that is useless at best, and potentially very harmful if used as a weapon
- Re: (Score:3)
  
  by Drethon ( 1445051 ) writes:
  
  AI researchers will invent something useful, like tools for finding tricky bugs in software. Even more useful would be a simulation that accurately predicted the real world results of proposed laws
  Instead, they work on tools that allow people to make more crap that is useless at best, and potentially very harmful if used as a weapon
  Some of the trickiest bugs in software are not really bugs, but implementation that differs from the intended purpose. To find these, the ML would need to understand what the software should do, which often the people writing the code don't fully understand.
  - Re: (Score:2)
    
    by nightflameauto ( 6607976 ) writes:
    
    AI researchers will invent something useful, like tools for finding tricky bugs in software. Even more useful would be a simulation that accurately predicted the real world results of proposed laws
    Instead, they work on tools that allow people to make more crap that is useless at best, and potentially very harmful if used as a weapon
    Some of the trickiest bugs in software are not really bugs, but implementation that differs from the intended purpose. To find these, the ML would need to understand what the software should do, which often the people writing the code don't fully understand.
    Have the LLMs "listen" to the spec meetings where marketing says one thing, accounting says another thing, customer service says another thing, the dev team lower expectations somewhere closer to reality, and then sales comes in to sweep the leg and send everybody back into daydream central. If an LLM can sort that mess out, then maybe the LLM can tell the developers WTF they're supposed to be building.
    - Re: (Score:2)
      
      by Drethon ( 1445051 ) writes:
      
      AI researchers will invent something useful, like tools for finding tricky bugs in software. Even more useful would be a simulation that accurately predicted the real world results of proposed laws
      Instead, they work on tools that allow people to make more crap that is useless at best, and potentially very harmful if used as a weapon
      Some of the trickiest bugs in software are not really bugs, but implementation that differs from the intended purpose. To find these, the ML would need to understand what the software should do, which often the people writing the code don't fully understand.
      Have the LLMs "listen" to the spec meetings where marketing says one thing, accounting says another thing, customer service says another thing, the dev team lower expectations somewhere closer to reality, and then sales comes in to sweep the leg and send everybody back into daydream central. If an LLM can sort that mess out, then maybe the LLM can tell the developers WTF they're supposed to be building.
      Or at least provide a nice concise summary of how what everyone wants all conflicts with what everyone else wants. Though I'm sure even with that, some members of the meetings would still think doing everything at once is possible.
      - Re: (Score:2)
        
        by narcc ( 412956 ) writes:
        
        Are you writing bad science fiction? LLMs can't do any of these things.
        
        Re: (Score:2)
        
        by Drethon ( 1445051 ) writes:
        
        Are you writing bad science fiction? LLMs can't do any of these things.
        Isn't pretty much all news on what LLMs will be doing tomorrow pretty much science fiction?
        
        Re: (Score:2)
        
        by narcc ( 412956 ) writes:
        
        There's a lot of nonsense about LLMs floating around these days. It wouldn't matter all that much, but those mistaken beliefs lead people to use them in inappropriate ways that have already caused real harm. In an incredibly boring twist, the danger from LLMs wasn't what they could do, but what people believed they could do.
    - Re: (Score:1)
      
      by angel'o'sphere ( 80593 ) writes:
      
      For that you need domain knowledge, which an LLM does not have.
      You could combine it with an expert system, but you then had already most of the software.
- They're doing just that (Score:2)
  
  by rsilvergun ( 571051 ) writes:
  
  and it's going to put a *lot* of programmers out of work. There's a lot less talk about it in the press because it's not as flashy as animating still images.
  - Re: (Score:3)
    
    by narcc ( 412956 ) writes:
    
    That's extremely unlikely. These things are far, far, less capable than you think.
AI killed the video star (Score:2)

by flyingfsck ( 986395 ) writes:

Hollywood stars are now on life support.
still image animation using LLM's? (Score:3)

by oldgraybeard ( 2939809 ) writes: on Wednesday February 14, 2024 @02:26PM (#64239474)

Guess when your only tool is a hammer every problem looks like a nail.

- Re: (Score:3)
  
  by ceoyoyo ( 59147 ) writes:
  
  This is a pretty good problem for an LLM. The description makes it sound more exciting than it is. They've taken an XML description of a bunch of image elements and used an LLM to generate CSS code to animate those elements. LLMs are good at generating simple but tedious code from a description of the desired outcome.
  It's not going to make the T1000 reconstitute itself from a puddle of silver goo, but it can make the cartoon moon rise on your animated birthday card web page.
  The paper is really a design stud
- Re: (Score:2)
  
  by ljw1004 ( 764174 ) writes:
  
  Guess when your only tool is a hammer every problem looks like a nail.
  I think when progress is being made at the phenomenal rate it is right now, then it's exactly the right course of action to see just how versatile your hammer is.
Sounds similar to Google Lumiere (Score:2)

by LindleyF ( 9395567 ) writes:

https://lumiere-video.github.i... [github.io]
- Re: (Score:1)
  
  by ACForever ( 6277156 ) writes:
  
  where do you think apple stole the idea from?
Faster shitty powerpoint animations, meh (Score:2)

by Pinky's Brain ( 1158667 ) writes:

This is not some first step towards automating 2D character animation, it's a toy with the only claim to fame being from Apple.
"precursor to a new era" (Score:2)

by ZipNada ( 10152669 ) writes:

"Keyframer could well be a precursor to a new era where the boundaries between creator and creation become increasingly fluid, guided by the invisible hand of artificial intelligence."
That creeped me out.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs (venturebeat.com) 31

Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs More Login

Apple Researchers Unveil Keyframer, an AI Tool That Animates Still Images Using LLMs

reading is the key (Score:2)

Re: (Score:2)

Maybe some day... (Score:1)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:1)

They're doing just that (Score:2)

Re: (Score:3)

AI killed the video star (Score:2)

still image animation using LLM's? (Score:3)

Re: (Score:3)

Re: (Score:2)

Sounds similar to Google Lumiere (Score:2)

Re: (Score:1)

Faster shitty powerpoint animations, meh (Score:2)

"precursor to a new era" (Score:2)

Related Links Top of the: day, week, month.

Slashdot Top Deals

Slashdot