• fidodo@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      7 months ago

      The llm is executing a function on a diffusion image model. The llm does not generate the image itself

      • kelvie@lemmy.ca
        link
        fedilink
        English
        arrow-up
        5
        arrow-down
        2
        ·
        7 months ago

        This doesn’t contradict what the OP said. ChatGPT is now an interface to both an LLM and a diffusion-based image generator.

    • h3rm17@sh.itjust.works
      link
      fedilink
      English
      arrow-up
      4
      arrow-down
      2
      ·
      7 months ago

      Yeah, but the model that does the images is actually Dall-e, you are just using gpt’s interface to create them

    • Nexz@feddit.nl
      link
      fedilink
      English
      arrow-up
      2
      arrow-down
      1
      ·
      7 months ago

      I mean, the GPT model is a LLM and ChatGPT uses DALL-E in the background to create images. So depending on definition you’re both correct :-)