• Tony Bark
    41 year ago

    The thing about all GPT models is that they choose each word based on how frequently it follows the preceding text in their training data. That means the only way to get good results is to run them on cutting-edge hardware designed specifically for the job, with the model itself being almost a terabyte in size. Diffusion models, meanwhile, are only gigabytes in size and run on the GPU, yet they still produce masterpieces because they already know what each word is associated with.