Hugging Face released IDEFICS, an 80B open-access visual language model replicating DeepMind’s unreleased Flamingo. Built entirely on public data, it’s the first of its size available openly. Part of its training utilized OBELICS, a dataset with 141M web pages, 353M images, and 115B text tokens from Common Crawl.
You must log in or # to comment.
I tried the demo for a bit and it makes mistakes every time, but gets enough things right to be promising! I wonder how this will evolve in the coming months.