Using Reddit’s popular ChangeMyView community as a source of baseline data, OpenAI had previously found that 2022’s ChatGPT-3.5 was significantly less persuasive than random humans, ranking in just the 38th percentile on this measure. But that performance jumped to the 77th percentile with September’s release of the o1-mini reasoning model and up to percentiles in the high 80s for the full-fledged o1 model.

So are you smarter than a Redditor?

  • satans_methpipe@lemmy.world
    link
    fedilink
    English
    arrow-up
    10
    ·
    12 hours ago

    Their models are more persuasive than a person and/or older model with internet access. Very impressive. I wager your stock is worth all of the gold in fort knox ($0).

    • T156@lemmy.world
      link
      fedilink
      English
      arrow-up
      4
      ·
      7 hours ago

      Their own older model, no less.

      It would be weirder/more of note if their new model was worse.