• knightly the Sneptaur
    link
    fedilink
    arrow-up
    9
    ·
    1 month ago

    Given the error rate of LLMs, it seems more like they wasted $258 and a week that could have been spent on a human review.

    • OmnipotentEntity@beehaw.org
      link
      fedilink
      arrow-up
      23
      arrow-down
      1
      ·
      1 month ago

      LLMs are bad for the uses they’ve been recently pushed for, yes. But this is legitimately a very good use of them. This is natural language processing, within a narrow scope with a specific intention. This is exactly what it can be good at. Even if does have a high false negative rate, that’s still thousands and thousands of true positive cases that were addressed quickly and cheaply, and that a human auditor no longer needs to touch.

    • t3rmit3@beehaw.org
      link
      fedilink
      arrow-up
      17
      arrow-down
      1
      ·
      edit-2
      1 month ago

      What do you believe would make this particular use prone to errors?

        • t3rmit3@beehaw.org
          link
          fedilink
          arrow-up
          14
          ·
          1 month ago

          I think you may have misunderstood the purpose of this tool.

          It doesn’t read the deeds, make a decision, and submit them for termination all on its own. It reads them, identifies racial covenants based on patterns of language (which is exactly what LLMs are very good at), and then flags them for a human to review.

          This tool is not replacing jobs, because the whole point is that these reviews were never going to get the budget and manpower to be done manually, and instead would have simply remained on the books.

          I get being disdainful or even angry about LLMs in our unregulated-capitalism anti-worker hellhole because of the way that most companies are using them, but tools aren’t themselves good or bad, they’re just tools. And using a tool to identify racial covenants in legal documents that otherwise would go un-remediated, seems like a pretty good use to me.

          • knightly the Sneptaur
            link
            fedilink
            arrow-up
            3
            ·
            1 month ago

            So, what? They’re going to pay a human to OK the output and the whole lot of them never even gets seen?

            Say 12 minutes per covenant, that’s 1 million work hours that humans could get paid for. Pay them $50 an hour and it’s $50 million. That’s nothing, less than 36 hours worth of the $12.5 Billion in weapons shipments we’ve sent to Israel in the last year. We could pay for projects like this with the rounding errors on the budget for blowing up foreign kids, and the people we pay to do it could afford to put their kids through college.

            Instead, we get a project to train a robotic bigotry filter for real estate legalese and 50 more cruise missiles from the savings.

            • t3rmit3@beehaw.org
              link
              fedilink
              arrow-up
              10
              ·
              edit-2
              1 month ago

              I think you are confused about the delineation between local and federal governments. It’s not all one giant pool of tax money. None of Santa Clara County’s budget goes to missiles.

              Also, this feels like you are too capitalism-pilled, and rather than just spending the $240 to do this work, and using the remaining $49,999,760 to just fund free college or UBI programs, you’re like, “how about we pay these people to do the most mind-numbingly, soul-crushingly boring work there is, reading old legal documents?”

              You know what would actually happen if you did that? People would seriously read through them for 1 day, and then they’d be like, “clear”, “clear”, “clear” without looking at half of them. It’s not like you’re gonna find and fund another group to review the first group’s work, after all. So you’d still be where we are now, but you also wasted x* peoples’ time that they could have been enjoying doing literally anything else.

              • knightly the Sneptaur
                link
                fedilink
                arrow-up
                2
                ·
                1 month ago

                I think you are confused about the delineation between local and federal governments.

                I am not, I simply don’t believe the delineation is relevant since taxpayers fund both the state and federal budgets.

                Also, this feels like you are too capitalism-pilled

                This is me being “reasonable” and working within the constraints of the system. If we aren’t going to have free universal college et al then we can at least trade some of the bloated military budget for a public works program.

                People would seriously read through them for 1 day, and then they’d be like, “clear”, “clear”, “clear” without looking at half of them.

                Sounds to me like a 50% improvement over zero human eyes.

                It’s not like you’re gonna find and fund another group to review the first group’s work, after all.

                Why not? We could hire three teams to do it simultaneously in every state in the country and the cost would still be a tiny fraction of how much was wasted on the F-35 program.

                • howrar@lemmy.ca
                  link
                  fedilink
                  arrow-up
                  6
                  ·
                  1 month ago

                  Sounds to me like a 50% improvement over zero human eyes.

                  It certainly would be. Thankfully, there’s many more than zero human eyes involved in this.

        • GetOffMyLan@programming.dev
          link
          fedilink
          arrow-up
          5
          ·
          1 month ago

          One of LLMs main strengths over traditional text analysis tools is the ability to “understand” context.

          They are bad at generating factual responses. They are amazing at analysing text.

          • knightly the Sneptaur
            link
            fedilink
            arrow-up
            5
            ·
            1 month ago

            LLMs neither understand nor analyze text. They are statistical models of the text they were trained on. A map of language.

            And, like any map, they should not be confused for the territory they represent.

            If you admit that they have issues with facts, why would you assume that the randomly generated facts their “analysis” produces must be accurate?

            • GetOffMyLan@programming.dev
              link
              fedilink
              arrow-up
              3
              arrow-down
              1
              ·
              edit-2
              1 month ago

              I mean they literally do analyze text. They’re great at it. Give it some text and it will analyze it really well. I do it with code at work all the time.

              Because they are two completely different tasks. Asking them to recall information from their training is a very bad use. Asking them to analyze information passed into them is what they are great at.

              Give it a sample of code and it will very accurately analyse and explain it. Ask it to generate code and the results are wildly varied in accuracy.

              I’m not assuming anything you can literally go and use one right now and see.

              • apotheotic (she/her)@beehaw.org
                link
                fedilink
                English
                arrow-up
                7
                ·
                1 month ago

                The person you’re replying to is correct though. They do not understand, they do not analyse. They generate (roughly) the most statistically likely answer to your prompt, which may very well end up being text representing an accurate analysis. They might even be incredibly reliable at doing so. But this person is just pushing back against the idea of these models actually understanding or analysing. Its slightly pedantic, sure, but its important to distinguish in the world of machine intelligence.

                • GetOffMyLan@programming.dev
                  link
                  fedilink
                  arrow-up
                  3
                  arrow-down
                  1
                  ·
                  1 month ago

                  I literally quoted the word for that exact reason. It just gets really tiring when you talk about AIs and someone always has to make this point. We all know they don’t think or understand in the same way we do. No one gains anything by it being pointed out constantly.

                  • apotheotic (she/her)@beehaw.org
                    link
                    fedilink
                    English
                    arrow-up
                    4
                    ·
                    1 month ago

                    You said “they literally do analyze text” when that is not, literally, what they do.

                    And no, we don’t “all know” that. Lay persons have no way of knowing whether AI products currently in use have any capacity for genuine understanding and reasoning, other than the fact that the promotional material uses words like “understanding”, “reasoning”, “thought process”, and people talking about it use the same words. The language we choose to use is important!