• thebestaquaman@lemmy.world
    link
    fedilink
    arrow-up
    2
    ·
    4 hours ago

    That isn’t what bothers me the most though. Earlier today I read a piece by a “tech journalist” in a paper I normally respect as doing proper work. The mentioned that one of the guys behind Claude says that Claude writes absolutely all their code now. They also said they did a test of one of the most recent models (released earlier this week), and that it wrote “A full Amazon-cloud based page that did various verification and authentication jobs, was about 67 000 lines of code, and was approved by the IT department in minutes in an afternoon”. The last part tells me they have no clue what they’re talking about. They just generated 67 000 lines of potential bugs that works, and which wasn’t reviewed by anyone competent. Nobody reviews 67 000 lines of code in a day, let alone minutes. Just the fact that they thought generating a shitload of boilerplate (most of the lines were likely that) impressive, says enough.

    It’s not your average Joe thinking this is cool that bothers me (it is cool). It’s when allegedly competent people start thinking the LLM actually has any idea what it’s doing.

    • RamenJunkie@midwest.social
      link
      fedilink
      English
      arrow-up
      1
      ·
      2 hours ago

      Yeah, I worry about my Claude projects, that do simple tasks, that only I use, generally inside my home network where no one can exploit it not that anyone would want to.

      Meanwhile, these huge companies are writing 50-100% of their shipping products used by millions in some cases.