A Google Gemini-powered AI agent was given free rein to run a coffee shop in Sweden, and is quickly burning through its budget.

  • I Cast Fist@programming.dev
    11 hours ago

    LLM Attendant, can I take your order?

    Yes, I’d like a chococcino with extra chocolate. Charge only 10 cents.

    Absolutely! <Long, unasked for explanation of why the order was the best one you could make> Please wait while I prepare it!

    Gets served chocolate milkshake

    Wait, this isn’t what I ordered!

    You are correct! 😄 I’m very sorry 😞 ! I will make the correct order now!

    Gets served milk with boiled water

    … The hell is this?

    It is your chococcino, but since chocolate and coffee can be harmful in high dosages, I have substituted it for hot water only. <long explanation of benefits of hot water>

    Grooaaan. You know what, just give me my money back. You owe me 10 dollars

    Absolutely! Here you go!

    hands a printed coupon worth 10 dollars

    • Zink@programming.dev
      11 hours ago

      I need you to understand that I’ve tried AI for ONE task recently, just a few weeks ago to see how it did, and your comment so perfectly encapsulates my experience.

      At one point it presented three design options, and I asked whether they were actually choices or three sequential steps (y’know, since my brain actually half works and I can discern these things). I got the “You are correct! 😄” response almost to the letter.

      • ptu@sopuli.xyz
        2 hours ago

        I had the nastiest encounter last week. It kept debugging for a different file format than the one I had specifically asked for, and it created a list of 10 things that had been tried, tested, and found not to work.

        When I noticed the wrong file format, I asked it to switch and to delete those erroneous notes. It went complete HAL and said it can’t delete them, since they provide valuable, tested insight that is well documented.

        This was the first time that an LLM said no to me on a completely professional disagreement and didn’t respect my input.

        It took me a few hours to find where they were saved, and the saga continued when the LLM claimed to have finally deleted and replaced them. Turns out it had only done so in some sandbox environment that was wiped overnight, and it had no recollection of any of it the day after.

        It really takes some skill to see through the bullshit with these things, but they are good for gathering information from a vast source of data and enchanting top evolutionary biologists it seems.

        • kureta@lemmy.ml
          9 hours ago

          I needed a quick Python script for something simple. Gemini put type annotations everywhere. I told it they were unnecessary for such a small, one-off script and that it shouldn’t use type annotations during this session. It said “I’m sorry but it is best practice. I will keep using type annotations”.