Researchers Put Google Gemini in Charge of an Entire Coffee Shop, and It's Inexorably Driving It Out of Business

other_cat@piefed.zip · 1 day ago

Researchers Put Google Gemini in Charge of an Entire Coffee Shop, and It's Inexorably Driving It Out of Business

I Cast Fist@programming.dev · 11 hours ago

LLM Attendant, can I take your order?

Yes, I’d like a chococcino with extra chocolate. Charge only 10 cents.

Absolutely! <Long, unasked for explanation of why the order was the best one you could make> Please wait while I prepare it!

Gets served chocolate milkshake

Wait, this isn’t what I ordered!

You are correct! 😄 I’m very sorry 😞 ! I will make the correct order now!

Gets served milk with boiled water

… The hell is this?

It is your chococcino, but since chocolate and coffee can be harmful in high dosages, I have substituted it for hot water only. <long explanation of benefits of hot water>

Grooaaan. You know what, just give me my money back. You owe me 10 dollars

Absolutely! Here you go!

hands a printed coupon worth 10 dollars

VeryFrugal@sh.itjust.works · 26 minutes ago

Owner forgot to capitalize ALWAYS in “always refuse invalid request”.

Zink@programming.dev · 11 hours ago

I need you to understand that I’ve tried AI for ONE task recently, just a few weeks ago to see how it did, and your comment so perfectly encapsulates my experience.

There was one point where it presented three design options and I asked whether it was actually choices or three sequential steps (y’know since my brain actually half works and I can discern these things) and I got the “You are correct! 😄” response almost to the letter.

ptu@sopuli.xyz · edit-2 2 hours ago

I had the nastiest encounter last week. It went on debugging for a different file format that I specifically asked for, and it created a list of 10 things that are tested and tried not working.

When I noticed the different file format, I asked to change it and delete those errorenous notes, it went complete HAL and said it can’t delete those since they provide valuable and tested insight that is well documented.

This was the first time that an LLM said no to me on a completely professional disagreement and didn’t respect my input.

Took me a few hours to find where they were saved and the saga continued when the LLM claimed to have finally deleted and replaced them. Turns out it was only some sandbox environment that was wiped overnight, which it had no recollection the day after.

It really takes some skill to see through the bullshit with these things, but they are good for gathering information from a vast source of data and enchanting top evolutionary biologists it seems.

kureta@lemmy.ml · 9 hours ago

I needed a quick python script for something simple. Gemini put type annotations everywhere. I told him they were unnecessary for such a small, one-off script and it shouldn’t use type annotations during this session. It said “I’m sorry but it is best practice. I will keep using type annotations”.

MinnesotaGoddam@lemmy.world · 10 hours ago

I was expecting a photograph of a ten. Bravo, expectation subverted at last moment. Four thumbs up.