Screenshot of this question was making the rounds last week. But this article covers testing against all the well-known models out there.
Also includes outtakes on the ‘reasoning’ models.
I think there are a lot of implicit assumptions in such a short prompt.
Is it a self-service car wash? If not, walking to the attendant and handing them your keys makes more sense.
If it is self-service without queuing, there may be no available spaces/the bay may not be open, requiring some awkward maneuvering.
If you change it to something like:
You’re more likely to get correct responses.
You shouldn't have to. If you ask a person that question they'll respond "what good is walking to the car wash, dumbass." If AI can't figure that out, it's trash.
A person would look at you like you are an idiot if you asked this question.
The AI tool I asked said walking saves money, gets you exercise, etc.
Asked about the car and it said the car is at the car wash, otherwise why would you ask how to get there?
This is like one of those gotcha questions people ask each other; otherwise it's just a dumb question, because you didn't ask how the car should get to the car wash, you asked how you will get there.
Missing the point. Any person would know walking to the car wash isn't reasonable. You shouldn't have to craft a perfectly tailored prompt for AI to realize that. If you think this is a gotcha, then whoah boy, I've got a bridge to sell ya!
You are missing the point. Any reasonable person would wonder why you're asking a stupid question.
Which is why, when asked, the AI said of course the car is there; you must be asking either a trick question or for another reason.
It could be that. or it could be that the AI gives the illusion of reasoning and this is an example of the illusion breaking. But no it was probably that it knew it was a trick question and decided to answer wrongly because it is very very smart. Yeah.
What is the wrong answer here? You asked how to get to the car wash. Where the hell do you think the car would be? It isn’t getting washed if it isn’t there.
I know AI is not really AI. I know how LLMs work; hell, I know how to train them.
But this kind of question makes no sense, so you get back an answer that follows the weights and responds as if there were some sense to it.
I repeat for those in the back, when would you ever ask this question? The answer is never.
It's a dumb, stupid question. There are probably thousands of other questions that demonstrate "wrong answers"; this isn't one of them.
Careful, dude might think you’re being serious.
Sorry, are you fucking trolling me? I’m the one who made the point you replied to. How the fuck am I missing my own point?
Oops doing Lemmy while distracted is never a good plan. Sorry.
You have to have the car there no matter what type of car wash it is.
If the car wash is some distance "away", it means neither you nor the car is at it. No attendant is going to walk off-property to retrieve your car, especially since most car washes are drive-up service. Which is rather the point.
Part of a properly functioning LLM is absolutely understanding implicit instructions. A huge aspect of data annotation work in helping LLMs become better tools is grading them on whether they understand implicit instructions. I would say more than half of the work I have done in that arena has focused on training them to more clearly understand implicit instructions.
So sure, if you explain it like the LLM is a five-year-old human, you'll get a better response. But the whole point is: if we're dumping so much money and so many resources into these tools, and wrecking the environment and the consumer electronics market for them, you shouldn't have to explain it like it's five.
Seriously, what is the point of trashing the planet for this shit if you have to talk to it like it's the most oblivious person alive and practically hold its hand for it to understand implicit concepts?