ChatGPT Health 'under-triaged' half of medical emergencies in a new study

MicroWave@lemmy.world · 16 days ago

ChatGPT Health 'under-triaged' half of medical emergencies in a new study

CorrectAlias@piefed.blahaj.zone · 16 days ago

Compared with the doctors in the study, the bot also over-triaged 64.8% of nonurgent cases, recommending a doctor’s appointment when it wasn’t necessary.

So it goes both ways. Almost like it’s an LLM, not intelligent, and is non-deterministic because all LLMs function that way. Maybe we shouldn’t have every part of society reliant on something like this?

Kairos@lemmy.today · 16 days ago

LLMs are very deterministic

Nate Cox@programming.dev · 16 days ago

You keep using that word. I do not think it means what you think it means.

fartsparkles@lemmy.world · edit-2 16 days ago

deleted by creator

jacksilver@lemmy.world · 16 days ago

I mean, that’s kinda like saying a random number generator can be deterministic. It can be, but that’s not how it’s used.

Sure LLMs can be deterministic, but they aren’t in practice cause it makes the results worse. If you prompt any production LLM with the same inputs, you aren’t guaranteed the same outputs.

SaveTheTuaHawk@lemmy.ca · 15 days ago

If you prompt any production LLM with the same inputs, you aren’t guaranteed the same outputs.

If you promp MDs with the same inputs, you aren’t guaranteed the same outputs. If you prompt the same MD early and late in a busy shift, you aren’t guaranteed the same outputs.

The reality is over 790,000 people a year die from medical diagnostic errors.

Kairos@lemmy.today · 16 days ago

LLMs like all computer software is deterministic. It has a stable output for all inputs. LLMs as users use them have random parameters inserted to make it act nondeterministically if you assume this random info is nondeterministic.

jacksilver@lemmy.world · 16 days ago

You’re being down voted because LLMs aren’t deterministic, it’s basically the biggest issue in productizing them. LLMs have a setting called “temperature” that is used to randomize the next token selection process meaning LLMs are inherently not deterministic.

If you se the temperature to 0, then it will produce consistent results, but the “quality” of output drops significantly.

Kairos@lemmy.today · 16 days ago

If you give whatever random data source it uses the same seed, it will output the same thing.

nate3d@lemmy.world · 16 days ago

So question then, what parameter controls deterministic results for an LLM?

Pieisawesome@lemmy.dbzer0.com · 15 days ago

It’s the temperature. If you set it to 0, no randomness is introduced.

Of course it impairs the llm substantially, but you CAN get deterministic results.

Kairos@lemmy.today · 16 days ago

I honestly dont know. I think all that matters is the token window and a random seed used foe a random weighted choice.

nate3d@lemmy.world · 16 days ago

I encourage you to do some additional research on LLMs and the underlying mathematical models before making statements on incorrect information

The answer to this question was Temperature. It’s one of the many hyperparameters available to the engineer loading the model. Begin with looking into the difference between hyperparameters and parameters, as they relate to LLMs.

I’m one of the contributors to the LIDA cognitive architecture. This is my space and I want to help people learn so we can begin to use this technology as was intended - not all this marketing wank.

Nate Cox@programming.dev · 16 days ago

Listen, this is going to sound like a loaded inflammatory question and I don’t really know how to fix that over text, but you say you’re in the space and I’m genuinely curious as to your take on this:

Do you think it’s possible to build LLM technology in a way that:

Respects copyright and ip,
Doesn’t fuck up the economy and eat all the ram,
Doesn’t drink all the water and subject people to Datacenter hell, and
is consistently accurate and has enough data to be useful?

nate3d@lemmy.world · edit-2 16 days ago

No. And I’ve lost my voice describing why this is the case - LLMs do not use training data in real time which is indicative of the fact that their reasoning chains are learned over many training epochs rather than something akin to a search engine which is parsing and aggregating results from direct sources. I wish I had a different answer but that is simply how the mathematics behind this kind of machine learning model work. The only way to properly manage it would be to limit and license the data appropriately during core model training, but that genie is out of the bottle.
We will eventually (soon hopefully) hit critical mass where the technology isn’t delivering value on the hardware it takes to run it. The limitations, like I detailed above, are core to the technology and are not something that we’re just around the corner from solving. Those are core limitations and a different technology will be needed to move the ball forward past what is essentially a calculator with words. When this happens, we’ll see a whiplash effect where a ton of (server) hardware hits the market from the small datacenters looking to capitalize on the current rush. It’ll cripple the market for new hardware, I’d expect, as they’re going to want to get that capital back ASAP as it’s a quickly deprecating asset if just sitting idle.
Similar to above, the current trajectory isn’t going to last. It’s going to hurt once the reality finally sets in for the economy.
Oh yes, and it’s already been there for years! Unfortunately, these applications are not the glamorous applications like a “Her”-style chat companion, but rather precise application of specific machine learning models for specific business needs. I.e. do you really need an LLM to upload a picture to ask what kind of cat is in the picture? NO! That’s what convolutional neural networks are for, or maybe some custom vision transformers. There are dozens of types of ML models that have clear applications and with fine tuning and proper process implementation, the models can produce production-ready results as any other means of solving this issue.

The core problem with this technology is the misuse/misunderstanding that:

AI does not yet exist. Full stop.
An LLM is just ONE TYPE of machine learning algorithm
An LLM does not possess the ability to understand OR interpret intent
An LLM CAN NOT THINK This is the point I can’t stress enough; the “thinking” models you see today are doing nothing much more than cramming additional data into it’s working context and hoping that this guides the inference to produce a higher-quality result. Once a model is loaded for inference (i.e. asking questions) it is a STATIC entity and does not change.

Thank you for coming to my autistic TED talk <3

Edit: Also, fantastic question and never apologize for wanting to learn; keep that hunger and run with it

Kairos@lemmy.today · 16 days ago

Not who you asked but

Yes. Public domain only IG.
Small
Small
No. Not while being 1.

chicken@lemmy.dbzer0.com · 16 days ago

Showing that someone hasn’t answered your quiz question correctly isn’t a great way to make an argument.

nate3d@lemmy.world · 16 days ago

You’ve missed the point - I was responding to someone answering in an authoritative manner about something of which they were mis-informed. I posed a question someone in the space would immediately know. The disappointing part is simply pasting my question into any search engine or LLM would immediately have said “Temperature.”

This is a perfect example of how we’re using our brain less and less and simply relying on “something” else to answer it for us. Do your research. Learn and teach.

CorrectAlias@piefed.blahaj.zone · edit-2 16 days ago

Sure, but not always, which means they can’t be considered completely deterministic. If you input the same text into an LLM, there’s a high chance that you’ll get a different output. This is due to a lot of factors, but LLMs hallucinate because of it.

Medical care is something where I would not ever use an LLM. Sure, doctors can come to different results, too, but at least they can explain their logic. LLMs are unable to do this at any real level.

Pieisawesome@lemmy.dbzer0.com · 15 days ago

But you can use th temperature to get non random, deterministic results.

If you self host a llm, you can definitely get the exact same answer each time, but the user query has to be exactly the same…

Kairos@lemmy.today · 16 days ago

The tech itself is deterministic like all other computer software. The provider just adds randomness. Additionally, it is only deterministic over the whole context exactly. Asking twice is different than once, and saying “black man” in the place of “white woman” is also different.

CorrectAlias@piefed.blahaj.zone · 16 days ago

I’m acutely aware that it’s computer software, however, LLMs are unique in that they have what you’re calling “randomness”. This randomness is not entirely predicitible, and the results are non-deterministic. The fact that they’re mathematical models doesn’t really matter because of the added “randomness”.

You can ask the same exact question in two different sessions and get different results. I didn’t mean to ask twice in a row, I thought that was clear.

Kairos@lemmy.today · 15 days ago

If you use the same random data source the results are deterministic. Same thing with user inputs/timing of them.

CorrectAlias@piefed.blahaj.zone · 15 days ago

I don’t know what else to say, because you can literally test this yourself and get non-deterministic results.