this post was submitted on 13 Aug 2023

388 points (74.4% liked)

Technology

63010 readers

3807 users here now

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each other!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed
Accounts 7 days and younger will have their posts automatically removed.

Approved Bots

founded 2 years ago

MODERATORS

L3s@lemmy.world

enu@lemmy.world

technopagan@lemmy.world

L4s@lemmy.world

388

ChatGPT In Trouble: OpenAI may go bankrupt by 2024, AI bot costs company $700,000 every day (www.firstpost.com)

submitted 2 years ago by balfrag@lemmy.world to c/technology@lemmy.world

206 comments fedilink hide all child comments

Repost from HN: https://news.ycombinator.com/item?id=37109394

you are viewing a single comment's thread
view the rest of the comments

[–] Billy_Gnosis@lemmy.world 51 points 2 years ago (2 children)

If AI was so great, it would find a solution to operate at fraction of the cost it does now

[–] Death_Equity@lemmy.world 70 points 2 years ago (5 children)

Wait, has anybody bothered to ask AI how to fix itself? How much Avocado testing does it do? Can AI pull itself up by its own boot partition, or does it expect the administrator to just give it everything?

[–] wizardbeard@lemmy.dbzer0.com 13 points 2 years ago (2 children)

Really says something that none of your responses yet seem to have caught that this was a joke.

[–] Death_Equity@lemmy.world 3 points 2 years ago

The upvotes vs comments shows the vocal minority is just doing vocal minority things.

[–] FaceDeer@kbin.social 2 points 2 years ago (1 children)

OP might have been intending it as a joke, but self-improvement is a very real subject of AI research so if that's the case he accidentally said something about a serious topic.

[–] Buddahriffic@lemmy.world 2 points 2 years ago (1 children)

It's an essential part of the idea of the technological singularity. An AI iterates itself and the systems it runs on, becoming more efficient, powerful, and effective at a rate that makes all of human progress up to that point look like nothing.

[–] MajorHavoc@lemmy.world 3 points 2 years ago (1 children)

While I'm inclined to believe the singularity is achievable, it's important to remember that there's no evidence today that it will ever be reached.

Our hope for it, and the good than can come with it, can't pull it into the realm of things we will see in our lifetimes. It could emerge soon, but it's at least as likely to stay science fiction for another millennia.

[–] Buddahriffic@lemmy.world 2 points 2 years ago

Yeah, when chat gpt 4 first came out, I thought we might be close. But as it's capabilities and limitations became more clear, it doesn't look like we're close at all. I mean, it's hard to say for sure since an LLM will just make up a part of an AI and maybe the other pieces are farther along but just not getting as much attention because there's value in not making those things public.

But as someone who works in one of the fields that would be involved in the technological singularity, no one really knows good ways to apply AI to the work we do and the best initiatives I've seen come out of the corporate drive to leverage AI aren't actually AI, but just smarter automation tools.

[–] vrighter@discuss.tchncs.de 6 points 2 years ago (1 children)

if we don't know, it doesn't know.

If we know, but there's no public text about it, it doesn't know either.

it is trained off of stuff that has already been written, and trained to emulate the statistical properties of those words. It cannot and will not tell us anything new

[–] FaceDeer@kbin.social 14 points 2 years ago (1 children)

That's not true. These models aren't just regurgitating text that they were trained on. They learn the patterns and concepts in that text, and they're able to use those to infer things that weren't explicitly present in the training data.

I read recently about some researchers who were experimenting with ChatGPT's ability to do basic arithmetic. It's not great at it, but it's definitely figured out some techniques that allow it to answer math problems that were not in its training set. It gets them wrong sometimes, but it's like a human doing math in its head rather than a calculator using rigorous algorithms so that's to be expected.

[–] vrighter@discuss.tchncs.de 4 points 2 years ago (2 children)

they learn statistical correlations between words. given the last 5000 (or however large the context is) words, and absolutely no other information besides that, what is the most likely word to appear next? It's a glorified order 5000 markov chain.

The reason it can "do" some math is that there are tons of examples in the training set using small numbers usually used as examples. it can do basic arithmetic because it has seen "2+2=4" and other examples with simple numbers like that. The studies used test basic arithmetic. The same things that it had millions of pre-worked examples of. And it still gets those wrong, with astonishing frequency. those studies aren't talking about asking it "what is the square root of pi" or stuff like that. but stuff such as "is 7 greater than 4?", "what is 10 + 3?", "is 97 prime?" stuff it has most definitely seen the answers to. ask it about some large prime, and it'll nay no, and be probably right, because most numbers are composite

[–] FaceDeer@kbin.social 3 points 2 years ago* (last edited 2 years ago) (1 children)

those studies aren’t talking about asking it “what is the square root of pi” or stuff like that. but stuff such as “is 7 greater than 4?”, “what is 10 + 3?”, “is 97 prime?” stuff it has most definitely seen the answers to.

No, they very explicitly checked to see whether the training set contains the literal math problem that they asked it for the answer to. ChatGPT is able to answer math questions that it has never seen before. I believe this is the article (though I had to go searching, it's been a while).

When people dismiss LLMs as "just prediction engines" they're really missing the point. Of course they're prediction engines, that's not in dispute. The question is about how they go about making those predictions. When I show you the string "18 + 10 =" you can predict what comes next, yes? Well, how did you predict it? Did you memorize that particular specific string, or have you developed heuristics for how to do simple addition problems when you see them?

[–] MajorHavoc@lemmy.world 0 points 2 years ago* (last edited 2 years ago) (1 children)

These things are currently infamously bad at math, though.

I won't argue that it'll never get there. I'm confident it will, - though with a lot more perl hacks than elegant emergence.

But today, these things have an astonishingly high 'appearance of intelligence' to 'incredible stupidity' ratio.

[–] FaceDeer@kbin.social 3 points 2 years ago (2 children)

Humans are also not particularly well known for their math skills. Ask a random stranger to do simple arithmetic in their head, with only a few seconds to think and no outside help, and I wouldn't expect particularly reliable results.

[–] vrighter@discuss.tchncs.de 1 points 2 years ago

however, people are not notoriously bad at the types of basic arithmetic they test for. every time I pay something with cash, I work out how much change I'm owed mentally, and so does the seller. I can count on one hand the number of times I've actually been given incorrect change throughout my entire lifetime. And when I did get wrong change, it was usually "oh, I thought you gave me €10 ínstead of €20". Meaning that they actually still did the math correctly.

No sane person will ever tell you 4 is bigger than 7. Yet llms sometimes get even this type of question wrong. They learn patterns, but not concepts. This is even simpler than basic arithmetic.

[–] MajorHavoc@lemmy.world 1 points 2 years ago

Haha. Fair point.

[–] MajorHavoc@lemmy.world 0 points 2 years ago* (last edited 2 years ago)

It gives me the giggles that folks speculating are getting more upvotes than your post that demonstrates actual understanding of the implementation details.

If I were the type to sell sizzle hype and snake oil, now would be the time to do it. The venture capitalists may have learned their lesson, but the general public haven't.

[–] discodoubloon@kbin.social 6 points 2 years ago

It doesn’t “know” anything. It can’t solve that problem. It’s trained on humans so it’s limited to what we have written down.

I love ChatGPT but if it’s creative it’s because you asked it the right questions and found an oblique answer yourself.

[–] Buffaloaf@lemmy.world 5 points 2 years ago (3 children)

I asked Bard how ChatGPT can fix itself, here's what it said:

ChatGPT can fix itself in a number of ways, including:

Retraining on a larger dataset. ChatGPT is trained on a massive dataset of text and code, but it can always be retrained on a larger dataset. This would help ChatGPT to learn new things and improve its performance.

Using a different algorithm. ChatGPT is currently trained using the GPT-3 algorithm, but there are other algorithms that could be used. For example, ChatGPT could be trained using the Megatron-Turing NLG algorithm, which is one of the most powerful language models in the world.

Using a combination of algorithms. ChatGPT could also be trained using a combination of algorithms. This would allow ChatGPT to benefit from the strengths of different algorithms.

Using a feedback loop. ChatGPT could be used to generate text, and then this text could be used to train ChatGPT. This would allow ChatGPT to learn from its own mistakes and improve its performance over time.

Using human feedback. ChatGPT could also be used to generate text, and then this text could be reviewed by humans. The human feedback could then be used to improve ChatGPT's performance.

[–] lagomorphlecture@lemm.ee 10 points 2 years ago (1 children)

"Using a feedback loop. ChatGPT could be used to generate text, and then this text could be used to train ChatGPT. This would allow ChatGPT to learn from its own mistakes and improve its performance over time."

So basically create its own Fox News and see how that goes.

[–] vrighter@discuss.tchncs.de 13 points 2 years ago (1 children)

this is widely known to very quickly destroy your model

[–] Buffaloaf@lemmy.world 4 points 2 years ago

Remember, this is Bard's advice for ChatGPT

[–] chaogomu@kbin.social 7 points 2 years ago (1 children)

The feedback loop is already happening, and is called model collapse.

It's not a good thing.

[–] FaceDeer@kbin.social 3 points 2 years ago

The full suggestion includes "This would allow ChatGPT to learn from its own mistakes", which implies that the text it generated would be evaluated and curated before being sent back into it for training. That, as well as including non-AI-generated text along with the AI generated stuff, should stop model collapse.

Model collapse is basically inbreeding, with similar causes and similar solutions. A little inbreeding is not inherently bad, indeed it's used frequently when you're trying to breed an organism to have specific desirable characteristics.

[–] FaceDeer@kbin.social 4 points 2 years ago

If having an AI tell researchers that they should base its next iteration off of Megatron isn't the plot of a Michael Bay Transformers movie already, it should have been.

[–] Zeth0s@lemmy.world 1 points 2 years ago

It's working on it. It's just a pretty difficult task to be better than 50 years of optimization by human computer scientists https://www.nature.com/articles/d41586-023-01883-4

[–] Zeth0s@lemmy.world 9 points 2 years ago* (last edited 2 years ago)

Deepmind is actually working on an AI that improve performances of low level programs. It started with improving sorting algorithm.

It's an RL algorithm.

Main issue is that everything takes time, and expectations on current AI are artificially inflated.

It will reach the point most are discussing now, it'll simply take a bit longer than people expect

Source: https://www.nature.com/articles/d41586-023-01883-4