this post was submitted on 01 Oct 2023
1134 points (97.6% liked)

Technology

59575 readers
3195 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] FooBarrington@lemmy.world 0 points 1 year ago* (last edited 1 year ago) (2 children)

While you are right about baseload being more satisfiable through nuclear, you are wrong that it's in any way important for AI model training. This is one of the best uses for solar energy: you train while you have lots of energy, and you pause training while you don't. Baseload is important for things that absolutely need to get done (e.g. powering machines in hospitals), or for things that have a high startup cost (e.g. furnaces). AI model training is the opposite of both, so baseload isn't relevant at all.

[–] eestileib@sh.itjust.works 2 points 1 year ago (1 children)

It's not life-critical but it is financially-critical to the company. You aren't going to build a project on the scale of a data center that is capable of running 24/7 and not run it as much as possible.

That equipment is expensive, and has a relatively short useful lifespan even if not running.

This is why tire factories and refineries run three shifts, this isn't a phenomenon unique to data centers.

[–] FooBarrington@lemmy.world 2 points 1 year ago

It's not life-critical but it is financially-critical to the company. You aren't going to build a project on the scale of a data center that is capable of running 24/7 and not run it as much as possible.

Sorry, but that's wrong. You'll run it as much as is profitable. If electricity cost goes up, there is a point where you'll stop running it, since it becomes too expensive. Even more so considering that AI models don't have a set goal to reach - you train them as long as you want and can, but training a little bit extra will have diminishing returns after a while.

That equipment is expensive, and has a relatively short useful lifespan even if not running.

Not really, the limiting factors in AI training are mostly supply of cards. The cards already in use will stay in use until they fail, they won't be replaced with newer cards the second they get released.

This is why tire factories and refineries run three shifts, this isn't a phenomenon unique to data centers.

This is comparing apples and oranges, since tire factories:

  • have long-term planning and production goals to reach

  • have employees who must be planned

  • have resource input costs that are higher than electricity

Of course you want the highest utilisation that you can economically reach, but a better comparison would be crypto mining - which also has expensive equipment that has a relatively short useful lifespan even if not running, and yet they stop mining when electricity is too expensive.

[–] guacupado@lemmy.world 0 points 1 year ago (1 children)

"And you pause training while you dont." lmao I don't know why people keep giving advice in spaces they've never worked in.

[–] FooBarrington@lemmy.world 1 points 1 year ago (1 children)

What are you trying to imply? That training Transformer models necessarily needs to be a continuous process? You know it's pretty easy to stop and continue training, right?

I don't know why people keep commenting in spaces they've never worked in.

[–] guacupado@lemmy.world 0 points 1 year ago* (last edited 1 year ago) (1 children)

No datacenter is shutting off of a leg, hall, row, or rack because "We have enough data, guys." Maybe at your university server room where CS majors are interning. These things are running 24/7/365 with UU tracking specifically to keep them up.

[–] FooBarrington@lemmy.world 1 points 1 year ago* (last edited 1 year ago)

What are you talking about? Who said anything close to "we have enough data, guys"?

Are you ok? You came in with a very snippy and completely wrong comment, and you're continuing with something completely random.