this post was submitted on 16 Oct 2024
1 points (100.0% liked)

Singularity

131 readers
1 users here now

Everything pertaining to the technological singularity and related topics, e.g. AI, human enhancement, etc.

founded 1 year ago
MODERATORS
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/Gothsim10 on 2024-10-16 10:35:13+00:00.

Original Title: Engineers at Fireworks AI have successfully ported FireAttention to AMD MI300s, resulting in 80% more throughput and 60% faster latency than NIM on Nvidia H100s. With these improvements, FireAttention V3 enables AMD MI300 to become a viable alternative for GPU inference.

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here