this post was submitted on 18 Dec 2024
42 points (95.7% liked)

Futurology

1886 readers
14 users here now

founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] Espiritdescali@futurology.today 4 points 3 weeks ago (1 children)

Do these open source models have access to the same volume of training data that the commercial models have?

[–] BaylorSwift3@futurology.today 2 points 3 weeks ago* (last edited 3 weeks ago)

Paradoxically their approach is to use less training data. HF are saying they have reverse engineered some of the capabilities of OpenAI’s o1 model, by using an approach called 'Test-time compute scaling' which OpenAI have acknowledged using, but not disclosed exactly how.

https://the-decoder.com/study-shows-test-time-compute-scaling-is-a-path-to-better-ai-systems/