this post was submitted on 03 Oct 2024
1 points (100.0% liked)

Singularity

131 readers
3 users here now

Everything pertaining to the technological singularity and related topics, e.g. AI, human enhancement, etc.

founded 1 year ago
MODERATORS
 
This is an automated archive made by the Lemmit Bot.

The original was posted on /r/singularity by /u/InTheDarknesBindThem on 2024-10-03 19:30:58+00:00.


"We measured progress with over 20 automated internal evaluations. We used novel synthetic data generation techniques, such as distilling outputs from OpenAI o1-preview, to post-train the model for its core behaviors. This approach allowed us to rapidly address writing quality and new user interactions, all without relying on human-generated data."

Please correct me if Im wrong but; if im reading this right they were able to use o1-preview in the place of where they used to use humans for fine tuning responses and getting key behaviors to work in post training.

Or, in short, AI fine tuning AI (under supervision).

no comments (yet)
sorted by: hot top controversial new old
there doesn't seem to be anything here