lol we just unlocked a new paradigm, guaranteed we don’t hit a plateau for at least another two years. considering it looks like we’re probably already on the verge of one or two paradigm shifts on top of that, no real reason to anticipate a plateau in the immediate future regardless.
for now we might be able to 10x our language data, but the top quality content has already been used
beyond that I think synthetic data will rule; it needs to be validated or filtered somehow; I think we need to use agents and RL to make it high quality
lol we just unlocked a new paradigm, guaranteed we don’t hit a plateau for at least another two years. considering it looks like we’re probably already on the verge of one or two paradigm shifts on top of that, no real reason to anticipate a plateau in the immediate future regardless.
for now we might be able to 10x our language data, but the top quality content has already been used
beyond that I think synthetic data will rule; it needs to be validated or filtered somehow; I think we need to use agents and RL to make it high quality