An Alternative Approach to Building Generative AI Models

buildinstuff5432@alien.top · 3 years ago

An Alternative Approach to Building Generative AI Models

dqUu3QlS@alien.top · 3 years ago

A large technical disadvantage: I think we need a new type of precision cutting tool to extract and recognize shapes inside tensor weight images

Why do you think we need this? To me, it just indicates that the structure of Stable Diffusion is designed for real-world photos, artwork, and diagrams, and ill-suited for predicting the weights of an LLM.

the poc shows today’s models can predict new weights without training and without entity extraction/ml and within 13-30 seconds the output is are not dramatically horrible vs the original source weights.

Are you sure the output isn’t dramatically horrible? To me the predicted weight images look nothing like the original weight images. The fine detail is completely different.

But it doesn’t even matter how it looks to human eyes. What matters is, when a new model is constructed from the predicted weights, whether that model makes mostly-correct predictions.