• 0 Posts
  • 3 Comments
Joined 1 year ago
Cake day: October 30th, 2023

  • A large technical disadvantage: I think we need a new type of precision cutting tool to extract and recognize shapes inside tensor weight images

    Why do you think we need this? To me, it just indicates that the structure of Stable Diffusion is designed for real-world photos, artwork, and diagrams, and ill-suited for predicting the weights of an LLM.

    the poc shows today’s models can predict new weights without training and without entity extraction/ml and within 13-30 seconds the output is not dramatically horrible vs the original source weights.

    Are you sure the output isn’t dramatically horrible? To me the predicted weight images look nothing like the original weight images. The fine detail is completely different.

    But it doesn’t even matter how it looks to human eyes. What matters is, when a new model is constructed from the predicted weights, whether that model makes mostly-correct predictions.
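
    To make that concrete, here is a rough sketch of the kind of check I mean. It is not the PoC's code: the tiny linear classifier, the random data, and the function names `prediction_agreement` and `relative_weight_error` are all made up for illustration. It just measures how often a model built from predicted weights picks the same output as the original, alongside a plain weight-space error.

```python
import numpy as np


def prediction_agreement(w_original, w_predicted, inputs):
    """Fraction of inputs on which two linear classifiers pick the same class."""
    labels_original = np.argmax(inputs @ w_original, axis=1)
    labels_predicted = np.argmax(inputs @ w_predicted, axis=1)
    return float(np.mean(labels_original == labels_predicted))


def relative_weight_error(w_original, w_predicted):
    """Frobenius norm of the weight difference, relative to the original weights."""
    return float(np.linalg.norm(w_predicted - w_original) / np.linalg.norm(w_original))


# Hypothetical stand-ins: a 16-feature, 4-class linear model and random test inputs.
rng = np.random.default_rng(0)
w_original = rng.normal(size=(16, 4))
inputs = rng.normal(size=(1000, 16))

# Two pretend "predicted" weight sets: one nearly identical, one unrelated.
w_close = w_original + 0.01 * rng.normal(size=w_original.shape)
w_unrelated = rng.normal(size=w_original.shape)

for name, w in [("close", w_close), ("unrelated", w_unrelated)]:
    print(
        name,
        "weight error:", round(relative_weight_error(w_original, w), 3),
        "agreement:", round(prediction_agreement(w_original, w, inputs), 3),
    )
```

    For an actual LLM you would swap the toy classifier for the real model and compare something like next-token predictions or perplexity on held-out text, but the idea is the same: judge the predicted weights by what the rebuilt model does, not by how the weight image looks.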