• 0 Posts
  • 4 Comments
Joined 1 year ago
cake
Cake day: October 30th, 2023

help-circle
  • There is no real logic in how these models were divided throughout the merge

    I’m kind of cautious how random merging affects the overall quality, since many of these merges models were trained with different prompt formats. In my experience that would inevitably lead to AI outputs that attempt some gibberish by adding bits of other used prompt formats (e.g. “### Response:” being printed out while using the ChatML template). To my surprise I witnessed that with OpenHermes 2.5 in some edge cases. But I would be eager to hear other people’s experience on this.