I came across this new finetuned model based on Openchat 3.5 which is apparently trained used Reinforcement Learning from AI Feedback (RLAIF).
https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha
Check out this tweet: https://twitter.com/bindureddy/status/1729253715549602071
It repeats itself and seems incapable of giving a response shorter than 200 words.
They forgot to include the tokenizer files from openchat 3.5, which caused some weirdness for me with new line characters among other things in the GGUF I got from TheBloke. The original repo has been fixed but I have yet to see a new GGUF.