I came across this new fine-tuned model based on OpenChat 3.5, which was apparently trained using Reinforcement Learning from AI Feedback (RLAIF).
https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha
Check out this tweet: https://twitter.com/bindureddy/status/1729253715549602071
“Close to GPT4” is as true as “Me, close to Usain Bolt in the 100m dash” lol
Nope, the research and proof are here: it's not the parameters but the quality of the data that's the way, my brotha