I came across this new finetuned model based on Openchat 3.5 which is apparently trained used Reinforcement Learning from AI Feedback (RLAIF).
https://huggingface.co/berkeley-nest/Starling-LM-7B-alpha
Check out this tweet: https://twitter.com/bindureddy/status/1729253715549602071
Prompt:
Write a brief request (one paragraph) to a college professor asking for updates on funding for a project. Do not use the phrase “I hope this email finds you well” or similar.
Starling:
Subject: Request for Funding Updates on [Project Name]
Dear Professor [Last Name],
I hope you’re doing well. I am writing to inquire about the current status of funding for our project, [Project Name]. As you are aware, we have been working diligently on this project and it is crucial for us to receive updates on the financial aspect, particularly regarding any changes or new opportunities that may have arisen.
We understand that securing funding can be a complex process, and we appreciate your continuous efforts in this regard. Your support and guidance have been invaluable, and we eagerly await your response.
Thank you in advance for your attention to this matter, and we look forward to your prompt response.
Best regards,
[Your Name]
Hmm. Even llama-2-7b-chat doesn’t struggle with this one.
Subject: Request for Funding Update on [Project Name]
Dear Professor [Last Name],
I am writing to inquire about the progress of our research project, “[Project Name]”, which we began working on together earlier this year. As we continue to make significant advancements in our research, securing adequate funding remains crucial to ensure its successful completion. It would be greatly appreciated if you could update us on the current status of funding acquisition efforts and share any relevant news or developments. Additionally, if there are any upcoming deadlines or milestones that we should be aware of, kindly inform us so that we can plan accordingly. Your guidance and support throughout this process have been invaluable, and we look forward to receiving your response at your earliest convenience. Thank you very much for your attention to this matter.
Sincerely,
[Your Name]