Background: I'm trying to build an interface on my portal for users to choose an LLM (like Falcon, DeepSeek, etc. from Hugging Face), which will then run a script to download and deploy that particular LLM in Azure.
Once it is deployed, users will use those LLMs to build apps. Deploying the custom LLM in the user/client cloud environment is mandatory, since data security policies are in play.
If anyone has worked on such a script or has an idea, please share your inputs.
While I have not tried this in Azure, my understanding is that you can deploy a Linux VM with an A100 in Azure (a T4 or V100 may not work for all use cases, but would be a cheaper option). Once you have a Linux VM with a GPU, you can choose how you would like to host the model(s). You can write some code and expose the LLM via an API (I like FastChat, but there are other options as well). Heck, you can even use ooba (oobabooga's text-generation-webui) if you like. Just make sure to check the license for whatever you use.
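Since the goal is a script that deploys per-user, the VM provisioning step above could be sketched with the Azure CLI roughly like this. All names (resource group, VM name, region, admin user) are placeholders, and the `Standard_NC24ads_A100_v4` size is just one A100 SKU; check quota availability for the NC A100 v4-series in the target subscription first.

```shell
# Sketch: provision a GPU Linux VM for LLM hosting (names/region are placeholders).
az group create --name llm-rg --location eastus

# NC24ads_A100_v4 = 1x A100 80GB; swap in an NCasT4_v3 size for a cheaper T4 box.
az vm create \
  --resource-group llm-rg \
  --name llm-vm \
  --image Ubuntu2204 \
  --size Standard_NC24ads_A100_v4 \
  --admin-username azureuser \
  --generate-ssh-keys

# Open the port the model API will listen on (8000 here is an assumption).
az vm open-port --resource-group llm-rg --name llm-vm --port 8000
```

You'd still need to install NVIDIA drivers/CUDA on the VM (or start from a GPU-ready image such as an NVIDIA/Data Science VM offer from the marketplace) before the model will run.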
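For the hosting side, FastChat can expose the downloaded model behind an OpenAI-compatible HTTP API, which makes it easy for the users' apps to consume. A rough sketch, run on the VM itself (the Falcon model path is just an example; the model name in the request should match whatever was loaded):

```shell
# Install FastChat with model-worker support.
pip install "fschat[model_worker]"

# Start the three FastChat processes: controller, a worker that loads the
# model from Hugging Face, and the OpenAI-compatible API front end.
python3 -m fastchat.serve.controller &
python3 -m fastchat.serve.model_worker --model-path tiiuae/falcon-7b-instruct &
python3 -m fastchat.serve.openai_api_server --host 0.0.0.0 --port 8000

# Client apps can then call it like any OpenAI-style endpoint:
curl http://localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "falcon-7b-instruct",
       "messages": [{"role": "user", "content": "Hello"}]}'
```

In production you'd want the processes under systemd (or in containers) rather than backgrounded, and TLS/auth in front of the endpoint, given the data-security requirement.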