diamond_jackie07@alien.top to Machine Learning@academy.garden • [Project] LLM inference with vLLM and AMD: Achieving LLM inference parity with Nvidia
1 year ago
I tried this on my config (Ryzen 9 7950X + MI210) and got Throughput: 129 requests/min, 1028.89 tokens/s on llama2-7b, which is even better than the performance they cite in the post.
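For anyone who wants to reproduce something similar, here is a minimal sketch of measuring throughput with vLLM's Python API. The model name, prompt set, and sampling settings are my assumptions for illustration; the post's actual benchmark script may differ, so exact numbers will too:

```python
# Minimal throughput sketch with vLLM's Python API.
# Assumes a ROCm build of vLLM and that the meta-llama/Llama-2-7b-hf
# weights are available; this is a toy workload, not the post's benchmark.
import time

from vllm import LLM, SamplingParams

prompts = ["Explain paged attention in one paragraph."] * 128  # toy request batch
sampling_params = SamplingParams(temperature=0.8, max_tokens=128)

llm = LLM(model="meta-llama/Llama-2-7b-hf")

start = time.perf_counter()
outputs = llm.generate(prompts, sampling_params)
elapsed = time.perf_counter() - start

# Count only generated tokens, then report in the same units as above.
total_tokens = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"Throughput: {len(prompts) / elapsed * 60:.0f} requests/min, "
      f"{total_tokens / elapsed:.2f} tokens/s")
```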
Will report back on 13b performance ASAP.