Hi!
Currently, I am working at Microsoft as a Senior Researcher in the Azure Systems Research Group in Redmond, WA.
My focus is on the efficient deployment of LLMs in the cloud. I work at both the model-serving level and the backend infrastructure and hardware level.
Most up-to-date publication list: Here