DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

Publication
arXiv
Neha Verma
PhD Student

I am a PhD student at the Johns Hopkins Center for Language and Speech Processing.