DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

Publication
Workshop on Weight Symmetries in Deep Learning, ICML 2026