"Addition is All You Need for Energy-efficient Language Models" https://arxiv.org/abs/2410.00907
previous discussion: https://news.ycombinator.com/item?id=41784591