« BackWriting an LLM from scratch, part 32d – Interventions: adding attention biasgilesthomas.comSubmitted by gpjt 14 hours ago