Transformers

Transformer architecture and variants

No notes yet.