Attention Mechanisms deep-learning deep-learning attention transformer 1 min read Understanding attention in neural networks This is a sample note. Replace with your content. Topics Self-attention Multi-head attention Scaled dot-product attention Related Notes in DEEP-LEARNING Attention Mechanisms CNNs Activation Functions Optimization & Training Inference & Model Compression Loss Functions Optimizers PyTorch Lightning Regularization Learning Rate Schedulers