What are the components of the Attention Mechanism in a Transformer?
The Attention Mechanism includes three components: Query, Key, and Value. It computes attention scores by measuring the similarity between the query and key, then uses these scores to weight the values for the final output.