Review

Attention Mechanisms and Their Applications to Complex Systems

Adrián Hernández et al. Entropy (Basel). 2021 Feb 26;23(3):283. doi: 10.3390/e23030283.
Abstract

Deep learning models and graphics processing units have completely transformed the field of machine learning. Recurrent neural networks and long short-term memory networks have been successfully used to model and predict complex systems. However, these classic models do not perform sequential reasoning, a process that guides a task based on perception and memory. In recent years, attention mechanisms have emerged as a promising solution to these problems. In this review, we describe the key aspects of attention mechanisms and some relevant attention techniques and point out why they are a remarkable advance in machine learning. Then, we illustrate some important applications of these techniques in the modeling of complex systems.

Keywords: attention; complex and dynamical systems; deep learning; neural networks; self-attention; sequential reasoning.
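
The abstract frames attention as a process in which a task (query) is guided by a set of elements (values) of a source or memory. As a purely illustrative aid, not code from the review, the following minimal NumPy sketch of scaled dot-product attention shows how alignment scores between a query and the keys are normalized into weights and used to form a weighted sum of the values; the dimensions and random inputs are assumptions for the toy example.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(query, keys, values):
    """Scaled dot-product attention (illustrative sketch).

    query:  (d,)     -- the current task/query vector
    keys:   (n, d)   -- one key per source element
    values: (n, d_v) -- one value per source element
    Returns the attention-weighted sum of the values and the weights.
    """
    d = keys.shape[-1]
    scores = keys @ query / np.sqrt(d)   # alignment scores, shape (n,)
    weights = softmax(scores)            # attention distribution over the source
    context = weights @ values           # weighted sum of values, shape (d_v,)
    return context, weights

# Toy example: a query attends over three source elements.
rng = np.random.default_rng(0)
q = rng.normal(size=4)
K = rng.normal(size=(3, 4))
V = rng.normal(size=(3, 4))
context, weights = attention(q, K, V)
print(weights)  # sums to 1; shows how much each source element contributes
```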


Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1. Multilayer neural network.
Figure 2. Temporal structure of a recurrent neural network.
Figure 3. Attention diagram. Attention as a sequential process of reasoning in which the task (query) is guided by a set of elements (values) of the source (or memory).
Figure 4. An encoder–decoder network.
Figure 5. An encoder–decoder network with attention.
Figure 6. A matrix of alignment scores. It represents how much of each input state should be considered when deciding the next state and generating the output.
Figure 7. Multi-headed attention. Self-attention process performed in parallel h times in different subspaces. The output values are concatenated and projected to a final value. (A minimal sketch of this process follows the figure list.)
Figure 8. Diagram of the input features attention mechanism.
Figure 9. Diagram of the temporal attention mechanism.
Figure 10. Basic diagram of a memory network. For each input, the attention mechanism integrates a weighted sum over the memory vectors.
Figure 11. Self-attention graph. The self-attention component calculates how much each input vector contributes to form each output vector.
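
Figures 7 and 11 describe self-attention performed in parallel over h subspaces, with the head outputs concatenated and projected to a final value. The sketch below is an illustrative NumPy rendering of that process, not an implementation from the review; the random projection matrices stand in for learned parameters, and the sequence length and dimensions are arbitrary assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_self_attention(X, heads, rng):
    """Self-attention run in parallel over `heads` subspaces (cf. Figure 7).

    X: (n, d) sequence of n input vectors of dimension d.
    Random projections stand in for learned parameters in this sketch.
    """
    n, d = X.shape
    assert d % heads == 0
    d_h = d // heads
    outputs = []
    for _ in range(heads):
        # Per-head projections into a lower-dimensional subspace.
        Wq = rng.normal(size=(d, d_h))
        Wk = rng.normal(size=(d, d_h))
        Wv = rng.normal(size=(d, d_h))
        Q, K, V = X @ Wq, X @ Wk, X @ Wv
        # (n, n): how much each input vector contributes to each output vector.
        weights = softmax(Q @ K.T / np.sqrt(d_h))
        outputs.append(weights @ V)  # (n, d_h)
    # Concatenate the head outputs and project to the final value.
    W_out = rng.normal(size=(d, d))
    return np.concatenate(outputs, axis=-1) @ W_out

rng = np.random.default_rng(1)
X = rng.normal(size=(5, 8))   # 5 input vectors of dimension 8
Y = multi_head_self_attention(X, heads=2, rng=rng)
print(Y.shape)                # (5, 8)
```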
