[4] D. Eck, “Finding Downbeats with a Relaxation Oscillator,” Psychological
Research, vol. 66, no. 1, pp. 18–25, 2002.
[5] F. A. Gers, J. Perez-Ortiz, D. Eck and J. Schmidhuber, “DEKF-LSTM,” in
Proc. 10th European Symposium on Artificial Neural Networks,
ESANN 2002, 2002.
[6] F. A. Gers and J. Schmidhuber, “Recurrent Nets that Time and Count,” in
Proc. IJCNN’2000, Int. Joint Conf. on Neural Networks, Como, Italy,
2000.
[7] F. A. Gers and J. Schmidhuber, “LSTM recurrent networks learn simple con-
text free and context sensitive languages,” IEEE Transactions on Neural
Networks, vol. 12, no. 6, pp. 1333–1340, 2001.
[8] F. A. Gers, J. Schmidhuber and F. Cummins, “Learning to Forget: Continual
Prediction with LSTM,” Neural Computation, vol. 12, no. 10, pp. 2451–
2471, 2000.
[9] S. Hochreiter, “ Untersuchungen zu dynamischen neuronalen Netzen. Diploma
thesis, Institut f¨ur Informatik, Lehrstuhl Prof. Brauer, Technische Uni-
versit¨at M¨unchen,” 1991, See http://ni.cs.tu-berlin.de/∼hochreit/papers/-
hochreiter.dipl.ps.gz.
[10] S. Hochreiter, Y. Bengio, P. Frasconi and J. Schmidhuber, “Gradient flow in
recurrent nets: the difficulty of learning long-term dependencies,” in S. C.
Kremer and J. F. Kolen (eds.), A Field Guide to Dynamical Recurrent
Neural Networks, IEEE Press, 2001.
[11] M. Joost and W. Schiffmann, “Speeding up backpropagation algorithms by
using cross-entropy combined with pattern normalization,” International
Journal of Uncertainty, Fuzziness and Knowledge-Based Systems,
vol. 6, no. 2, pp. 117–126, 1998.
[12] B. Laden and D. H. Keefe, “The representation of pitch in a neural net model
of chord classification,” Computer Music Journal, vol. 13, no. 4, pp. 44–53,
1989.
[13] M. C. Mozer, “Neural network composition by prediction: Exploring the bene-
fits of psychophysical constraints and multiscale processing,” Cognitive Sci-
ence, vol. 6, pp. 247–280, 1994.
[14] J. A. P´erez-Ortiz, F. A. Gers, D. Eck and J. Schmidhuber, “Kalman filters
improve LSTM network performance in problems unsolvable by traditional
recurrent nets,” Neural Networks, 2002, In press.
[15] D. C. Plaut, S. J. Nowlan and G. E. Hinton, “Experiments on learning back
propagation,” Techn. Report CMU–CS–86–126, Carnegie–Mellon Univer-
sity, Pittsburgh, PA, 1986.
[16] A. J. Robinson and F. Fallside, “The Utility Driven Dynamic Error Propaga-
tion Network,” Techn. Report CUED/F-INFENG/TR.1, Cambridge Uni-
versity Engineering Department, 1987.
[17] R. N. Shepard, “Geometrical approximations to the structure of pitch,” Psy-
chological Review, vol. 89, pp. 305–333, 1982.
[18] C. Stevens and J. Wiles, “Representations of Tonal Music: A Case study
in the development of temporal relationship,” in M. Mozer, P. Smolensky,
D. Touretsky, J. Elman and A. S. Weigend (eds.), Proceedings of the 1993
Connectionist Models Summer School, Hillsdale, NJ: Erlbaum, pp. 228–
235, 1994.
[19] P. M. Todd, “A connectionist approach to algorithmic composition,” Com-
puter Music Journal, vol. 13, no. 4, pp. 27–43, 1989.
10