Deep Models and Optimization Conference Paper 2024

Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks

Author(s): Sieber, Jerome and Amo Alonso, Carmen and Didier, Alexandre and Zeilinger, Melanie and Orvieto, Antonio
Book Title: Proceedings of the Thirty-Eighth Annual Conference on Neural Information Processing Systems
Year: 2024
Month: October
Day: 01
Bibtex Type: Conference Paper (inproceedings)
Event Name: Thirty-Eighth Annual Conference on Neural Information Processing Systems
Event Place: Vancouver, Canada
State: Published
URL: https://arxiv.org/pdf/2405.15731
Organization: NeurIPS
Links:

BibTex

@inproceedings{UnderstandingSSMs2024,
  title = {Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks},
  booktitle = {Proceedings of the Thirty-Eighth Annual Conference on Neural Information Processing Systems},
  organization = {NeurIPS},
  month = oct,
  year = {2024},
  slug = {understandingssms2024},
  author = {Sieber, Jerome and Amo Alonso, Carmen and Didier, Alexandre and Zeilinger, Melanie and Orvieto, Antonio},
  url = {https://arxiv.org/pdf/2405.15731},
  month_numeric = {10}
}