Back To Top
Go back

AI & Advanced Computing

Antonio Orvieto

Principal Investigator, AI2050 Early Career Fellow

  • Year

    2024

  • Program

    AI2050

  • Institution

    Max Planck Institute, ELLIS Institute Tubingen

  • Location

    Germany

Antonio Orvieto is an independent group leader at the Max Planck Institute for Intelligent Systems and a Hector endowed principal investigator at the ELLIS Institute Tübingen, Germany. He holds a Ph.D. from ETH Zürich and spent time at Google Deepmind, Meta, MILA, INRIA Paris, and HILTI. His main area of expertise is optimization for Deep Learning and the design of neural networks for reasoning in complex sequential data. He has published in NeurIPS, ICML, ICLR, AISTATS, and CVPR; he organized the “Optimization for Data Science and Machine Learning” session at the International Conference on Continuous Optimization (ICCOPT) in 2022 and the Workshop on Next Generation of Sequence Modeling Architectures at ICML 2024.
In his research, Antonio strives to improve deep learning technologies by pioneering new architectures and training techniques grounded in theoretical knowledge. His work encompasses two main areas: designing innovative models capable of handling complex data and understanding the intricacies of large-scale training dynamics. Central to his studies is exploring innovative techniques for decoding patterns in sequential data, with implications in natural language processing, biology, neuroscience, and music generation. His LRU architecture is the basis for some of Google’s Gemma language model variants.

Schmidt Sciences
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.