AI Seminar Series: Kajetan Schweighofer

Oct 11, 2025
Kajetan Schweighofer

Venue: TII Yas Auditorium

28th October 2025, 11:00 AM - 12:00 PM (GST)

Title: Beyond Attention: xLSTM Scales Competitively with Linear Time-Complexity
Abstract: This talk discusses recent advances in the understanding of scaling behavior for large language models (LLMs), comparing the attention-based Transformer architecture to the recurrent xLSTM architecture. We begin with a brief introduction to xLSTM and its relation to other recently proposed Transformer alternatives such as Mamba-2. Next, we examine results on the comparison of scaling behavior between Transformers and xLSTM by means of empirically determined scaling laws. These show that the xLSTM architecture is Pareto-dominant in terms of cross-entropy loss and compute budget: within the analyzed ranges, it is always possible to obtain an xLSTM model that is both better (lower loss) and cheaper (less compute). We then analyze how these architectures scale across different compute budgets and how compute-optimal models compare to each other on the language modeling task. This analysis is also extended to the practically relevant overtraining regime, showing that the established scaling laws remain stable even at high token-to-parameter ratios. Importantly, training scaling behavior is examined with respect to context length, a critical aspect when comparing Transformers, which scale quadratically in context length, to alternative architectures that scale linearly. The results show that the benefit of xLSTM over Transformers increases for larger context lengths. Finally, inference scaling behavior is analyzed, finding that xLSTM has both lower time to first token (latency) and lower step time (per-token generation time).
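To illustrate the context-length point in the abstract, the minimal sketch below compares how per-sequence compute grows with context length T for quadratic self-attention versus a linear-time recurrent layer. It is not taken from the talk: the model width, layer count, and rough FLOP counts (only the sequence-mixing terms, counting a multiply-add as two FLOPs) are illustrative assumptions, and the recurrent cost model simply assumes an O(d^2) per-token state update, in the spirit of xLSTM's matrix-memory cell.

```python
# Illustrative sketch (not the speaker's methodology): compare how the
# sequence-mixing compute of attention (O(T^2 * d) per layer) and of a
# linear-time recurrent layer (assumed O(T * d^2) per layer) grows with
# context length T. d_model and n_layers are made-up example values.

def attention_flops(T: int, d_model: int = 1024, n_layers: int = 24) -> float:
    """Rough FLOPs for the QK^T and attention-weighted V matmuls per sequence."""
    return n_layers * 4 * (T ** 2) * d_model  # two T x T x d matmuls, mult+add

def recurrent_flops(T: int, d_model: int = 1024, n_layers: int = 24) -> float:
    """Rough FLOPs for a per-token state update assumed to cost O(d_model^2)."""
    return n_layers * 4 * T * (d_model ** 2)

if __name__ == "__main__":
    for T in (1_024, 8_192, 65_536):
        ratio = attention_flops(T) / recurrent_flops(T)
        print(f"T={T:>6}: attention / recurrent FLOP ratio ≈ {ratio:.1f}")
```

Under these assumptions the ratio grows as T / d_model, so the relative advantage of a linear-time architecture widens as the context length increases, which is the qualitative trend the talk quantifies with empirical scaling laws.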
Bio: Kajetan Schweighofer is a final-year PhD student at Johannes Kepler University Linz, supervised by Prof. Sepp Hochreiter, the inventor of the LSTM. His PhD focuses on predictive uncertainty quantification for deep learning models. This extends to natural language generation, where uncertainty information is used for hallucination detection in LLMs. Recently, his work has expanded to studying the scaling properties of LLM architectures, including Transformer alternatives such as xLSTM. His research has been published at the major AI conferences NeurIPS, ICML, and ICLR. Kajetan is part of the ELLIS PhD Program, which fosters collaboration between centers of AI excellence across Europe, and he conducted a research stay at the ELLIS Alicante Foundation. Prior to his PhD, he obtained a Bachelor's and a Master's in Physics, as well as a Master's in Artificial Intelligence, from Johannes Kepler University Linz.