84 - HPC Café on March 11, 2025: Large Language Models for Dummies/ClipID:56417 previous clip next clip

The automatic subtitles generated using Whisper Open AI in this video player (and in the Multistream video player) are provided for convenience and accessibility purposes. However, please note that accuracy and interpretation may vary. For more information, please refer to the FAQs (Paragraph 14).
Recording date 2025-03-19

Via

Free

Language

English

Organisational Unit

Zentrum für Nationales Hochleistungsrechnen Erlangen (NHR@FAU)

Producer

Zentrum für Nationales Hochleistungsrechnen Erlangen (NHR@FAU)

Topic: Large Language Models for Dummies

Speaker: Sebastian Wind, NHR@FAU

Slides

Abstract:
Large Language Models (LLMs) are revolutionizing the way we interact with artificial intelligence, and the open-source community plays a pivotal role in driving their accessibility and innovation. This talk delves into the inner workings of LLMs, exploring their foundational mechanisms and architectures. Additionally, we examine how these models can be efficiently trained on high-performance computing (HPC) systems, leveraging state-of-the-art scaling strategies and principles derived from scaling laws. By understanding these methodologies, attendees will gain valuable insights into the challenges and opportunities of developing and deploying LLMs in diverse computational environments.

Material from past events is available at: https://hpc.fau.de/teaching/hpc-cafe/

More clips in this category "Friedrich-Alexander-Universität Erlangen-Nürnberg Zentralbereich"

2025-04-15
IdM-login
protected  
2025-03-28
Free
public