Cover Image for Shaping the Future of AI from the History of Transformer: A UPenn Lecture | Sponsored by Turing
Cover Image for Shaping the Future of AI from the History of Transformer: A UPenn Lecture | Sponsored by Turing
Avatar for Turing Events
Presented by
Turing Events
Unleashing the world's untapped human potential to accelerate AGI. Solving the human intelligence bottleneck with genAI products and solutions.
Hosted By
36 Went

Shaping the Future of AI from the History of Transformer: A UPenn Lecture | Sponsored by Turing

Zoom
Registration
Past Event
Welcome! To join the event, please register below.
About Event

​You are invited to join a special virtual stream of Prof. Mayur Naik’s CIS 7000 course on Large Language Models at the University of Pennsylvania, sponsored by Turing.

Abstract: AI is developing at such an overwhelming pace that it is hard to keep up. Instead of spending all our energy catching up with the latest development, I argue that we should study the change itself. First step is to identify and understand the driving force behind the change. For AI, it is the exponentially cheaper compute and associated scaling. I will provide a highly-opinionated view on the early history of Transformer architectures, focusing on what motivated each development and how each became less relevant with more compute. This analysis will help us connect the past and present in a unified perspective, which in turn makes it more manageable to project where the field is heading.

Bio: Hyung Won Chung is a research scientist at OpenAI. His recent work focuses on o1. He has worked on various aspects of Large Language Models: pre-training, instruction fine-tuning, reinforcement learning with human feedback, reasoning, multilinguality, parallelism strategies, etc. Some of the notable work includes scaling Flan paper (Flan-T5, Flan-PaLM) and T5X, the training framework used to train the PaLM language model. Before OpenAI, he was at Google Brain and before that he received a PhD from MIT.

​Learn more about the course at CIS 7000 - Large Language Models.

Avatar for Turing Events
Presented by
Turing Events
Unleashing the world's untapped human potential to accelerate AGI. Solving the human intelligence bottleneck with genAI products and solutions.
Hosted By
36 Went