Title: Training of Modern Large Language Models

Abstract: The talk will showcase the development of modern large language models such as the Falcon3 series of open models, with an emphasis on the specificities of training with knowledge distillation and up-scaling techniques, which allow significant pre-training cost reduction to build the models. We will also discuss models’ performances and capabilities by providing some insights on the main components to build large efficient language models.

Dates

March 1st, 2025 → March 15th, 2025

Abstract submission deadline

March 8th, 2025 → March 15th, 2025

Paper submission deadline

April 14th ,2025

Accept/Reject notification

May 21-23 ,2025

Netys Conference

Proceedings

Revised selected papers will be published as a post-proceedings in Springer's LNCS "Lecture Notes in Computer Science"

Partners & Sponsors (TBA)