Open-source Large Language Models Optimization

This chalk talk delves into optimizing and deploying large language models (LLMs) at scale. Explore large model hosting, optimization techniques, model partitioning, batch processing, and model fine-tuning.

Quick Info

Conference

HKOSCon 2024

Event Type

Talk

Venue

MWT1

Is Topic

Yes

Generative AI

LLM

AGI

Fri, 07/05/2024 - 18:00 - Fri, 07/05/2024 - 18:15

Content

Language

English

Level

Advanced

Target Audience

Developer, Power User, General User

Audience Requriement

They should learn basic machine learning, model training and fie-tuning knowledge.

Speaker

Haowen Huang

Haowen Huang is senior evangelist at Amazon Web Services, based in Hong Kong. He has more than 20 years of experience in architecture design, technology, and startup management across the telecommunications, internet, and cloud computing industries. Additionally, he has worked for renowned companies like Microsoft, Sun Microsystems, and China Telecom. His current research interests include generative AI, large language models (LLMs), machine learning, and data science.

Background

Country / Region

Hong Kong

Affiliations

Amazon

Internal

Is Remote Presentation

false