Speaker: Karin Sevegnani, Senior Solutions Architect, Nvidia
In this talk, we will explore the
intricate balance between computational resources, memory limitations,
and parameter-efficient fine-tuning techniques in large language models
(LLMs). We will analyse strategies to optimize the performance of LLMs
while managing these constraints effectively. From efficient memory
utilization to streamlined parameter fine-tuning methods, we will
discuss practical approaches to maximize the efficiency of LLMs without
sacrificing performance.
Presented at Best Practices in AI Afternoon event 2024-07-05
…Read more
Less…