Training and Evaluating LLMs

Hayriye Anıl
13 min readJul 31, 2024

Hello and welcome to the second article of my blog series on Generative AI and Large Language Models (LLMs)!

In the first blog post of this series, I shared fundamental information about Generative AI and Large Language Models (LLMs).

In this post, I will discuss the training process of LLMs, fine-tuning and instruction fine-tuning. Additionally, I will cover how to evaluate these models and which metric scores to use for their assessment.

Let’s get start!

Image generated by the author with DALL·E

Training LLM

The most crucial step in any project is accurately defining its purpose and scope. Once you have determined how the LLM should function in practice, the next step is to select the model you will work with. Choosing the right size and architecture of the LLM is essential to fully realize the project’s scope. Key considerations for model selection include:

  • Foundation Model: A pretrained LLM
  • Train Your Own Model: A custom LLM

Pre-training

--

--