Category: NLP

Attention to LLM Architectures: An Intelligent ML Engineering Guide

The article surveys the architectures used in modern large language models (LLMs): Encoder-Decoder, Encoder-Only, and Decoder-Only. It describes what each one does and where it is typically used. While the simpler Decoder-Only models like GPT have achieved excellent results, the article suggests that the choice of model should depend on the end application and…

9th Oct 2023

Not-So-Large Language Models: Good Data Overthrows the Goliath

In this article, we will see how language models (LMs) can rely on better data and training strategies, rather than sheer size, to achieve LLM-like results (sometimes even better ones), and how people are already doing so successfully and democratically.

20th Aug 2023
