✍️ Author Biography
Shaobo Wang
🌍 American
📚 1 free book
A large language model (LLM) is a neural network trained on a vast amount of text for natural language processing tasks, especially language generation. LLMs can generate, summarize, translate and parse text in many contexts, and are a foundational technology behind modern chatbots. Biased or inaccurate training data can make an LLM's output less reliable.
As of 2024, the largest and most capable LLMs are all based on transformer architectures, which, according to the 2017 paper Attention Is Al…