🦙 Large Language Model

Concepts

A large language model (LLM) is a type of artificial intelligence (AI) algorithm that uses deep learning techniques and very large data sets to understand, summarize, generate, and predict new content. LLMs are trained on immense amounts of data and use self-supervised learning to predict the next token in a sentence, given the surrounding context. Once an LLM has been trained, it can be fine-tuned for a wide range of NLP tasks, including generating text, classifying text, and answering questions.
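To make "predict the next token" concrete, here is a minimal sketch that counts which word follows which in a tiny corpus and predicts the most frequent follower. It is a toy bigram counter, not a neural network, and real LLMs condition on far more context than one word. The function names `train_bigram_counts` and `predict_next` and the corpus are made up for illustration. The point it shows is the self-supervised part: the training labels (the next word) come from the text itself, with no manual annotation.

```python
from collections import Counter, defaultdict

def train_bigram_counts(corpus):
    """Count, for every token, which tokens follow it in the corpus."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.lower().split()
        # Each (previous, next) pair is a training example taken from the raw text.
        for prev, nxt in zip(tokens, tokens[1:]):
            counts[prev][nxt] += 1
    return counts

def predict_next(counts, token):
    """Return the most frequent follower of `token`, or None if it was never seen."""
    followers = counts.get(token.lower())
    if not followers:
        return None
    return followers.most_common(1)[0][0]

# Hypothetical toy corpus, for illustration only.
corpus = [
    "the model predicts the next token",
    "the model learns from text",
    "a token depends on context",
]
counts = train_bigram_counts(corpus)
print(predict_next(counts, "the"))   # model
print(predict_next(counts, "next"))  # token
```

An LLM replaces the count table with a deep neural network and the single previous word with a long context window, but the training signal is the same kind of next-token target.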

Stop Words

Stop words are commonly used words in a language, such as "the", "is", and "and". In natural language processing (NLP) and text mining applications, stop words are filtered out because they are so widely used that they carry very little useful information, which lets the application focus on the important words instead.
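Stop-word filtering can be sketched in a few lines. The example below assumes a tiny hand-written stop-word list (`STOP_WORDS`) and a simple regex tokenizer; both are illustrative choices, and NLP libraries ship much larger, language-specific lists.

```python
import re

# Hypothetical, deliberately tiny stop-word list for illustration only.
STOP_WORDS = {"a", "an", "and", "are", "in", "is", "it", "of", "on", "that", "the", "to"}

def remove_stop_words(text):
    """Lowercase the text, split it into word tokens, and drop stop words."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return [token for token in tokens if token not in STOP_WORDS]

print(remove_stop_words("The cat is sitting on the mat in the kitchen"))
# ['cat', 'sitting', 'mat', 'kitchen']
```

Whether to remove stop words depends on the task: it helps keyword-style search and text mining, while LLMs are usually trained on the full text, stop words included.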

LLMs

LLaMA

LLaMA is a large language model (LLM) developed by Meta (formerly Facebook). It was announced in February 2023, and its weights were leaked online in March 2023. LLaMA is trained on a massive dataset of text and code. It can generate text, translate languages, write different kinds of creative content, and answer questions in an informative way.

Chinchilla

Chinchilla is a large language model developed by DeepMind. It is not available to the public.

It is trained on a massive dataset of text and code, and it can generate text, translate languages, write different kinds of creative content, and answer questions in an informative way.

Megatron-Turing NLG

Megatron-Turing NLG is a large language model developed by Microsoft and NVIDIA. It is not open source: access requires filling out a form on the NVIDIA website, and I do not have NVIDIA hardware.

It is trained on a massive dataset of text and code.

Jurassic-1 Jumbo

Jurassic-1 Jumbo is a large language model developed by AI21 Labs. It is not open source; it is offered through AI21 Labs' hosted API. It is trained on a massive dataset of text.

Libraries

ChatLlama

ChatLlama is a library that lets you efficiently fine-tune LLMs on your own data with as little compute as possible.

Langchain

LangChain is a framework for building applications on top of LLMs. Large language models are emerging as a transformative technology, enabling developers to build applications that they previously could not.

More open-source models are available on Hugging Face (this is the best way to get them):

https://github.com/huggingface/transformers#model-architectures
