Large Language Models (LLMs)
Articles
LLMs can explain the neurons of other LLMs - by OpenAI
Model size vs. compute overhead - examines the trade-off between model size and compute overhead, showing there is significant room to reduce the compute-optimal model size with only a minimal compute overhead (see the sketch below).
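A minimal sketch of that trade-off, assuming the Chinchilla parametric loss L(N, D) = E + A/N^alpha + B/D^beta from Hoffmann et al. (2022) and the usual C ≈ 6·N·D FLOPs approximation. The constants are the published Chinchilla fits and the compute budget is a made-up example, so treat the numbers as illustrative only.

```python
# Illustrative only: how much extra compute a smaller-than-optimal model needs
# to reach the same loss, under the Chinchilla parametric loss
# L(N, D) = E + A/N**alpha + B/D**beta and C ~= 6*N*D FLOPs.
# Constants are the fits reported by Hoffmann et al. (2022); the budget below
# is a made-up example.
E, A, B, alpha, beta = 1.69, 406.4, 410.7, 0.34, 0.28

def loss(N, D):
    return E + A / N**alpha + B / D**beta

def compute_optimal(C):
    """Closed-form (N, D) minimising the loss subject to 6*N*D = C."""
    N = (alpha * A / (beta * B)) ** (1 / (alpha + beta)) * (C / 6) ** (beta / (alpha + beta))
    return N, C / (6 * N)

C = 6 * 30e9 * 600e9                      # e.g. the FLOPs of 30B params x 600B tokens
N_opt, D_opt = compute_optimal(C)
target = loss(N_opt, D_opt)

# Shrink the model, solve L(N, D) = target for D, and report the compute overhead.
for k in (0.75, 0.50, 0.33):
    N = k * N_opt
    D = (B / (target - E - A / N**alpha)) ** (1 / beta)
    print(f"{k:.2f}x params -> {6 * N * D / C - 1:+.0%} extra compute")
```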
Papers
Improving Language Understanding by Generative Pre-Training - Alec Radford et al., OpenAI
LLMs are few-shot learners - scaling LLMs with enough data is sufficient to make them few-shot learners.
Models
Databricks dolly
Vicuna
LLaMA
Instructor
Instructor model - "We introduce Instructor👨🏫, an instruction-finetuned text embedding model that can generate text embeddings tailored to any task (e.g., classification, retrieval, clustering, text evaluation, etc.) and domains (e.g., science, finance, etc.) by simply providing the task instruction, without any finetuning. Instructor achieves sota on 70 diverse embedding tasks!"
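A short usage sketch for INSTRUCTOR via the InstructorEmbedding package; the model name, instructions, and sentences below are illustrative, not taken from the paper.

```python
# Task-specific embeddings with INSTRUCTOR
# (`pip install InstructorEmbedding sentence-transformers`).
from InstructorEmbedding import INSTRUCTOR

model = INSTRUCTOR("hkunlp/instructor-large")

# Each input is an [instruction, text] pair; the instruction tailors the
# embedding to the downstream task (here, retrieval over science documents).
pairs = [
    ["Represent the science document for retrieval:",
     "Transformers use self-attention to model token interactions."],
    ["Represent the science question for retrieving supporting documents:",
     "How do transformers capture long-range dependencies?"],
]
embeddings = model.encode(pairs)
print(embeddings.shape)  # (2, 768) for instructor-large
```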
Datasets
Tools
An amazing YouTube tutorial by Patrick Loeber covering the core LangChain concepts (see the short sketch after this list):
LLMs
Prompt Templates
Chains
Agents and Tools
Memory
Document Loaders
Indexes
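A minimal, hedged LangChain sketch touching a few of the topics above (an LLM, a prompt template, a chain, and memory). The imports follow the pre-1.0 langchain package; newer releases moved these classes into langchain-core / langchain-openai, so adjust to your installed version.

```python
# Minimal LangChain example: LLM + prompt template + chain + memory.
# Requires `pip install langchain openai` (pre-1.0 API) and OPENAI_API_KEY.
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain
from langchain.memory import ConversationBufferMemory

llm = OpenAI(temperature=0)  # reads OPENAI_API_KEY from the environment

prompt = PromptTemplate(
    input_variables=["history", "question"],
    template="Conversation so far:\n{history}\nQuestion: {question}\nAnswer:",
)

chain = LLMChain(
    llm=llm,
    prompt=prompt,
    memory=ConversationBufferMemory(memory_key="history"),  # fills {history}
)

print(chain.run(question="What is a prompt template?"))
```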
LangFlow (Medium, HuggingFace) - a UI for LangChain, designed with react-flow to provide an effortless way to experiment with and prototype flows.
PandasAI - ask questions of a pandas DataFrame using LLMs in two lines of code: `pandas_ai = PandasAI(llm)` and `pandas_ai.run(df, prompt='Which are the 5 happiest countries?')`.
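The same two-line pattern as a runnable sketch, assuming an early pandasai release (later versions replaced PandasAI with a SmartDataframe wrapper); the DataFrame is made-up demo data.

```python
# Minimal PandasAI example (`pip install pandasai`, early API).
import pandas as pd
from pandasai import PandasAI
from pandasai.llm.openai import OpenAI

# Made-up demo data.
df = pd.DataFrame({
    "country": ["Finland", "Denmark", "Iceland", "Israel", "Netherlands", "Sweden"],
    "happiness_index": [7.8, 7.6, 7.5, 7.4, 7.4, 7.3],
})

llm = OpenAI()                 # uses OPENAI_API_KEY from the environment
pandas_ai = PandasAI(llm)
print(pandas_ai.run(df, prompt="Which are the 5 happiest countries?"))
```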
LlamaIndex - LlamaIndex (formerly GPT Index) provides a central interface to connect your LLMs with external data.
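A hedged LlamaIndex sketch: load local documents and query them through an LLM. Class names follow the 0.9-era llama-index package (newer releases import them from llama_index.core, older ones used GPTSimpleVectorIndex); ./data is a placeholder path.

```python
# Index a folder of local documents and query it with an LLM.
# Requires `pip install llama-index` and an OpenAI key for the default LLM.
from llama_index import VectorStoreIndex, SimpleDirectoryReader

documents = SimpleDirectoryReader("./data").load_data()   # placeholder path
index = VectorStoreIndex.from_documents(documents)

query_engine = index.as_query_engine()
response = query_engine.query("Summarize the key points of these documents.")
print(response)
```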
LLM-foundry - LLM training code for Databricks foundation models, using MosaicML.
MinGPT - A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
NanoGPT - The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-sourced code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Guardrails
Databricks Guardrails - Implementing LLM Guardrails for Safe and Responsible Generative AI Deployment on Databricks
Best Practices
Reinforcement Learning for LLM
RLHF: Reinforcement Learning from Human Feedback by Chip Huyen
John Schulman - Reinforcement Learning from Human Feedback: Progress and Challenges
Metrics
Understanding ROUGE - a family of metrics that evaluate the performance of an LLM at text summarization: ROUGE-1, ROUGE-2, and ROUGE-L measure overlap of unigrams, bigrams, and the longest common subsequence (LCS), respectively.
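A quick example with Google's rouge-score package; the reference and candidate strings are made up.

```python
# Compute ROUGE-1/2/L between a reference and a candidate summary
# (`pip install rouge-score`).
from rouge_score import rouge_scorer

reference = "the cat sat on the mat"
candidate = "the cat lay on the mat"

scorer = rouge_scorer.RougeScorer(["rouge1", "rouge2", "rougeL"], use_stemmer=True)
scores = scorer.score(reference, candidate)

for name, result in scores.items():
    print(f"{name}: precision={result.precision:.2f} "
          f"recall={result.recall:.2f} f1={result.fmeasure:.2f}")
```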
Use Cases