Hacker News new | ask | show | jobs
by deeplstm 1188 days ago
Blog: https://crfm.stanford.edu/2023/03/13/alpaca.html

Demo: https://alpaca-ai0.ngrok.io/

Video summary: https://youtu.be/6qdzsDSduww

Text summary

Alpaca is a solution to NLP centralization proposed by Stanford researchers

Alpaca learns from GPT-3.5/ChatGPT to generate data for training

Base model: Llama (7 billion parameters)

Training cost: 3 hours on an A100 GPU*8 (< $100) Can be used with various models (sequence-to-sequence)

Usage: Primarily for academic research, commercial use prohibited

Future directions: Scaling models, properties of base models and instruction data, alternatives to supervised training on ChatGPT data