by deeplstm 1188 days ago

Text summary

Alpaca is an instruction-following language model from Stanford researchers, proposed as a response to the centralization of NLP

Alpaca is trained on instruction data generated by GPT-3.5/ChatGPT

Base model: LLaMA (7 billion parameters)

Training cost: 3 hours on 8 A100 GPUs (< $100)

Can be used with various models (sequence-to-sequence)
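
As a rough sanity check on the training-cost figure, the numbers in the summary (3 hours, 8 GPUs, under $100) imply a ceiling on the price per GPU-hour. This is a minimal sketch of that arithmetic; the variable names are illustrative, and the per-GPU-hour rate is derived from the summary's figures rather than taken from any cloud provider's price list.

```python
# Figures taken from the summary: 3 hours on 8 A100 GPUs, total under $100.
HOURS = 3
NUM_GPUS = 8
BUDGET_USD = 100  # upper bound stated in the summary

# Total compute consumed by the fine-tuning run.
gpu_hours = HOURS * NUM_GPUS

# Implied ceiling on the price per GPU-hour for the budget to hold.
max_rate = BUDGET_USD / gpu_hours

print(f"{gpu_hours} GPU-hours, implied rate under ${max_rate:.2f} per GPU-hour")
```

So the "< $100" claim holds whenever an A100 can be rented for less than about $4.17 per hour.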

Usage: Primarily for academic research; commercial use is prohibited

Future directions: Scaling models, properties of base models and instruction data, alternatives to supervised training on ChatGPT data