samim


Prompt Engineering Tricks - Poor man's RLHF

2023-01-24 20:21:50


Have the user indicate when the model's output is correct
Store the associated (input, output) pair in an embedding index
At inference time, retrieve the K nearest previous inputs
Put these top-K (input, output) pairs into the context as few-shot examples
Works like a charm ✨
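
The steps above can be sketched in a few lines of Python. This is only a rough illustration, not the original author's code: `embed` is a toy hashed bag-of-words stand-in for a real embedding model, and `FeedbackIndex`, `build_prompt`, and the "Input:/Output:" prompt layout are hypothetical names and choices made for this example.

```python
import math
import zlib
from collections import Counter

DIM = 256  # size of the toy embedding vector (arbitrary choice for this sketch)


def embed(text):
    """Toy stand-in for a real embedding model: hashed bag of words, L2-normalised."""
    vec = [0.0] * DIM
    for word, count in Counter(text.lower().split()).items():
        vec[zlib.crc32(word.encode("utf-8")) % DIM] += count
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


class FeedbackIndex:
    """Hypothetical in-memory index of (input, output) pairs the user approved."""

    def __init__(self):
        self.entries = []  # list of (embedding, input, output)

    def add(self, user_input, output):
        # Call this only when the user marks the model's output as correct.
        self.entries.append((embed(user_input), user_input, output))

    def nearest(self, query, k=3):
        # Brute-force cosine similarity (vectors are already normalised).
        q = embed(query)
        ranked = sorted(
            self.entries,
            key=lambda e: -sum(a * b for a, b in zip(q, e[0])),
        )
        return [(inp, out) for _, inp, out in ranked[:k]]


def build_prompt(index, query, k=3):
    """Put the top-K approved pairs into the context as few-shot examples."""
    shots = [f"Input: {i}\nOutput: {o}" for i, o in index.nearest(query, k)]
    shots.append(f"Input: {query}\nOutput:")
    return "\n\n".join(shots)


index = FeedbackIndex()
index.add("translate 'cat' to French", "chat")
index.add("translate 'dog' to French", "chien")
index.add("sum 2 and 3", "5")
print(build_prompt(index, "translate 'bird' to French", k=2))
```

In practice the toy `embed` would be swapped for a proper embedding model and the brute-force scan for a vector index; the flow of store-on-approval, retrieve, and prepend stays the same.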

(Source) #ML #Generative