Prompt Engineering Tricks: Poor man's RLHF

- Have user indicate when model is correct
- Store associated (input, output) in embedding index
- At inference time, retrieve nearest K previous inputs
- Put these top K (inputs, output) pairs into context as few-shot examples
- Works like a charm ✨
