Straightforwardly eliciting probabilities from GPT-3

NunoSempere

Straightforwardly eliciting probabilities from GPT-3

Comments 5

Sorted by

New & upvoted

aog

Some papers you might like if you haven't seen them yet:

This paper uses your two methods, as well as mapping verbal descriptions to probabilities
Anthropic investigation of language model calibration. Interesting techniques include asking the model whether its answer was correct, using temperature scaling to restore calibration after RLHF, and training models to be better calibrated.
Foundational overview of calibration in ML models, advocates temperature scaling
Paper showing calibration often suffers under distribution shift of the dataset
Forecasting benchmark for language models

NunoSempere

Nice, thanks

alexlyzhov

I tried the Anthropic model on this dataset with roughly your prompt and it's much better in terms of KL divergence between its predictions and Manifold probabilities. Giving it 10 web search results in a prompt further improves the performance. But the difference no search -> search is smaller compared to GPT-3 -> Anthropic, I'd say mainly because of unhelpful search results.

Eevee🔹

Are those negative numbers logits?

NunoSempere

I think logits are usually log(1/(1-p)), but I think that those negative numbers are just log(p).

Comments

Straightforwardly eliciting probabilities from GPT-3

Straightforwardly eliciting probabilities from GPT-3

Straightforward strategies

Look at the probability of yes/no completion

Have the model output the probability verbally

More elaborate strategies

Various templates, and choosing the template depending on the type of question

GPT consulting GPT

Query and interact with the internet.

Fine-tune the model on good worked examples of forecasting reasoning

Parting thoughts

Acknowledgements