In particular, "good, aligned, conversational AI" is just one of many possible different rollouts. Finetuning / alignment tries to "collapse" and control the entropy to that region of the simulator. Jailbreak prompts try to knock the state into other logprob ravines.
— Andrej Karpathy (@karpathy) 6 mars 2023
see also: lesswrong
idea: quantifies average level of uncertainty information associated with a variable’s potential states or possible outcome.
definition
Given a discrete random variable , which takes a value in a set distributed according to , the entropy is defined as
Base 2 gives unit of “bits” (or “shannons”), while natural base gives “natural units” (or “nat”), and base 10 gives unit of “dits” (or “bans”, or “hartleys”)
joint
measure of uncertainty associated with a set of variables
namely, the joint Shannon entropy,
joint Shannon entropy
in bits, of two discrete random variable and with images and is defined as:
where is the joint probability of both and occurring together.
For more than two random variables, this expands onto: