Privileged Information
State information available during training (in simulation) but not at deployment time. Examples: exact object poses, contact forces, friction coefficients. Teacher policies trained with privileged information achieve high performance; they are then distilled into student policies that use only deployment-available inputs (images, proprioception). This asymmetric training paradigm is standard in sim-to-real.