Hindsight Experience Replay (HER)
A data augmentation technique for goal-conditioned RL that relabels failed trajectories, substituting a state the agent actually reached for the original goal. If the robot tried to place a block at position A but ended at position B, HER stores an additional copy of the trajectory in which B is the goal, so the relabeled copy counts as a success. Because even a failed episode then produces some reward signal, this substantially improves sample efficiency on sparse-reward goal-reaching tasks.
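
The relabeling step can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the `Transition` type, `sparse_reward`, `relabel_with_final_state`, the 2D positions, and the 0 / -1 reward convention are all assumptions introduced here, and only the simplest relabeling choice (using the episode's final reached state as the new goal) is shown.

```python
from typing import Callable, List, NamedTuple, Tuple

Point = Tuple[float, float]


class Transition(NamedTuple):
    """One goal-conditioned step (hypothetical layout, for illustration only)."""
    state: Point
    action: str
    next_state: Point
    goal: Point
    reward: float


def sparse_reward(achieved: Point, goal: Point, tol: float = 1e-6) -> float:
    """Assumed sparse reward: 0.0 when the goal is reached, -1.0 otherwise."""
    dist_sq = sum((a - g) ** 2 for a, g in zip(achieved, goal))
    return 0.0 if dist_sq <= tol ** 2 else -1.0


def relabel_with_final_state(
    episode: List[Transition],
    reward_fn: Callable[[Point, Point], float] = sparse_reward,
) -> List[Transition]:
    """Return a copy of the episode whose goal is the state it actually ended in."""
    if not episode:
        return []
    hindsight_goal = episode[-1].next_state
    return [
        t._replace(goal=hindsight_goal,
                   reward=reward_fn(t.next_state, hindsight_goal))
        for t in episode
    ]


# The robot aimed for A but the block ended up at B.
A: Point = (1.0, 1.0)
B: Point = (0.5, 0.0)
path = [(0.0, 0.0), (0.25, 0.0), B]

episode = [
    Transition(s, "push", s_next, A, sparse_reward(s_next, A))
    for s, s_next in zip(path, path[1:])
]

# Store both the original (failed) and the relabeled (successful) transitions.
replay_buffer = episode + relabel_with_final_state(episode)
```

In this sketch the original transitions keep goal A and never receive a non-negative reward, while the relabeled copies carry goal B and end with a reward of 0.0. An off-policy learner sampling from `replay_buffer` would therefore see at least one successful outcome per episode.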