See the difference between our kernel, hausdorff and sinkhorn loss functions:
Gradient flows in 1D
Gradient flows in 2D