Learning gene regulation with cross-modal integration of observations and perturbations