Multi-modal alignment via hyperbolic geometry