Contrastive and Adaptive Multi-modal Masked Autoencoder for Spatial Transcriptomics