GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors