Text this: Efficient and assured reinforcement learning-based building HVAC control with heterogeneous expert-guided training