Text this: A Study on Systematic Improvement of Transformer Models for Object Pose Estimation