Text this: Mix-layers semantic extraction and multi-scale aggregation transformer for semantic segmentation