Application of machine learning based on habitat imaging and vision transformer to predict treatment response of locally advanced esophageal squamous cell carcinoma following neoadjuvant chemoimmunotherapy: a multi-center study