Text this: Generalizable and automated classification of TNM stage from pathology reports with external validation