Text this: Advancing ALS Applications with Large-Scale Pre-Training: Framework, Dataset, and Downstream Assessment