Text this: A dimension reduction assisted credit scoring method for big data with categorical features