Text this: Multi-branch feature learning based speech emotion recognition using SCAR-NET