Text this: Similarity-Based Summarization of Music Files for Support Vector Machines