Efficient k-mer based curation of raw sequence data: application in Drosophila suzukii