Text this: Efficient top-k string similarity query algorithms