Text this: Exploring similarity patterns in a large scientific corpus.