Text this: A lightweight approach to two-person interaction classification in sparse image sequences