Foundation models assist in human–robot collaboration assembly