Text this: IoT-enhanced multi-attention and lightweight feature integration for human pose estimation in motion training systems