Text this: Optimizing Subchannel Assignment and Power Allocation for Network Slicing in High-Density NOMA Networks: A Q-Learning Approach