Reinforcement learning for data scheduling in internet of things (IoT) networks