In Reinforcement Learning, an agent learns to interact with an environment by taking actions and receiving rewards or penalties that depend on those actions. The IoT simplifies and automates tasks that are challenging and at times beyond the scope of human capabilities. The number of connected devices making up the IoT currently ranges from the bil