Distributed learning algorithm with synchronized epochs for dynamic spectrum access in unknown environment using multi-user restless multi-armed bandit
Dynamic spectrum access using cognitive radio has many application areas, such as the smart grid, the Internet of Things, and various other device-to-device communication paradigms. In dynamic spectrum access, a user picks a desired channel out of N channels to transmit on during each time-slot. Thus, the user receives a random reward from a finite set