Search: "Adaptive Stochastic Control"
Showing results 21 - 22 of 22 dissertations containing the words Adaptive Stochastic Control.
21. Online Combinatorial Optimization under Bandit Feedback
Abstract: Multi-Armed Bandits (MAB) constitute the most fundamental model for sequential decision-making problems with an exploration vs. exploitation trade-off. In such problems, the decision maker selects an arm in each round and observes a realization of the corresponding unknown reward distribution.
22. A study of wireless communications with reinforcement learning
Abstract: The explosive proliferation of mobile users and wireless data traffic in recent years poses imminent challenges for wireless system design. The trend for wireless communications to become more complicated, decentralized, and intelligent is inevitable.