Practice Stochastic Bandits (9.9.2.1) - Reinforcement Learning and Bandits

Practice - Stochastic Bandits

Practice Questions

Test your understanding with targeted questions

Question 1 Easy

What is a stochastic bandit?

💡 Hint: Think about how you would make choices with uncertain outcomes.

Question 2 Easy

Define exploration in stochastic bandits.

💡 Hint: What would you do to gather more information about rewards?

Interactive Quizzes

Quick quizzes to reinforce your learning

Question 1

What defines a stochastic bandit?

A model with known rewards
A model with fixed unknown distributions
A model with continuous feedback

💡 Hint: Focus on what 'stochastic' implies about the rewards.

Question 2

True or False: Exploration is less important than exploitation.

True
False

💡 Hint: Think about why learning is necessary in uncertain environments.

Challenge Problems

Push your limits with advanced challenges

Challenge 1 Hard

Consider a scenario where you have three advertising options with unknown click-through rates. Explain how you would apply the ε-greedy strategy over ten trials.

💡 Hint: Think about how often you'd switch versus stick with the current best performer.
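One way to reason about this challenge is to simulate it. The sketch below is a minimal illustration, not a reference solution: it assumes each ad yields a click (reward 1) or no click (reward 0), and the click-through rates, the value of ε, and the function name `epsilon_greedy` are all hypothetical choices made for the example.

```python
import random

def epsilon_greedy(true_ctrs, n_trials=10, epsilon=0.2, seed=0):
    """Run epsilon-greedy on Bernoulli arms (assumed reward model).

    true_ctrs are hidden from the strategy; they are only used to
    simulate clicks. Returns (counts, estimates, history).
    """
    rng = random.Random(seed)
    n_arms = len(true_ctrs)
    counts = [0] * n_arms        # how many times each ad was shown
    estimates = [0.0] * n_arms   # running average click rate per ad
    history = []                 # (arm, reward) for every trial

    for _ in range(n_trials):
        if rng.random() < epsilon:
            # Explore: pick any ad uniformly at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the ad with the best estimate so far,
            # breaking ties at random.
            best = max(estimates)
            arm = rng.choice([i for i, v in enumerate(estimates) if v == best])

        # Simulate a click with the hidden probability of the chosen ad.
        reward = 1 if rng.random() < true_ctrs[arm] else 0

        # Incremental update of the running average for that ad.
        counts[arm] += 1
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        history.append((arm, reward))

    return counts, estimates, history

if __name__ == "__main__":
    # Hypothetical click-through rates for the three ads.
    counts, estimates, history = epsilon_greedy([0.05, 0.10, 0.20])
    for trial, (arm, reward) in enumerate(history, start=1):
        print(f"trial {trial}: showed ad {arm}, click={reward}")
    print("times shown:", counts)
    print("estimated CTRs:", estimates)
```

With only ten trials the estimates are very noisy, so the ad that looks best at the end is not guaranteed to be the truly best one. That limitation is worth mentioning in a written answer.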

Challenge 2 Hard

Create a simulation of a stochastic bandit with five arms in which the true mean reward of each arm is hidden from the learner. Explain how you would implement Thompson Sampling and which outcomes you would track.

💡 Hint: Consider how you would keep track of each arm's successes and failures, and how those counts affect the probability of choosing that arm in the future.
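A possible starting point for this challenge is sketched below. It is a minimal illustration under stated assumptions, not the expected answer: rewards are assumed to be Bernoulli (success or failure), each arm starts with a uniform Beta(1, 1) prior, and the arm means, the number of rounds, and the function name `thompson_sampling` are hypothetical.

```python
import random

def thompson_sampling(true_means, n_rounds=1000, seed=0):
    """Beta-Bernoulli Thompson Sampling (assumed reward model).

    true_means are hidden from the learner and only used to simulate
    rewards and to measure regret afterwards.
    """
    rng = random.Random(seed)
    n_arms = len(true_means)
    successes = [0] * n_arms   # observed rewards of 1 per arm
    failures = [0] * n_arms    # observed rewards of 0 per arm
    best_mean = max(true_means)
    total_reward = 0
    regret = 0.0               # cumulative expected regret

    for _ in range(n_rounds):
        # Draw one plausible mean per arm from its Beta posterior.
        samples = [
            rng.betavariate(1 + successes[i], 1 + failures[i])
            for i in range(n_arms)
        ]
        # Play the arm whose sampled mean is largest.
        arm = max(range(n_arms), key=lambda i: samples[i])

        # Simulate a Bernoulli reward from the hidden mean.
        reward = 1 if rng.random() < true_means[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

        total_reward += reward
        regret += best_mean - true_means[arm]

    pulls = [s + f for s, f in zip(successes, failures)]
    return {
        "pulls": pulls,
        "successes": successes,
        "failures": failures,
        "total_reward": total_reward,
        "regret": regret,
    }

if __name__ == "__main__":
    # Hypothetical hidden means for the five arms.
    result = thompson_sampling([0.10, 0.25, 0.40, 0.55, 0.70])
    for name, value in result.items():
        print(name, value)
```

The outcomes tracked here (pulls per arm, successes and failures, total reward, and cumulative regret) are one reasonable set. Plotting regret against the round number is a common way to check whether the sampler is concentrating on the better arms over time.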
