HyperAIHyperAI

Command Palette

Search for a command to run...

Markov Decision Process

Date

2 years ago

Markov decision process (MDP) is used to describe dynamic systems with randomness and decision elements. It provides a mathematical framework model for decision makers to make decisions in a random environment, and provides effective mathematical tools for optimization problems in dynamic programming and reinforcement learning. MDP is useful for studying optimization problems solved by dynamic programming. It has been known since at least the 1950s and is used in many fields, including robotics, automation, economics, and manufacturing. 

Markov decision processes are an extension of Markov chains, with the addition of actions (allowing choices) and rewards (giving motivation).

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding
Ready-to-use GPUs
Best Pricing
Get Started

Hyper Newsletters

Subscribe to our latest updates
We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning
Powered by MailChimp
Markov Decision Process | Wiki | HyperAI