azure ai ai-rl ai-ml
Facebook ReAgent - An End-to-End Use Case

Facebook decided to release their end-to-end applied reinforcement learning platform called ReAgent. After reading their vision on this, I…

Xavier Geerinck

ai ai-ml ai-rl
Facebook's Open-Source Reinforcement Learning Platform - A Deep Dive

Facebook decided to open-source the platform that they created to solve end-to-end Reinforcement Learning problems at the scale they are…

Xavier Geerinck

ai ai-ml ai-rl
Reinforcement Learning - Terminology

Tags: Machine Learning, Reinforcement Learning, Q Learning, Artificial Intelligence, Monte Carlo. Reinforcement Learning - An Overview of Today…

Xavier Geerinck

ai ai-ml ai-rl
Writing a C# SDK for the OpenAI Gym using .NET Core

When we take a look at the OpenAI Gym on GitHub (https://github.com/openai/gym-http-api), we see that it does not have bindings available…

Xavier Geerinck

ai ai-ml ai-rl
OpenAI Gym Problems - Solving the CartPole Gym

In a previous post we set up the OpenAI Gym to interface with our JavaScript environment. Let’s now look at how we can use this interface to…

Xavier Geerinck

coding coding-javascript ai-rl
Dividing numbers into equal buckets or bins through Bucketization

A common practice in Reinforcement Learning is to go from a continuous space towards a discrete space. What does this mean? Take for example…

Xavier Geerinck

ai ai-ml ai-rl
Bellman Equations

Summary: So what did we learn up until now in our Introduction and Markov Property, Chain, Reward Process and Decision Process posts…

Xavier Geerinck

ai ai-ml ai-rl
The Markov Property, Chain, Reward Process and Decision Process

As seen in the previous article, we now know the general concept of Reinforcement Learning. But how do we actually get towards solving our…

Xavier Geerinck

ai ai-ml ai-rl
How to run OpenAI Gym on Windows and with Javascript

Reinforcement learning not only requires a lot of knowledge about the subject to get started, it also requires a lot of tools to help…

Xavier Geerinck

ai ai-ml ai-rl
Multi-armed bandit framework

To start solving the problem of exploration, we are going to introduce the Multi-armed bandits framework. But what exactly does this solve…

Xavier Geerinck

ai ai-ml ai-rl
An introduction to Reinforcement Learning (RL)

So as we learned in the intro to Machine Learning, Reinforcement Learning is this technique where we have an agent who will take specific…

Xavier Geerinck

Xavier Geerinck © 2020
