4.6 3rd Party Support

MLPro provides wrapper classes that integrate widely-used 3rd-party packages into the MLPro interface and, conversely, expose MLPro components to those packages. The user is therefore free to select an environment and/or a policy either from a 3rd-party package or from native MLPro-RL, and can even combine an environment and a policy from different packages.

At the moment, we have five ready-to-use wrapper classes related to RL: three from 3rd-party packages to MLPro and two from MLPro to 3rd-party packages, as listed below:

No | Wrapper Class     | Origin           | Target     | Wrapped RL Components
---|-------------------|------------------|------------|---------------------------------------
 1 | WrEnvGYM2MLPro    | OpenAI Gym       | MLPro      | RL Environments
 2 | WrEnvMLPro2GYM    | MLPro            | OpenAI Gym | RL Environments
 3 | WrEnvPZOO2MLPro   | PettingZoo       | MLPro      | Multi-Agent RL Environments
 4 | WrEnvMLPro2PZoo   | MLPro            | PettingZoo | Multi-Agent RL Environments
 5 | WrPolicySB32MLPro | StableBaselines3 | MLPro      | Off-Policy and On-Policy RL Algorithms

Moreover, wrapper classes for hyperparameter tuning by Optuna and Hyperopt can also be incorporated into your RL training, as sketched below.
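A rough sketch of how such a tuner can be attached to a training run follows. Note that the class name WrHPTHyperopt, its module path, and the p_hpt / p_hpt_trials parameters are assumptions based on MLPro's wrapper naming scheme; please verify them against the corresponding how-to file.

from mlpro.wrappers.hyperopt import WrHPTHyperopt   # assumed module path and class name

# MyScenario is an RL scenario class as defined at the end of this section
training = RLTraining(
    p_scenario_cls=MyScenario,
    p_cycle_limit=1000,
    p_hpt=WrHPTHyperopt(p_logging=Log.C_LOG_ALL),    # assumed constructor signature
    p_hpt_trials=10,                                 # assumed parameter name
    p_logging=Log.C_LOG_WE)

training.run()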

RL Environment: OpenAI Gym to MLPro

This wrapper class converts an RL environment from OpenAI Gym to MLPro. The usage is simple and straightforward: the user calls the wrapper class while setting up the environment, as follows:

import gym
from mlpro.wrappers.openai_gym import WrEnvGYM2MLPro

# e.g. inside the _setup() method of an RL scenario:
p_gym_env = gym.make('CartPole-v1', new_step_api=True, render_mode=None)
self._env = WrEnvGYM2MLPro(p_gym_env, p_logging=True)
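Once wrapped, the object behaves like a native MLPro environment. A minimal sketch of inspecting it, using standard MLPro Environment methods and shown outside a scenario for brevity:

env = WrEnvGYM2MLPro(p_gym_env, p_logging=True)

env.reset()                      # MLPro-style reset
state = env.get_state()          # current state as an MLPro State object
print(state.get_values())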

For more information, please check the how-to file.

RL Environment: MLPro to OpenAI Gym

This wrapper class converts an RL environment from MLPro to OpenAI Gym. The user calls the wrapper class while setting up the environment, as follows:

from mlpro.bf.various import Log
from mlpro.rl.pool.envs.gridworld import GridWorld
from mlpro.wrappers.openai_gym import WrEnvMLPro2GYM

mlpro_env = GridWorld(p_logging=Log.C_LOG_ALL)
env = WrEnvMLPro2GYM(mlpro_env, p_state_space=None, p_action_space=None, p_new_step_api=True)
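The wrapped environment can then be driven like any other Gym environment. A minimal sketch using Gym's standard interaction loop (with the new step API enabled above, step() returns five values):

obs = env.reset()
for _ in range(10):
    action = env.action_space.sample()
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs = env.reset()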

For more information, please check the how-to file.

RL Environment: PettingZoo to MLPro

This wrapper class converts a multi-agent RL environment from PettingZoo to MLPro. The user calls the wrapper class while setting up the environment, as follows:

from pettingzoo.butterfly import pistonball_v6
from mlpro.wrappers.pettingzoo import WrEnvPZOO2MLPro

# e.g. inside the _setup() method of an RL scenario:
p_zoo_env = pistonball_v6.env()
self._env = WrEnvPZOO2MLPro(p_zoo_env, p_logging=True)
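As with the Gym wrapper above, the wrapped object exposes MLPro's Environment interface, presumably aggregating the spaces of all PettingZoo agents. A minimal sketch, shown outside a scenario for brevity:

env = WrEnvPZOO2MLPro(p_zoo_env, p_logging=True)

print(env.get_state_space().get_num_dim())    # MLPro state space across all agents
print(env.get_action_space().get_num_dim())   # MLPro action space across all agents
env.reset()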

For more information, please check the how-to file.

RL Environment: MLPro to PettingZoo

This wrapper class converts a native MLPro environment to a PettingZoo multi-agent environment. The user calls the wrapper class while setting up the environment, as follows:

from mlpro.bf.various import Log
from mlpro.rl.pool.envs.bglp import BGLP
from mlpro.wrappers.pettingzoo import WrEnvMLPro2PZoo

mlpro_env = BGLP(p_logging=Log.C_LOG_ALL)
env = WrEnvMLPro2PZoo(mlpro_env, p_num_agents=5, p_state_space=None, p_action_space=None).pzoo_env
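The returned object follows PettingZoo's AEC interface and can be driven with the standard agent iteration loop. A minimal sketch (depending on the installed PettingZooo version, last() returns four or five values; the four-value form of that era is shown):

env.reset()
for agent in env.agent_iter(max_iter=50):
    observation, reward, done, info = env.last()
    action = None if done else env.action_space(agent).sample()
    env.step(action)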

For more information, please check the how-to file.

RL Policy: StableBaselines3 to MLPro

This wrapper class converts an RL policy from StableBaselines3 to MLPro. The wrapper supports both the on-policy and off-policy algorithms from StableBaselines3. The user calls the wrapper class while setting up an agent, as follows:

import gym
from stable_baselines3 import PPO
from mlpro.rl.models import *                          # provides Scenario, Agent, etc.
from mlpro.wrappers.openai_gym import WrEnvGYM2MLPro
from mlpro.rl.wrappers import WrPolicySB32MLPro

class MyScenario(Scenario):

    C_NAME      = 'Matrix'

    def _setup(self, p_mode, p_ada, p_logging):
        # set up the environment: a Gym environment wrapped for MLPro
        gym_env     = gym.make('CartPole-v1')
        self._env   = WrEnvGYM2MLPro(gym_env, p_logging=False)

        # instantiate the SB3 policy; env and model setup are deferred to the wrapper
        policy_sb3 = PPO(
            policy="MlpPolicy",
            n_steps=5,
            env=None,
            _init_setup_model=False,
            device="cpu")

        # wrap the SB3 policy so it behaves like a native MLPro policy
        policy_wrapped = WrPolicySB32MLPro(
            p_sb3_policy=policy_sb3,
            p_cycle_limit=self._cycle_limit,
            p_observation_space=self._env.get_state_space(),
            p_action_space=self._env.get_action_space(),
            p_ada=p_ada,
            p_logging=p_logging)

        # return a single agent driven by the wrapped policy
        return Agent(
            p_policy=policy_wrapped,
            p_envmodel=None,
            p_name='Smith',
            p_ada=p_ada,
            p_logging=p_logging
        )
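The scenario can then be trained with MLPro's RL training class. A minimal sketch, assuming RLTraining with its usual parameters (see the corresponding how-to file for the exact signature):

training = RLTraining(
    p_scenario_cls=MyScenario,
    p_cycle_limit=1000,
    p_logging=Log.C_LOG_WE)    # Log from mlpro.bf.various

training.run()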

For more information, please check the how-to file.