Reinforcement & Decision-Making for Agentic AI (with Python) Training Course

This course delves into the core principles and practical implementation of reinforcement learning (RL) and sequential decision-making within agentic AI systems. Participants will gain the skills to design, train, and evaluate autonomous agents that interact dynamically with their environments, achieving long-term objectives through continuous learning and adaptation.

Offered as an instructor-led live training (available online or on-site), this programme is tailored for advanced engineers and researchers seeking to embed reinforcement learning and planning algorithms into agentic systems for applications in automation, robotics, and adaptive reasoning.

Upon completion of this training, participants will be equipped to:

Grasp the mathematical foundations underlying reinforcement learning and decision-making processes.
Implement essential RL algorithms, including DQN, PPO, and A3C, leveraging Python and PyTorch.
Model environments using OpenAI Gym (now maintained as Gymnasium) and engineer custom simulation scenarios.
Train, assess, and troubleshoot agents for both continuous and discrete control tasks.
Apply reinforcement learning methodologies to agentic AI use cases in robotics and planning.
Effectively balance exploration, exploitation, and safety constraints during real-world deployment.

Course Format

Instructor-led lectures complemented by live coding demonstrations.
Practical, hands-on exercises using open-source frameworks and simulation environments.
An applied project focused on integrating decision-making capabilities into an agentic AI system.

Customisation Options

To arrange a bespoke training session for this course, please contact us directly.

This course is available as onsite live training in South Africa or online live training.

Course Outline

Introduction to Reinforcement Learning and Agentic AI

Decision-making under uncertainty and sequential planning.
Core components of RL: agents, environments, states, and rewards.
The role of RL in adaptive and agentic AI systems.

Markov Decision Processes (MDPs)

Formal definition and properties of MDPs.
Value functions, Bellman equations, and dynamic programming.
Policy evaluation, improvement, and iteration.
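
As an illustration of the kind of material this module covers, the following is a minimal sketch of value iteration, which repeatedly applies the Bellman optimality backup until the value function stops changing. The three-state MDP, its transition table `P`, the action names, and the discount `GAMMA` are all hypothetical and chosen only to keep the example small; they are not taken from the course material.

```python
# Hypothetical 3-state MDP, invented for illustration only.
# P[s][a] is a list of (probability, next_state, reward) tuples.
P = {
    0: {"stay": [(1.0, 0, 0.0)], "go": [(0.9, 1, 0.0), (0.1, 0, 0.0)]},
    1: {"stay": [(1.0, 1, 0.0)], "go": [(0.9, 2, 1.0), (0.1, 1, 0.0)]},
    2: {"stay": [(1.0, 2, 0.0)], "go": [(1.0, 2, 0.0)]},  # absorbing goal state
}
GAMMA = 0.9  # discount factor (assumed value)


def q_value(V, s, a):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(tol=1e-8):
    """Apply the Bellman optimality backup until the largest change is below tol."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s in P:
            best = max(q_value(V, s, a) for a in P[s])
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < tol:
            break
    # Greedy policy with respect to the converged value function.
    policy = {s: max(P[s], key=lambda a: q_value(V, s, a)) for s in P}
    return V, policy


V, policy = value_iteration()
```

In this toy problem the greedy policy chooses "go" in states 0 and 1, since that is the only way to collect the single reward on the transition into state 2.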

Model-Free Reinforcement Learning

Monte Carlo and Temporal-Difference (TD) learning.
Q-learning and SARSA.
Hands-on: Implementing tabular RL methods in Python.
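
A tabular exercise of this kind might look like the sketch below: Q-learning with an ε-greedy behaviour policy on a small deterministic chain. The environment (`step`, `N_STATES`), the hyperparameter values, and the fixed seed are assumptions made for the example, not part of the course material.

```python
import random

N_STATES = 5      # states 0..4; reaching state 4 ends the episode (hypothetical task)
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Deterministic chain dynamics: reward 1.0 only on reaching the last state."""
    next_state = max(0, state - 1) if action == 0 else state + 1
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Epsilon-greedy behaviour policy with random tie-breaking.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                best = max(Q[state])
                action = rng.choice([a for a in ACTIONS if Q[state][a] == best])
            next_state, reward, done = step(state, action)
            # Q-learning target: bootstrap from the greedy value of the next state.
            target = reward if done else reward + gamma * max(Q[next_state])
            Q[state][action] += alpha * (target - Q[state][action])
            state = next_state
    return Q


Q = train()
greedy_policy = [max(ACTIONS, key=lambda a: Q[s][a]) for s in range(N_STATES - 1)]
```

Replacing `max(Q[next_state])` in the target with the value of the action actually taken next would turn this into SARSA, the on-policy counterpart listed above.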

Deep Reinforcement Learning

Integrating neural networks with RL for function approximation.
Deep Q-Networks (DQN) and experience replay.
Actor-Critic architectures and policy gradients.
Hands-on: Training an agent using DQN and PPO with Stable-Baselines3.
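
To make the experience replay idea concrete, here is a minimal, dependency-free sketch of a fixed-capacity replay buffer of the sort a DQN agent samples minibatches from. The class name `ReplayBuffer`, the `Transition` fields, and the sample data are illustrative assumptions; this is not the Stable-Baselines3 implementation.

```python
import random
from collections import deque, namedtuple

# One stored interaction step (field names are an assumption for this sketch).
Transition = namedtuple("Transition", "state action reward next_state done")


class ReplayBuffer:
    """Fixed-capacity FIFO buffer with uniform random sampling."""

    def __init__(self, capacity, seed=None):
        self.memory = deque(maxlen=capacity)  # oldest transitions are evicted first
        self.rng = random.Random(seed)

    def push(self, state, action, reward, next_state, done):
        self.memory.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        batch = self.rng.sample(list(self.memory), batch_size)
        # Transpose a list of transitions into one Transition of tuples,
        # so that batch.state, batch.action, ... are aligned columns.
        return Transition(*zip(*batch))

    def __len__(self):
        return len(self.memory)


buffer = ReplayBuffer(capacity=3, seed=0)
for t in range(5):
    buffer.push(t, 0, 1.0, t + 1, False)
batch = buffer.sample(2)
```

Sampling uniformly from past transitions breaks the correlation between consecutive steps, which is the main reason DQN trains on replayed rather than freshly collected data.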

Exploration Strategies and Reward Shaping

Balancing exploration versus exploitation (ε-greedy, UCB, entropy methods).
Designing reward functions and preventing unintended behaviours.
Reward shaping and curriculum learning.
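
Two of the exploration rules named above can be sketched in a few lines. The function names, the exploration constant `c`, and the sample inputs are assumptions for illustration; the UCB variant shown is the common form that adds a bonus of `c * sqrt(ln(t) / n_a)` to each action's value estimate.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a random action, otherwise the greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


def ucb(q_values, counts, c=2.0):
    """Upper-confidence-bound selection: prefer actions that are good or rarely tried."""
    for action, n in enumerate(counts):
        if n == 0:
            return action  # try every action once before using the bonus
    total = sum(counts)
    return max(
        range(len(q_values)),
        key=lambda a: q_values[a] + c * math.sqrt(math.log(total) / counts[a]),
    )


greedy_choice = epsilon_greedy([0.1, 0.9, 0.3], epsilon=0.0)
untried_choice = ucb([0.5, 0.5], counts=[0, 3])
rarely_tried_choice = ucb([0.5, 0.5], counts=[100, 1])
```

With equal value estimates, UCB selects the action that has been tried less often, whereas ε-greedy explores without regard to how much each action has already been sampled.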

Advanced Topics in RL and Decision-Making

Multi-agent reinforcement learning and cooperative strategies.
Hierarchical reinforcement learning and the options framework.
Offline RL and imitation learning for safer deployment.

Simulation Environments and Evaluation

Using OpenAI Gym (now maintained as Gymnasium) and custom environments.
Continuous versus discrete action spaces.
Metrics for assessing agent performance, stability, and sample efficiency.
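
A custom environment for this module might resemble the sketch below. To stay dependency-free it is a plain class that only mimics the Gymnasium-style `reset`/`step` return signature; a real environment would normally subclass the library's `Env` base class and declare its action and observation spaces. The `CorridorEnv` task, its reward values, and the `run_episode` helper are hypothetical.

```python
class CorridorEnv:
    """Hypothetical 1-D corridor: start at cell 0, reach the last cell to finish."""

    def __init__(self, length=5, max_steps=20):
        self.length = length
        self.max_steps = max_steps
        self.position = 0
        self.steps = 0

    def reset(self, seed=None):
        """Return (observation, info), following the Gymnasium-style convention."""
        self.position = 0
        self.steps = 0
        return self.position, {}

    def step(self, action):
        """Discrete action: 0 = left, 1 = right. Returns the usual 5-tuple."""
        move = 1 if action == 1 else -1
        self.position = min(self.length - 1, max(0, self.position + move))
        self.steps += 1
        terminated = self.position == self.length - 1                 # goal reached
        truncated = not terminated and self.steps >= self.max_steps   # time limit hit
        reward = 1.0 if terminated else -0.01                         # small step cost
        return self.position, reward, terminated, truncated, {}


def run_episode(env, policy):
    """Roll out one episode and return the undiscounted episodic return."""
    obs, _ = env.reset()
    total, done = 0.0, False
    while not done:
        obs, reward, terminated, truncated, _ = env.step(policy(obs))
        total += reward
        done = terminated or truncated
    return total


always_right = run_episode(CorridorEnv(), lambda obs: 1)
always_left = run_episode(CorridorEnv(), lambda obs: 0)
```

The episodic return computed by `run_episode` is the simplest performance metric; averaging it over many episodes and seeds gives a first view of stability and sample efficiency.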

Integrating RL into Agentic AI Systems

Combining reasoning and RL within hybrid agent architectures.
Integrating reinforcement learning with tool-using agents.
Operational considerations for scaling and deployment.

Capstone Project

Design and implement a reinforcement learning agent for a simulated task.
Analyse training performance and optimise hyperparameters.
Demonstrate adaptive behaviour and decision-making within an agentic context.

Summary and Next Steps

Requirements

Strong proficiency in Python programming.
A solid understanding of machine learning and deep learning concepts.
Familiarity with linear algebra, probability theory, and fundamental optimisation methods.

Target Audience

Reinforcement learning engineers and applied AI researchers.
Robotics and automation developers.
Engineering teams focused on developing adaptive and agentic AI systems.

28 Hours

Need help picking the right course?
southafrica@nobleprog.co.za or +27 (0)10 005 5793

Testimonials (3)

CLIFFORD TABARES - Universal Leaf Philippines, Inc.

Course - Agentic AI for Business Automation: Use Cases & Integration

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Agentic AI for Enterprise Applications

Ion Mironescu - Facultatea S.A.I.A.P.M.

Course - Autonomous Decision-Making with Agentic AI

Related Courses

Autonomous Decision-Making with Agentic AI

Understanding Agentic AI: Concepts and Capabilities

Agentic AI for Business Automation: Use Cases & Integration

Agentic AI for Enterprise Applications

Agentic AI and the Future of Work

Governance and Security Patterns for WrenAI in the Enterprise

Modernizing Legacy BI with WrenAI: Adoption, Migration, and Change Management

Quality and Observability for WrenAI: Evaluation, Prompt Tuning, and Monitoring

Building with the WrenAI API: Applications, Charts, and NL to SQL

WrenAI Cloud Essentials: From Data Sources to Dashboards

WrenAI for Financial Analytics: KPI Modeling and Regulatory-Aware Dashboards

WrenAI OSS Deep Dive: Semantic Modeling, Text to SQL, and Guardrails

WrenAI for Product Teams: Conversational Analytics and Self-Service BI

Deploying WrenAI for SaaS: Embedded GenBI in Customer-Facing Products

Operational Analytics with WrenAI Spreadsheets and Metrics Library

Related Categories

Agentic AI
