Skip to main content

7 docs tagged with "Connect 4"

View All Tags

Agent

The final policy is used to make the agent, which like the analytical agent, follows some rules.

Alphabeta

Two modifications are made to the agent: the minimax algorithm is optimized with alpha-beta pruning and complexity is added to the heuristic.

Environment

I made the class for the environment using OpenAI gymnasium and initialized an environment.

Introduction

This is the Kaggle competition regarding Game AI and Reinforcement Learning.

Introduction

This is the Kaggle competition regarding Game AI and Reinforcement Learning.

Minimax

The agent follows this algorithm to decide its next move deterministically:

Self-play

The policy is a custom CNN policy made using PyTorch: