Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

CSKrishna

We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting

9 Stars
4 Forks
9 Watchers
Jupyter Notebook Language
Cost to Build
$500
Market Value
$500

Growth over time

2 data points  ·  2021-08-07 → 2021-11-21
Stars Forks Watchers
💬

How do you feel about this project?

Ask AI about Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Question copied to clipboard

What is the CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting GitHub project? Description: "We use policy gradient to help agents learn optimal policies in a competitive multi-agent contextual bandit setting". Written in Jupyter Notebook. Explain what it does, its main use cases, key features, and who would benefit from using it.

Question is copied to clipboard — paste it after the AI opens.

How to clone Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting

Clone via HTTPS

git clone https://github.com/CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting.git

Clone via SSH

[email protected]:CSKrishna/Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting.git

Download ZIP

Download master.zip

Found an issue?

Report bugs or request features on the Optimal-bidding-policy-using-Policy-Gradient-in-a-Multi-agent-Contextual-Bandit-setting issue tracker:

Open GitHub Issues