diff --git a/README.md b/README.md index b159e57..e6a7bf8 100644 --- a/README.md +++ b/README.md @@ -173,6 +173,7 @@ If you aren't able to access any paper on this list, please [try using Sci-Hub]( - [The Hanabi challenge: From Artificial Teams to Mixed Human-Machine Teams](http://oru.diva-portal.org/smash/record.jsf?pid=diva2%3A1691114&dswid=-1981) (thesis) - [A Graphical User Interface For The Hanabi Challenge Benchmark](http://urn.kb.se/resolve?urn=urn:nbn:se:oru:diva-94615) (thesis) - [Analysis of Symmetry and Conventions in Off-Belief Learning (OBL) in Hanabi](https://fanpu.io/blog/2022/symmetry-and-conventions-in-obl-hanabi/) (blogPost) +- [Using intuitive behavior models to adapt to and work with human teammates in Hanabi](http://reports-archive.adm.cs.cmu.edu/anon/anon/usr0/ftp/usr/ftp/2022/abstracts/22-119.html) (thesis) # Hearthstone - [Mapping Hearthstone Deck Spaces through MAP-Elites with Sliding Boundaries](http://arxiv.org/abs/1904.10656) (journalArticle) diff --git a/boardgame-research.rdf b/boardgame-research.rdf index ccb92ae..415f98e 100644 --- a/boardgame-research.rdf +++ b/boardgame-research.rdf @@ -9328,6 +9328,45 @@ guaranteed decent high score. The algorithm got a lowest score of 79 and a 2023-03-01 11:40:51 3 + + thesis + + + Computer Science Department School of Computer Science, Carnegie Mellon University + + + + + + + Arnav Mahajan + + + + + + Using intuitive behavior models to adapt to and work with human teammates in Hanabi + An agent that can rapidly and accurately model its teammate is a powerful tool in the field of Collaborative AI. Furthermore, if an approximation for this goal was possible in the field of Human-AI Collaboration, teams of people and machines could be more efficient and effective immediately after starting to work together. Using the cooperative card game Hanabi as a testbed, we developed the Chief agent, which models teammates using a pool of intuitive behavioral models. To achieve the goal of rapid learning, it uses Bayesian inference to quickly evaluate the different models relative to each other. To generate an accurate model, it uses historical data augmented by up-to-date knowledge and sampling methods to handle environmental noise and unknowns. We demonstrate that the Chief's mechanisms for modeling and understanding the teammate show promise, but the overall performance still can use improvement to reliably outperform a solution which skips inferring a best strategy and assumes all strategies in the pool are equally likely for the teammate. + en + + + http://reports-archive.adm.cs.cmu.edu/anon/anon/usr0/ftp/usr/ftp/2022/abstracts/22-119.html + + + 43 + M.S. Thesis + + + attachment + CMU-CS-22-119.pdf + + + http://reports-archive.adm.cs.cmu.edu/anon/2022/CMU-CS-22-119.pdf + + + 2023-03-11 03:49:56 + 3 + 2048 @@ -9442,6 +9481,7 @@ guaranteed decent high score. The algorithm got a lowest score of 79 and a + Hearthstone