Timeline of OpenAI

No change in size, 18:19, 5 April 2020
| 2017 || {{dts|August 18}} || Software release || OpenAI releases two implementations: ACKTR, a {{w|reinforcement learning}} algorithm, and A2C, a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C).<ref>{{cite web |title=OpenAI Baselines: ACKTR & A2C |url=https://openai.com/blog/baselines-acktr-a2c/ |website=openai.com |accessdate=5 April 2020}}</ref>
|-
| 2017 || {{dts|September 13}} || Publication || "Learning with Opponent-Learning Awareness" is first uploaded to the {{w|ArXiv}}. The paper presents Learning with Opponent-Learning Awareness (LOLA), a method in which each agent shapes the anticipated learning of the other agents in an environment.<ref>{{cite web |url=https://arxiv.org/abs/1709.04326 |title=[1709.04326] Learning with Opponent-Learning Awareness |accessdate=March 2, 2018}}</ref><ref>{{cite web |url=https://www.gwern.net/newsletter/2017/09 |author=gwern |date=August 16, 2017 |title=September 2017 news - Gwern.net |accessdate=March 2, 2018}}</ref>
|-
| 2017 || September || Staff || OpenAI Research Scientist Bowen Baker joins the organization.<ref>{{cite web |title=Bowen Baker |url=https://www.linkedin.com/in/bowen-baker-59b48a65/ |website=linkedin.com |accessdate=28 February 2020}}</ref>