Changes

Timeline of OpenAI

407 bytes added, 13:33, 16 May 2020

no edit summary

| 2016 || {{dts|May 31}} || Generative models || Publication || "VIME: Variational Information Maximizing Exploration", a paper on generative models, is submitted to the {{w|ArXiv}}. The paper introduces Variational Information Maximizing Exploration (VIME), an exploration strategy based on maximization of information gain about the agent's belief of environment dynamics.<ref>{{cite web |last1=Houthooft |first1=Rein |last2=Chen |first2=Xi |last3=Duan |first3=Yan |last4=Schulman |first4=John |last5=De Turck |first5=Filip |last6=Abbeel |first6=Pieter |title=VIME: Variational Information Maximizing Exploration |url=https://arxiv.org/abs/1605.09674 |website=arxiv.org |accessdate=27 March 2020}}</ref>

|-

| 2016 || {{dts|June 5}} || {{w|Reinforcement learning}} || Publication || "OpenAI Gym", a paper on {{w|reinforcement learning}}, is submitted to the {{w|ArXiv}}. It presents OpenAI Gym as a toolkit for reinforcement learning research.<ref>{{cite web |last1=Brockman |first1=Greg |last2=Cheung |first2=Vicki |last3=Pettersson |first3=Ludwig |last4=Schneider |first4=Jonas |last5=Schulman |first5=John |last6=Tang |first6=Jie |last7=Zaremba |first7=Wojciech |title=OpenAI Gym |url=https://arxiv.org/abs/1606.01540 |website=arxiv.org |accessdate=27 March 2020}}</ref> OpenAI Gym is considered by some as "a huge opportunity for speeding up the progress in the creation of better reinforcement algorithms, since it provides an easy way of comparing them, on the same conditions, independently of where the algorithm is executed".<ref>{{cite web |title=OPENAI GYM |url=https://www.theconstructsim.com/tag/openai_gym/ |website=theconstructsim.com |accessdate=16 May 2020}}</ref>

|-

| 2016 || {{dts|June 10}} || Generative models || Publication || "Improved Techniques for Training GANs", a paper on generative models, is submitted to the {{w|ArXiv}}. It presents a variety of new architectural features and training procedures that OpenAI applies to the generative adversarial networks (GANs) framework.<ref>{{cite web |last1=Salimans |first1=Tim |last2=Goodfellow |first2=Ian |last3=Zaremba |first3=Wojciech |last4=Cheung |first4=Vicki |last5=Radford |first5=Alec |last6=Chen |first6=Xi |title=Improved Techniques for Training GANs |url=https://arxiv.org/abs/1606.03498 |website=arxiv.org |accessdate=27 March 2020}}</ref>

Sebastian

62,734

edits

Changes

Timeline of OpenAI

Navigation menu

Personal tools

Namespaces

Variants

Views

More

Search

Navigation

Tools