Procgen and MineRL Competitions

We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind, using Procgen Benchmark and MineRL.

Image GPT

We find that, just as a large transformer model trained on language can generate coherent text, exactly the same model trained on pixel sequences can generate...

OpenAI API

We’re releasing an API for accessing new AI models developed by OpenAI.

AI and efficiency

We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet classification has been...

Jukebox

We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist styles. We’re releasing the...

Improving verifiability in AI development

We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, Mila, Schwartz Reisman Institute for Technology and Society, Center for Advanced Study...

OpenAI Microscope

We’re introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision “model organisms” which are often studied in interpretability. Microscope makes...

Deep double descent

We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then improves again with increasing model size, data size, or...

Procgen Benchmark

We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a reinforcement learning agent learns generalizable skills.

Safety Gym

We’re releasing Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safety constraints while training.

GPT-2: 1.5B release

As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and model weights to facilitate detection of outputs of...

Fine-tuning GPT-2 from human preferences

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences...

Emergent tool use from multi-agent interaction

We’ve observed agents discovering progressively more complex tool use while playing a simple game of hide-and-seek. Through training in our new simulated hide-and-seek environment, agents build...

GPT-2: 6-month follow-up

We’re releasing the 774 million parameter GPT-2 language model after the release of our small 124M model in February, staged release of our medium 355M model in May, and subsequent...

Learning Day

At OpenAI, each Thursday is Learning Day: a day where employees have the option to self-study technical skills that will make them better at their job...

MuseNet

We’ve created MuseNet, a deep neural network that can generate 4-minute musical compositions with 10 different instruments, and can combine styles from country to Mozart to...

OpenAI Five Finals

We’ll be holding our final live event for OpenAI Five at 11:30am PT on April 13.

OpenAI Scholars 2019: Meet our Scholars

Our class of eight scholars (out of 550 applicants) brings together collective expertise in literature, philosophy, cell biology, statistics, economics, quantum physics, and business innovation.

OpenAI LP

We’ve created OpenAI LP, a new “capped-profit” company that allows us to rapidly increase our investments in compute and talent while including checks and balances to...

Introducing Activation Atlases

We’ve created activation atlases (in collaboration with Google researchers), a new technique for visualizing what interactions between neurons can represent. As AI systems are deployed in increasingly sensitive contexts, having...

AI safety needs social scientists

We’ve written a paper arguing that long-term AI safety research needs social scientists to ensure AI alignment algorithms succeed when actual humans are involved. Properly aligning...

Better language models and their implications

We’ve trained a large-scale unsupervised language model which generates coherent paragraphs of text, achieves state-of-the-art performance on many language modeling benchmarks, and performs rudimentary reading comprehension,...

How AI training scales

We’ve discovered that the gradient noise scale, a simple statistical metric, predicts the parallelizability of neural network training on a wide range of tasks. Since complex...

Spinning Up in Deep RL

We’re releasing Spinning Up in Deep RL, an educational resource designed to let anyone learn to become a skilled practitioner in deep reinforcement learning. Spinning Up...

Learning concepts with energy functions

We’ve developed an energy-based model that can quickly learn to identify and generate instances of concepts, such as near, above, between, closest, and furthest, expressed as sets of...

Reinforcement learning with prediction-based rewards

We’ve developed Random Network Distillation (RND), a prediction-based method for encouraging reinforcement learning agents to explore their environments through curiosity, which for the first time exceeds average...
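
The mechanism is small enough to sketch: a predictor network is trained to match a fixed, randomly initialized target network, and the prediction error serves as the intrinsic reward, so states unlike the training data score high. Below is a minimal linear toy version (the real method uses deep networks on pixel observations; the dimensions, training loop, and "familiar subspace" setup here are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(0)

# A fixed, randomly initialized "target" network (linear, for brevity)
# and a predictor trained to match its outputs.
target_W = rng.normal(size=(8, 16))
pred_W = np.zeros((8, 16))

def intrinsic_reward(obs):
    # Prediction error: large for observations unlike the training data.
    err = obs @ pred_W - obs @ target_W
    return float(np.mean(err ** 2))

def train_predictor(obs_batch, lr=0.1, steps=200):
    # Gradient descent on the squared prediction error.
    global pred_W
    for _ in range(steps):
        err = obs_batch @ pred_W - obs_batch @ target_W
        pred_W -= lr * obs_batch.T @ err / len(obs_batch)

# "Familiar" states occupy only the first 4 observation dimensions.
familiar = np.hstack([rng.normal(size=(256, 4)), np.zeros((256, 4))])
train_predictor(familiar)

novel = rng.normal(size=(32, 8))  # states the predictor has never seen
# intrinsic_reward(novel) is far higher than intrinsic_reward(familiar)
```

Because the predictor only ever sees the familiar subspace, its error collapses there while staying large on novel states, which is exactly the exploration bonus the agent receives.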

OpenAI Scholars 2019: Applications open

We are now accepting applications for our second cohort of OpenAI Scholars, a program where we provide 6–10 stipends and mentorship to individuals from underrepresented groups...

The International 2018: Results

OpenAI Five lost two games against top Dota 2 players at The International in Vancouver this week, maintaining a good chance of winning for the first...

OpenAI Five Benchmark: Results

Yesterday, OpenAI Five won a best-of-three against a team of 99.95th percentile Dota players: Blitz, Cap, Fogged, Merlini, and MoonMeander—four of whom have played Dota professionally—in front of a live audience and 100,000...

Learning dexterity

We’ve trained a human-like robot hand to manipulate physical objects with unprecedented dexterity.

OpenAI Scholars 2018: Meet our Scholars

Our first class of OpenAI Scholars is underway, and you can now follow along as these experienced software developers become machine learning practitioners.

Glow: Better reversible generative models

We introduce Glow, a reversible generative model which uses invertible 1×1 convolutions. It extends previous work on reversible generative models and simplifies the architecture. Our model can generate realistic high...

OpenAI Five

Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2.

Retro Contest: Results

The first run of our Retro Contest—exploring the development of algorithms that can generalize from previous experience—is now complete.

OpenAI Fellows Fall 2018

We’re now accepting applications for the next cohort of OpenAI Fellows, a program which offers a compensated 6-month apprenticeship in AI research at OpenAI.

Gym Retro

We’re releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly released game count from around 70 Atari games...

AI and compute

We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponentially with a 3.4-month...

AI safety via debate

We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins.

Evolved Policy Gradients

We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning agents, which can enable fast training on...

Retro Contest

We’re launching a transfer learning contest that measures a reinforcement learning algorithm’s ability to generalize from previous experience.

Reptile: A scalable meta-learning algorithm

We’ve developed a simple meta-learning algorithm called Reptile which works by repeatedly sampling a task, performing stochastic gradient descent on it, and updating the initial parameters...
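
That update rule fits in a few lines. Here is a minimal sketch on a toy one-parameter regression family (the task distribution, learning rates, and step counts are illustrative assumptions, not values from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_task():
    # A toy task family: fit y = a * x for a random slope a in [1, 3].
    a = rng.uniform(1.0, 3.0)
    x = rng.uniform(-1.0, 1.0, size=32)
    return x, a * x

def inner_sgd(w, x, y, lr=0.02, steps=10):
    # Ordinary gradient descent on squared error for one sampled task.
    for _ in range(steps):
        grad = np.mean(2.0 * (w * x - y) * x)
        w -= lr * grad
    return w

def reptile(meta_steps=2000, outer_lr=0.1):
    phi = 0.0  # the meta-learned initialization (a single weight in this toy)
    for _ in range(meta_steps):
        x, y = sample_task()
        w = inner_sgd(phi, x, y)
        # Reptile's update: move the initialization toward the adapted weights.
        phi += outer_lr * (w - phi)
    return phi

phi = reptile()  # ends up near the centre of the slope distribution
```

Each adapted solution pulls the initialization toward that task's optimum, so `phi` drifts to a point from which every task in the family is only a few SGD steps away.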

OpenAI Scholars

We’re providing 6–10 stipends and mentorship to individuals from underrepresented groups to study deep learning full-time for 3 months and open-source a project.

Ingredients for robotics research

We’re releasing eight simulated robotics environments and a Baselines implementation of Hindsight Experience Replay, all developed for our research over the past year. We’ve used these...

OpenAI hackathon

Come to OpenAI’s office in San Francisco’s Mission District for talks and a hackathon on Saturday, March 3rd.

Preparing for malicious uses of AI

We’ve co-authored a paper that forecasts how malicious actors could misuse AI technology, and potential ways we can prevent and mitigate these threats. This paper is...

Requests for Research 2.0

We’re releasing a new batch of seven unsolved problems which have come up in the course of our research at OpenAI.

Block-sparse GPU kernels

We’re releasing highly-optimized GPU kernels for an underexplored class of neural network architectures: networks with block-sparse weights. Depending on the chosen sparsity, these kernels can run...

Learning a hierarchy

We’ve developed a hierarchical reinforcement learning algorithm that learns high-level actions useful for solving a range of tasks, allowing fast solving of tasks requiring thousands of...

Generalizing from simulation

Our latest robotics techniques allow robot controllers, trained entirely in simulation and deployed on physical robots, to react to unplanned changes in the environment as they...

Meta-learning for wrestling

We show that for the task of simulated robot wrestling, a meta-learning agent can learn to quickly defeat a stronger non-meta-learning agent, and also show that...

Competitive self-play

We’ve found that self-play allows simulated AIs to discover physical skills like tackling, ducking, faking, kicking, catching, and diving for the ball, without explicitly designing an...

Learning to model other minds

We’re releasing an algorithm which accounts for the fact that other agents are learning too, and discovers self-interested yet collaborative strategies like tit-for-tat in the iterated...

OpenAI Baselines: ACKTR & A2C

We’re releasing two new OpenAI Baselines implementations: ACKTR and A2C. A2C is a synchronous, deterministic variant of Asynchronous Advantage Actor Critic (A3C) which we’ve found gives...

More on Dota 2

Our Dota 2 result shows that self-play can catapult the performance of machine learning systems from far below human level to superhuman, given sufficient compute. In...

Dota 2

We’ve created a bot which beats the world’s top professionals at 1v1 matches of Dota 2 under standard tournament rules. The bot learned the game from...

Gathering human feedback

RL-Teacher is an open-source implementation of our interface to train AIs via occasional human feedback rather than hand-crafted reward functions. The underlying technique was developed as...

Better exploration with parameter noise

We’ve found that adding adaptive noise to the parameters of reinforcement learning algorithms frequently boosts performance. This exploration method is simple to implement and very rarely...
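In outline, the adaptive scheme perturbs the policy's weights (rather than its actions) and rescales the noise so its effect on behaviour stays near a target size. A toy sketch of that loop (the linear-tanh policy, target distance, and scaling factor below are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)

def policy(w, obs):
    # A toy deterministic policy mapping observations to actions.
    return np.tanh(obs @ w)

def action_distance(w, w_noisy, obs):
    # How much the parameter perturbation changes the policy's actions.
    return float(np.sqrt(np.mean((policy(w, obs) - policy(w_noisy, obs)) ** 2)))

def adapt_sigma(sigma, dist, target=0.1, factor=1.01):
    # Shrink the noise scale when its behavioural effect overshoots the
    # target, grow it when it undershoots.
    return sigma / factor if dist > target else sigma * factor

w = rng.normal(size=(4, 2))
sigma = 0.05
obs = rng.normal(size=(64, 4))
for _ in range(200):
    w_noisy = w + sigma * rng.normal(size=w.shape)  # perturb the parameters
    sigma = adapt_sigma(sigma, action_distance(w, w_noisy, obs))
```

Tying the noise scale to action-space distance is what makes the method robust across architectures: the same target behaves sensibly whether the network's weights are large or small.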

Proximal Policy Optimization

We’re releasing a new class of reinforcement learning algorithms, Proximal Policy Optimization (PPO), which perform comparably or better than state-of-the-art approaches while being much simpler to...

Robust adversarial inputs

We’ve created images that reliably fool neural network classifiers when viewed from varied scales and perspectives. This challenges a claim from last week that self-driving cars...

Faster physics in Python

We’re open-sourcing a high-performance Python library for robotic simulation using the MuJoCo engine, developed over our past year of robotics research.

Learning from human preferences

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex...

OpenAI Baselines: DQN

We’re open-sourcing OpenAI Baselines, our internal effort to reproduce reinforcement learning algorithms with performance on par with published results. We’ll release the algorithms over upcoming months;...

Robots that learn

We’ve created a robotics system, trained entirely in simulation and deployed on a physical robot, which can learn a new task after seeing it done once.

Roboschool

We are releasing Roboschool: open-source software for robot simulation, integrated with OpenAI Gym.

Unsupervised sentiment neuron

We’ve developed an unsupervised system which learns an excellent representation of sentiment, despite being trained only to predict the next character in the text of Amazon...

Distill

We’re excited to support today’s launch of Distill, a new kind of journal aimed at excellent communication of machine learning results (novel or existing).

Team update

The OpenAI team is now 45 people. Together, we’re pushing the frontier of AI capabilities—whether by validating novel ideas, creating new software systems, or deploying machine...

Faulty reward functions in the wild

Reinforcement learning algorithms can break in surprising, counterintuitive ways. In this post we’ll explore one failure mode, which is where you misspecify your reward function.

Universe

We’re releasing Universe, a software platform for measuring and training an AI’s general intelligence across the world’s supply of games, websites and other applications.

OpenAI and Microsoft

We’re working with Microsoft to start running most of our large-scale experiments on Azure.

Infrastructure for deep learning

Deep learning is an empirical science, and the quality of a group’s infrastructure is a multiplier on progress. Fortunately, today’s open-source ecosystem makes it possible for...

Machine Learning Unconference

The latest information about the Unconference is now available at the Unconference wiki, which will be periodically updated with more information for attendees.

Team update

We’ve hired more great people to help us achieve our goals. Welcome, everyone!

Special projects

Impactful scientific work requires working on the right problems—problems which are not just interesting, but whose solutions matter.

Concrete AI safety problems

We (along with researchers from Berkeley and Stanford) are co-authors on today’s paper led by Google Brain researchers, Concrete Problems in AI Safety. The paper explores many...

OpenAI technical goals

OpenAI’s mission is to build safe AI, and ensure AI’s benefits are as widely and evenly distributed as possible.

Generative models

This post describes four projects that share a common theme of enhancing or using generative models, a branch of unsupervised learning techniques in machine learning. In...

Team update

We’d like to welcome the latest set of team members to OpenAI (and we’re still hiring!).

OpenAI Gym Beta

We’re releasing the public beta of OpenAI Gym, a toolkit for developing and comparing reinforcement learning (RL) algorithms. It consists of a growing suite of environments...

Team++

We’ve had some fantastic people join over the past few months (and we’re still hiring). Welcome, everyone!

Introducing OpenAI

OpenAI is a non-profit artificial intelligence research company. Our goal is to advance digital intelligence in the way that is most likely to benefit humanity as...
