machine learning

Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments

Submitted by erantone

on Thu, 02/09/2023 - 20:43

Deriving robust control policies for realistic urban navigation scenarios is not a trivial task. In an end-to-end approach, these policies must map high-dimensional images from the vehicle's cameras to low-level actions such as steering and throttle. While pure Reinforcement Learning (RL) approaches are based exclusively on rewards, Generative Adversarial Imitation Learning (GAIL) agents learn from expert demonstrations while interacting with the environment, which favors GAIL on tasks for which a reward signal is difficult to derive.

In this work, the hGAIL architecture was proposed to solve the autonomous navigation of a vehicle in an end-to-end approach, mapping sensory perceptions directly to low-level actions, while simultaneously learning mid-level input representations of the agent's environment. The proposed hGAIL consists of an hierarchical Adversarial Imitation Learning architecture composed of two main modules: the GAN (Generative Adversarial Nets) which generates the Bird's-Eye View (BEV) representation mainly from the images of three frontal cameras of the vehicle, and the GAIL which learns to control the vehicle based mainly on the BEV predictions from the GAN as input.

Our experiments have shown that GAIL exclusively from cameras (without BEV) fails to even learn the task, while hGAIL, after training, was able to autonomously navigate successfully in all intersections of the city.

Fig. 1 - Hierarchical Generative Adversarial Imitation Learning (hGAIL) for policy learning with mid-level input representation. It basically consists of chained GAN and GAIL networks, where the first one (GAN) generates BEV representation from the vehicle's three frontal cameras, sparse trajectory and high-level command, while the latter (GAIL) outputs the acceleration and steering based on the predicted BEV input (generated by GAN), the current speed and the last applied actions. Both GAN and GAIL learn simultaneously while the agent interacts to the CARLA environment. The discriminator parts of both networks are not shown for the sake of simplicity.

This research has been summarized in a paper that is under review. More details at:

https://github.com/gustavokcouto/hgail

Some of the results can be seen on Youtube as series of videos, showing the learning through interaction with the urban environment:
https://www.youtube.com/playlist?list=PLDjIHGRag4wB2v5KdnInWj7MTCTNEPQ3U

Both the intermediate mid-level input BEV representation and the control policy are learned as the agent navigates in an urban town.
After training, we can see the autonomous vehicle navigation in the video below:

We can also observe the learning of the Bird's-Eye view representation of the Conditional GAN embedded in the hGAIL architecture, which learns concomitantly with the GAIL policy:

Read more about Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments

Links to Interesting projects

Submitted by erantone

on Thu, 06/25/2020 - 15:16

Physics simulation for games and robotics:
https://github.com/bulletphysics/bullet3
YOLO: real time object detection system based on ConvNets
https://pjreddie.com/darknet/yolo/

Tags:

machine learning

Inteligência Artificial, Educação e Trabalho

Submitted by erantone

on Fri, 08/30/2019 - 16:16

Entrevista concedida por mim para a Revista Texto Livro: Linguagem e Tecnologia,
por meio da entrevistadora Tacia Rocha, replicada logo abaixo.

URL original da publicação: http://www.periodicos.letras.ufmg.br/index.php/textolivre/article/view/15469/1125612566

Título: INTELIGÊNCIA ARTIFICIAL, EDUCAÇÃO E TRABALHO: ENTREVISTA COM ERIC AISLAN ANTONELO

A Inteligência Artificial (IA) é um campo de estudo acadêmico e da engenharia no qual você se especializou e que passa por várias áreas da ciência da computação. O que é a IA e qual a combinação de tecnologias que a fazem funcionar?

A Inteligência

Tags:

inteligência artificial

redes neurais

machine learning

deep learning

You are here

Search form

Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments

Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments

Links to Interesting projects

Inteligência Artificial, Educação e Trabalho

A Inteligência Artificial (IA) é um campo de estudo acadêmico e da engenharia no qual você se especializou e que passa por várias áreas da ciência da computação. O que é a IA e qual a combinação de tecnologias que a fazem funcionar?