Deep Q-Learning for Dry Stacking Irregular Objects

  • Yifang Liu
  • , Seyed Mahdi Shamsi
  • , Le Fang
  • , Changyou Chen
  • , Nils Napp

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

18 Scopus citations

Abstract

We propose a reinforcement learning approach for automatically building dry stacked (i.e. no mortar) structures with irregular objects. Stacking irregular objects is a challenging problem since each assembly action can be drawn from a continuous space of poses for an object, and several local geometric and physical considerations strongly affect the stability. To tackle this challenge, we concentrate on a simplified 2D version of the problem. We present a reinforcement learning algorithm based on deep $Q$-learning, where the learned $Q$-function, which maps state-action pairs into expected long-term rewards, is represented by a deep neural network. As the action space is continuous the $Q$-network is trained by sampling a finite number of actions that consider both geometric and physical constraints to approximate the target $Q$-values, Experiments show that the proposed method outperforms previous heuristics-based planning, leading to super construction with objects containing a significant amount of variations. We validate the generated stacking plans by executing them using a robot arm and manufactured, irregular objects.

Original languageEnglish
Title of host publication2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1569-1576
Number of pages8
ISBN (Electronic)9781538680940
DOIs
StatePublished - Dec 27 2018
Externally publishedYes
Event2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2018 - Madrid, Spain
Duration: Oct 1 2018Oct 5 2018

Publication series

NameIEEE International Conference on Intelligent Robots and Systems
ISSN (Print)2153-0858
ISSN (Electronic)2153-0866

Conference

Conference2018 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2018
Country/TerritorySpain
CityMadrid
Period10/1/1810/5/18

Fingerprint

Dive into the research topics of 'Deep Q-Learning for Dry Stacking Irregular Objects'. Together they form a unique fingerprint.

Cite this