
A novel method for finding grasping handles in a clutter using RGBD Gaussian mixture models

Published online by Cambridge University Press:  16 June 2021

Olyvia Kundu
Affiliation:
TATA Consultancy Services, Bangalore 560066, India
Samrat Dutta
Affiliation:
TATA Consultancy Services, Bangalore 560066, India
Swagat Kumar*
Affiliation:
TATA Consultancy Services, Bangalore 560066, India

*Corresponding author. Email: swagat.kumar@tcs.com

Abstract

The paper proposes a novel method to detect graspable handles for picking objects from a confined and cluttered space, such as the bins of a rack in a retail warehouse. The proposed method combines color and depth-curvature information to create a Gaussian mixture model that segments the target object from its background, and it imposes the geometrical constraints of a two-finger gripper to localize the graspable regions. This helps to overcome the limitations of a poorly trained deep-network object detector and provides a simple and efficient method for grasp pose detection that requires no a priori knowledge of object geometry and can run online with near real-time performance. The efficacy of the proposed approach is demonstrated through simulations as well as real-world experiments.
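To make the segmentation idea concrete, the sketch below shows one way the color-plus-curvature Gaussian mixture step could be realized. This is a minimal illustration, not the authors' implementation: it assumes per-pixel RGB values and a surface-curvature map computed from the depth image, and the function names (build_features, fit_models, segment) are hypothetical. It uses scikit-learn's GaussianMixture to model the object and background distributions and labels each pixel by comparing log-likelihoods.

# Minimal sketch (not the authors' code): segment a target object from an
# RGB-D crop by fitting Gaussian mixture models to per-pixel color and
# depth-curvature features, keeping pixels that are more likely under the
# object model than under the background model.
import numpy as np
from sklearn.mixture import GaussianMixture

def build_features(rgb, curvature):
    """Stack per-pixel color and curvature into an (N, 4) feature array.

    rgb       : (H, W, 3) uint8 color image
    curvature : (H, W) float map of surface curvature from the depth image
    """
    colour = rgb.reshape(-1, 3).astype(np.float64) / 255.0
    curv = curvature.reshape(-1, 1).astype(np.float64)
    return np.hstack([colour, curv])

def fit_models(features, object_mask, n_components=3, seed=0):
    """Fit one GMM on pixels inside a rough object mask, one on the rest."""
    obj = GaussianMixture(n_components=n_components, random_state=seed)
    bg = GaussianMixture(n_components=n_components, random_state=seed)
    obj.fit(features[object_mask.ravel()])
    bg.fit(features[~object_mask.ravel()])
    return obj, bg

def segment(features, obj_gmm, bg_gmm, shape):
    """Label a pixel as object where its log-likelihood under the object
    GMM exceeds its log-likelihood under the background GMM."""
    obj_ll = obj_gmm.score_samples(features)
    bg_ll = bg_gmm.score_samples(features)
    return (obj_ll > bg_ll).reshape(shape)

A subsequent handle-localization step would then, as the abstract describes, scan the segmented region of the point cloud for surface patches whose opposing boundaries fit within the two-finger gripper's maximum opening; that geometric check is outside the scope of this sketch.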

Type
Article
Copyright
© The Author(s), 2021. Published by Cambridge University Press



Supplementary material

Kundu et al. supplementary material: video (9.1 MB)