
Domain adaptation-based transfer learning using adversarial networks

Published online by Cambridge University Press: 26 February 2020

Farzaneh Shoeleh
Affiliation:
University of New Brunswick, Fredericton, New Brunswick, Canada e-mail: fshoeleh@unb.ca
Mohammad Mehdi Yadollahi
Affiliation:
University of New Brunswick, Fredericton, New Brunswick, Canada e-mail: mehdiyadollahi@unb.ca
Masoud Asadpour
Affiliation:
University of Tehran, Tehran, Iran e-mail: asadpour@ut.ac.ir

Abstract

Machine learning techniques implicitly assume that each new task bears no relation to the tasks learned before it, so tasks are usually addressed independently. In some domains, however, particularly reinforcement learning (RL), this assumption is often wrong, because tasks in the same or similar domains tend to be related. Even when tasks differ considerably in their specifics, they may share general structure, such as common skills, that makes them related. In this paper, a novel domain adaptation-based method using adversarial networks is proposed to perform transfer learning in RL problems. The proposed method incorporates skills previously learned on a source task to speed up learning on a new target task, providing generalization not only within a task but also across different but related tasks. The experimental results indicate the effectiveness of our method in dealing with RL problems.
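
To make the adversarial mechanism the abstract describes concrete, below is a minimal PyTorch sketch of domain-adversarial training with a gradient-reversal layer, the scheme this family of domain adaptation methods builds on: a shared feature extractor is optimized on a source-task objective while reversed gradients from a source-vs-target discriminator push its features to be task-invariant, so what is learned on the source can carry over to the related target. The network shapes, the names (feature, task_head, domain_head, grad_reverse), and the toy data are illustrative assumptions, not the authors' implementation.

# Minimal sketch of domain-adversarial feature alignment (gradient reversal).
# All names and shapes are illustrative assumptions, not the paper's code.
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; reverses (and scales) gradients on the
    backward pass, so the feature extractor is trained to confuse the domain
    discriminator while the discriminator itself trains normally."""
    @staticmethod
    def forward(ctx, x, lambd):
        ctx.lambd = lambd
        return x.view_as(x)

    @staticmethod
    def backward(ctx, grad_output):
        return -ctx.lambd * grad_output, None

def grad_reverse(x, lambd=1.0):
    return GradReverse.apply(x, lambd)

# Shared feature extractor: maps raw state features to a latent (skill) space.
feature = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 32), nn.ReLU())
# Task head: a stand-in for the supervised signal on the source task,
# e.g. value prediction.
task_head = nn.Sequential(nn.Linear(32, 1))
# Domain discriminator: predicts whether a latent feature came from the
# source task or the target task.
domain_head = nn.Sequential(nn.Linear(32, 2))

opt = torch.optim.Adam(
    list(feature.parameters()) + list(task_head.parameters())
    + list(domain_head.parameters()), lr=1e-3)
ce, mse = nn.CrossEntropyLoss(), nn.MSELoss()

# Toy batches standing in for source-task transitions (with learning targets)
# and unlabeled target-task states.
src_states, src_targets = torch.randn(32, 4), torch.randn(32, 1)
tgt_states = torch.randn(32, 4)

for _ in range(100):
    z_src, z_tgt = feature(src_states), feature(tgt_states)

    # Supervised loss on the source task only.
    task_loss = mse(task_head(z_src), src_targets)

    # Domain loss on both tasks; the reversed gradient makes the feature
    # extractor render source and target features indistinguishable.
    z_all = grad_reverse(torch.cat([z_src, z_tgt]), lambd=0.1)
    domain_labels = torch.cat([torch.zeros(32), torch.ones(32)]).long()
    domain_loss = ce(domain_head(z_all), domain_labels)

    opt.zero_grad()
    (task_loss + domain_loss).backward()
    opt.step()

In the paper's setting, the supervised head would correspond to the learning signal of the source task's skills, and the aligned feature space is what allows those skills to speed up learning on the related target task.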

Type
Research Article
Copyright
© Cambridge University Press, 2020

