Amir-massoud Farahmand

Cited by

	All	Since 2019
Citations	2102	1289
h-index	23	19
i10-index	40	31

300

150

225

200820092010201120122013201420152016201720182019202020212022202320248 42 43 66 63 79 77 77 98 102 138 157 207 239 294 278 113

Public access

View all

6 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Csaba SzepesvariDeepMind & University of AlbertaVerified email at cs.ualberta.ca
Mohammad GhavamzadehAmazonVerified email at amazon.com
Daniel NikovskiChief Scientist, Mitsubishi Electric Research LabsVerified email at merl.com
Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchVerified email at technion.ac.il
Doina PrecupDeepMind and McGill UniversityVerified email at cs.mcgill.ca
Azad ShademanIntuitive Surgical Inc.Verified email at intusurg.com
Martin JagersandUniversity of AlbertaVerified email at cs.ualberta.ca
Yangchen PanUniversity of OxfordVerified email at eng.ox.ac.uk
Andre BarretoResearch Scientist, Google DeepMindVerified email at google.com
Martha WhiteUniversity of AlbertaVerified email at ualberta.ca
Majid Nili AhmadabadiProfessor of ECE, University of TehranVerified email at ut.ac.ir
Saleh NabiAI Researcher, Schneider ElectricVerified email at se.com
Joelle PineauSchool of Computer Science, McGill University; FAIR, Meta AI; MilaVerified email at cs.mcgill.ca
Babak N AraabiProfessor of ECE, University of TehranVerified email at ut.ac.ir
Beomjoon KimKorea Advanced Institute of Science & Technology (KAIST)Verified email at kaist.ac.kr
J. Andrew BagnellCarnegie Mellon UniversityVerified email at ri.cmu.edu
mouhacine benosmanMERL- Data Analytics GroupVerified email at merl.com
Rémi MunosDeepMindVerified email at inria.fr
Claas VoelckerPhD student at University of TorontoVerified email at cs.toronto.edu
Romina AbachiUniversity of Toronto, Vector InstituteVerified email at mail.utoronto.ca

Amir-massoud Farahmand

University of Toronto

Verified email at cs.toronto.edu - Homepage

Machine Learning Reinforcement Learning Sequential Decision Making Statistical Learning Theory


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Error propagation for approximate policy and value iteration A Farahmand, C Szepesvári, R Munos Advances in Neural Information Processing Systems (NeurIPS), 568-576, 2010	254	2010
Regularized Policy Iteration A Farahmand, M Ghavamzadeh, S Mannor, C Szepesvári Advances in Neural Information Processing Systems 21 (NeurIPS 2008), 441-448, 2009	162	2009
Manifold-adaptive dimension estimation A Farahmand, C Szepesvári, JY Audibert Proceedings of the 24th International Conference on Machine Learning (ICML …, 2007	136	2007
Learning from Limited Demonstrations B Kim, A Farahmand, J Pineau, D Precup Advances in Neural Information Processing Systems (NeurIPS), 2859-2867, 2013	131	2013
Regularized policy iteration with nonparametric function spaces A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor Journal of Machine Learning Research (JMLR) 17 (1), 4809-4874, 2016	123*	2016
Value-aware loss function for model-based reinforcement learning A Farahmand, A Barreto, D Nikovski Artificial Intelligence and Statistics (AISTATS), 1486-1494, 2017	116	2017
Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems A Farahmand, M Ghavamzadeh, C Szepesvári, S Mannor American Control Conference (ACC), 725-730, 2009	96*	2009
Robust jacobian estimation for uncalibrated visual servoing A Shademan, A Farahmand, M Jägersand IEEE International Conference on Robotics and Automation (ICRA), 5564-5569, 2010	85	2010
Model Selection in Reinforcement Learning AM Farahmand, C Szepesvári Machine learning 85 (3), 299-332, 2011	70	2011
Iterative Value-Aware Model Learning A Farahmand Advances in Neural Information Processing Systems (NeurIPS), 9072-9083, 2018	64	2018
Action-Gap Phenomenon in Reinforcement Learning AM Farahmand Neural Information Processing Systems (NeurIPS), 2011	62	2011
Global visual-motor estimation for uncalibrated visual servoing A Farahmand, A Shademan, M Jagersand IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS …, 2007	53*	2007
Deep reinforcement learning for partial differential equation control A Farahmand, S Nabi, DN Nikovski American Control Conference (ACC), 3120-3127, 2017	47	2017
Regularization in Reinforcement Learning AM Farahmand Department of Computing Science, University of Alberta, 2011	45	2011
Model-based and model-free reinforcement learning for visual servoing A Farahmand, A Shademan, M Jagersand, C Szepesvári IEEE International Conference on Robotics and Automation (ICRA), 2917-2924, 2009	39*	2009
Attentional network for visual object detection K Hara, MY Liu, O Tuzel, A Farahmand arXiv preprint arXiv:1702.01478, 2017	38	2017
Approximate MaxEnt Inverse Optimal Control and its Application for Mental Simulation of Human Interactions DA Huang, AM Farahmand, KM Kitani, JA Bagnell AAAI Conference on Artificial Intelligence (AAAI), 2015	32	2015
Policy-aware model learning for policy gradient methods R Abachi, M Ghavamzadeh, A Farahmand arXiv:2003.00030, 2020	31	2020
Interaction of Culture-based Learning and Cooperative Co-evolution and its Application to Automatic Behavior-based System Design AM Farahmand, MN Ahmadabadi, C Lucas, BN Araabi IEEE Transactions on Evolutionary Computation 14 (1), 23-57, 2010	26	2010
Method for Data-Driven Learning-based Control of HVAC Systems using High-Dimensional Sensory Observations A Farahmand, S Nabi, P Grover, DN Nikovski US Patent App. 15/290,038, 2018	24	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors