X
Search Filters
Format Format
Subjects Subjects
Subjects Subjects
X
Sort by Item Count (A-Z)
Filter by Count
reinforcement learning (71) 71
actor-critic (49) 49
algorithms (39) 39
learning (29) 29
neural networks (29) 29
computer science, artificial intelligence (22) 22
engineering, electrical & electronic (21) 21
automation & control systems (20) 20
machine learning (19) 19
reinforcement (19) 19
optimal control (17) 17
basal ganglia (16) 16
computer simulation (16) 16
neurosciences (15) 15
analysis (14) 14
artificial intelligence (14) 14
dopamine (14) 14
index medicus (13) 13
actor-critic algorithms (12) 12
neuroscience (12) 12
approximate dynamic programming (11) 11
dynamic programming (11) 11
optimization (11) 11
robots (11) 11
adaptive control (10) 10
control systems (10) 10
robotics (10) 10
actor–critic (9) 9
computer science (9) 9
engineering (9) 9
mathematical models (9) 9
neurons (9) 9
artificial neural networks (8) 8
computer science, interdisciplinary applications (8) 8
convergence (8) 8
mathematical model (8) 8
neural network (8) 8
reward (8) 8
actor-critic models (7) 7
decision making (7) 7
deep learning (7) 7
games (7) 7
humans (7) 7
instruments & instrumentation (7) 7
mathematics, applied (7) 7
policy iteration (7) 7
systems (7) 7
tracking control (7) 7
animals (6) 6
computer science, hardware & architecture (6) 6
computer science, theory & methods (6) 6
design (6) 6
equations (6) 6
function approximation (6) 6
heuristic algorithms (6) 6
markov processes (6) 6
operations research & management science (6) 6
stability (6) 6
usage (6) 6
action selection (5) 5
actor critic (5) 5
actor-critic learning (5) 5
actor-critic methods (5) 5
approximation algorithms (5) 5
computer science, information systems (5) 5
controllers (5) 5
dynamical systems (5) 5
feedback (5) 5
markov analysis (5) 5
markov decision processes (5) 5
markov-prozess (5) 5
motor control (5) 5
neostriatum (5) 5
nonlinear systems (5) 5
q-learning (5) 5
simulation (5) 5
stochastic-approximation (5) 5
striatum (5) 5
training (5) 5
actor-critic algorithm (4) 4
actor-critic method (4) 4
actor-critic model (4) 4
actor-critic structure (4) 4
actor/critic structures (4) 4
adaptation (4) 4
adaptive dynamic programming (4) 4
behavior (4) 4
biology (4) 4
cmac (4) 4
computation by abstract devices (4) 4
control, robotics, mechatronics (4) 4
cortex (4) 4
deep reinforcement learning (4) 4
dorsal striatum (4) 4
dynamics (4) 4
dynamische programmierung (4) 4
electrical engineering (4) 4
entropy (4) 4
exploration (4) 4
h-infinity control (4) 4
more...
Language Language
Publication Date Publication Date
Click on a bar to filter by decade
Slide to change publication date range


Journal Article
International Journal of Robust and Nonlinear Control, ISSN 1049-8923, 11/2017, Volume 27, Issue 16, pp. 2900 - 2920
Journal Article
International Journal of Robust and Nonlinear Control, ISSN 1049-8923, 07/2019, Volume 29, Issue 11, pp. 3502 - 3517
Journal Article
Automatica, ISSN 0005-1098, 08/2019, Volume 106, pp. 221 - 229
Journal Article
IEEE Transactions on Neural Networks and Learning Systems, ISSN 2162-237X, 2/2020, pp. 1 - 15
.... We finally leverage an actor/critic structure to solve the problem online while guaranteeing optimality, stability, and safety... 
safety-critical systems | asymptotic stability | reinforcement learning (RL) | Actor/critic structures | barrier functions
Journal Article
Applied Soft Computing, ISSN 1568-4946, 12/2015, Volume 37, pp. 702 - 714
Journal Article
International Journal of Robust and Nonlinear Control, ISSN 1049-8923, 06/2020, Volume 30, Issue 9, pp. 3706 - 3726
Journal Article
Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN 0302-9743, 2018, Volume 10842, pp. 763 - 776
Conference Proceeding
IFAC-PapersOnLine, 07/2017, Volume 50, Issue 1, pp. 4920 - 4928
.... Various ideas for exploiting the knowledge on the structure and the properties of the optimal value function and the optimal policy in reinforcement learning theory and practice are presented... 
Model predictive control | approximate dynamic programming | multi-parametric programming | actor-critic structure | reinforcement learning
Journal Article
Applied Mathematical Finance, ISSN 1350-486X, 09/2019, Volume 26, Issue 5, pp. 387 - 452
In corporate bond markets, which are mainly OTC markets, market makers play a central role by providing bid and ask prices for bonds to asset managers.... 
stochastic optimal control | actor-critic algorithms | Market making | reinforcement learning
Journal Article
控制理论与应用:英文版, ISSN 1672-6340, 2011, Volume 9, Issue 3, pp. 421 - 430
The adaptive critic heuristic has been a popular algorithm in reinforcement learning(RL) and approximate dynamic programming(ADP) alike.It is one of the first... 
Engineering | Control | Adaptive critics | Reinforcement learning | Approximate dynamic programming | Actor critics | Semi-Markov | Control Structures and Microprogramming | Markov processes | Airlines | Algorithms | Scientists | Mechanical engineering | Consulting services | Commercial planes | Revenues | Mathematical models | Management | Commercial aircraft | Dynamic programming | Heuristic
Journal Article
電子情報通信学会技術研究報告. NC, ニューロコンピューティング, ISSN 0913-5685, 03/2007, Volume 106, pp. 31 - 36
... 
Journal Article
Transactions of the Institute of Measurement & Control, ISSN 0142-3312, 08/2008, Volume 30, Issue 3-4, pp. 207 - 223
...) with classical control structures. It has been shown that certain types of ANN can extend the capabilities of adaptive controllers by making them applicable for more complex... 
Approximate dynamic programming | Policy iteration | Actor/Critic structures | Linear quadratic regulation | approximate dynamic programming | INSTRUMENTS & INSTRUMENTATION | linear quadratic regulation | ZERO-SUM GAMES | policy iteration | AUTOMATION & CONTROL SYSTEMS | DESIGNS | Control systems | Models | Biology | Control theory | Dynamic programming
Journal Article
Automatica, ISSN 0005-1098, 01/2013, Volume 49, Issue 1, pp. 82 - 92
...) is proposed to approximate the Hamilton–Jacobi–Bellman equation using three neural network (NN) structures—actor and critic NNs approximate the optimal control... 
Approximate dynamic programming | Learning control | Actor–critic–identifier | Adaptive control | Optimal control | Actor-critic-identifier | ASYMPTOTIC TRACKING | TIME | AUTOMATION & CONTROL SYSTEMS | ENGINEERING, ELECTRICAL & ELECTRONIC | Control systems | Algorithms | Neural networks | Analysis
Journal Article
No results were found for your search.

Cannot display more than 1000 results, please narrow the terms of your search.