コーパス検索結果 (1語後でソート)
通し番号をクリックするとPubMedの該当ページを表示します
1 basal ganglia are important for movement and reinforcement learning.
2 separation and conjunctive representation in reinforcement learning.
3 ereas choice bias results from supervised or reinforcement learning.
4 s to reveal their separable contributions to reinforcement learning.
5 gnment with more recent theories of Bayesian reinforcement learning.
6 ap in the quest for the neural mechanisms of reinforcement learning.
7 thms known as "model-based" and "model-free" reinforcement learning.
8 ents a form of spatial credit assignment for reinforcement learning.
9 ty in response to stimuli is a key factor in reinforcement learning.
10 ant consequences for computational models of reinforcement learning.
11 arallel contribution of MB and MF systems in reinforcement learning.
12 roduce effects of dopaminergic medication on reinforcement learning.
13 CC behavioral patterns could be explained by reinforcement learning.
14 mine (DA) and regulates appetitive drive and reinforcement learning.
15 r to deviate sharply from the predictions of reinforcement learning.
16 e algorithmic implementation of imitation in reinforcement learning.
17 udy failed to replicate previous findings on reinforcement learning.
18 ntial hypothesis posits that dopamine biases reinforcement learning.
19 sure may be a critical cellular component of reinforcement learning.
20 licit influence of movement error signals on reinforcement learning.
21 y aging does not significantly impair simple reinforcement learning.
22 "model-free" and "model-based" strategies in reinforcement learning.
23 s: dynamical systems, Bayesian inference and reinforcement learning.
24 ntribution of these regions to probabilistic reinforcement learning.
25 neural control of instrumental behaviors by reinforcement learning.
26 d opposing effects on locomotor behavior and reinforcement learning.
27 riatum where they contribute to movement and reinforcement learning.
28 nisms underlying adaptive imitation in human reinforcement learning.
29 oss, which is associated with impairments in reinforcement learning.
30 h opacities, based on deep learning and deep reinforcement learning.
31 Training is achieved with Deep Reinforcement Learning.
32 a model that combines signal detection with reinforcement learning.
33 from regression to image classification and reinforcement learning.
34 clei that contribute to action selection and reinforcement learning.
35 n striatum, a central node in feedback-based reinforcement learning.
36 recognized to play an important role beyond reinforcement learning.
37 e for a neural realization of distributional reinforcement learning.
38 s, including motivation, motor learning, and reinforcement learning.
43 cent evidence indicates that, beyond classic reinforcement learning adaptations, individuals may also
46 able to other complex domains: a multi-agent reinforcement learning algorithm that uses data from bot
47 s' behaviour was better explained by a basic reinforcement learning algorithm, adults' behaviour inte
48 bine them with model-free, experience-based, reinforcement learning algorithms to train the gliders.
50 midbrain and the reward prediction errors of reinforcement learning algorithms, which express the dif
55 We then use both optimal control theory and reinforcement learning alongside a combination of analys
57 MRI, we found that adolescents showed better reinforcement learning and a stronger link between reinf
58 Moreover, the inverse association between reinforcement learning and anhedonia in patients implies
60 triatal DA D2 receptors (D2Rs) also regulate reinforcement learning and are implicated in glucose-rel
62 the representation of "state," in studies of reinforcement learning and decision making, and also in
63 ns in the human striatum, a key substrate in reinforcement learning and decision making, are modulate
68 ershman review concepts and advances in deep reinforcement learning and discuss how these can inform
69 an remember a reward-baited location through reinforcement learning and do so quickly and without req
70 rcement learning and a stronger link between reinforcement learning and episodic memory for rewarding
71 that depression has the strongest effects on reinforcement learning and expectations about the future
72 pamine is thought to play a critical role in reinforcement learning and goal-directed behavior, but i
73 ptive function in this context, and also how reinforcement learning and incentive salience models may
74 prefrontal cortex-basal ganglia circuit for reinforcement learning and is ultimately reflected in do
76 ion of corticostriatal circuitry involved in reinforcement learning and motivation, although the inte
78 n the dorsal striatum has been implicated in reinforcement learning and regulation of motivation, but
80 sms that underlie these processes, including reinforcement learning and spike-timing-dependent plasti
81 d role for D3 receptors in select aspects of reinforcement learning and suggest that individual varia
82 emonstrator's value function through inverse reinforcement learning and uses it to bias action select
83 se time (RT) distributions that arise during reinforcement learning and value-based decision-making.
88 implications to research on self-projection, reinforcement learning, and predictive-processing models
90 uences for brain and computational models of reinforcement learning are discussed.SIGNIFICANCE STATEM
93 In particular, we aim to test the role of reinforcement learning as the microscopic mechanism used
94 domain knowledge and iteratively training a reinforcement learning-based chemical graph-set designer
95 nference, and that older adults rely more on reinforcement learning-based predictions than younger ad
97 ipheral glucose levels and glucose-dependent reinforcement learning behaviors and highlight the notio
98 at sensory processing, sequence learning and reinforcement learning, but are limited in their ability
99 the relative influence of the two systems in reinforcement learning, but few studies have manipulated
100 error signal proposed to support model-free reinforcement learning, cached-value errors are typicall
101 showing that individuals adopting a type of reinforcement learning, called aspiration learning, phen
103 In particular, learning from reward, or reinforcement learning, can be driven by two distinct co
104 uterized conformity task, assumed to rely on reinforcement learning circuits, to 32 patients with sch
105 t multiple sources of uncertainty impinge on reinforcement learning computations: uncertainty about t
110 changes were not associated with deficits in reinforcement learning during an object discrimination r
114 s indicate the great potential of multiagent reinforcement learning for artificial intelligence resea
115 of an approach from artificial intelligence-reinforcement learning-for the control of co-cultures wi
117 odel that augments the standard reward-based reinforcement learning formulation by associating a valu
118 s formal description of avoidance within the reinforcement learning framework provides a new means of
122 n to overcome the key technical challenge of reinforcement learning from imperfect data, which has pr
123 escribe inferences based on a combination of reinforcement learning from past feedback and participan
125 Computational modeling of trial-by-trial reinforcement learning further indicated that lower OFC
127 rch on artificial neural networks trained by reinforcement learning has made it possible to model fun
128 the mesoaccumbal circuit for motivation and reinforcement learning have not yet been examined in pri
129 yield choice patterns similar to model-free reinforcement learning; however, samples can vary from t
130 imply that older adults are only impaired in reinforcement learning if they additionally need to lear
132 eal an important role for the hippocampus in reinforcement learning in adolescence and suggest that r
134 l link between IL-6 and striatal RPEs during reinforcement learning in the context of acute psycholog
135 of these tasks ignores important aspects of reinforcement learning in the real world: (a) State spac
136 Here we propose an account of dopamine-based reinforcement learning inspired by recent artificial int
137 ork focused on agent imitation and show that reinforcement learning is a good candidate to explain ma
142 FICANCE STATEMENT In aversive and appetitive reinforcement learning, learned effects show extinction
144 al studies, the present results suggest that reinforcement learning may be a major proximate mechanis
145 ing is not accounted for by varying a single reinforcement learning mechanism, but by changing the se
146 egulation as resulting from a self-adjusting reinforcement-learning mechanism that infers latent stat
147 engagement are captured by a self-adjusting reinforcement-learning mechanism that tracks changing en
148 These associations can be attained with reinforcement learning mechanisms using a reward predict
149 iatal plasticity can be induced by classical reinforcement learning mechanisms, and might be central
151 emale Long-Evans rats are linked to specific reinforcement-learning mechanisms and are predictive of
152 tational framework was used to elucidate the reinforcement-learning mechanisms that change in adolesc
153 sional action and state spaces than existing reinforcement learning methods to model real-life comple
154 at this new approach is better than baseline reinforcement-learning methods in terms of overall perfo
155 behavior was analyzed using both a standard reinforcement learning model and analysis of choice swit
157 i delivered at random times and formulated a reinforcement learning model based on belief states.
158 slot machine game play as well as a simpler reinforcement learning model based on the Rescorla-Wagne
160 s behavioral and neural data compared with a reinforcement learning model inspired by rating systems
162 mood and anxiety group on a parameter of our reinforcement learning model that characterizes a prepot
163 ask performance was described using a simple reinforcement learning model that dissociates the contri
164 icipants' decisions were best explained by a reinforcement learning model that independently learned
165 To cope with uncertainty, we extended a reinforcement learning model with a belief state about t
171 alance between two dissociable strategies of reinforcement learning: model-free and model-based.
172 their inferences over time, we pitted simple reinforcement learning models against more specific "com
173 ic reinforcement learning task combined with reinforcement learning models and fMRI, we found that ad
175 stimuli, not actions.SIGNIFICANCE STATEMENT Reinforcement learning models of the ventral striatum (V
179 We found that, across different conditions, reinforcement learning models were approximately as accu
181 een implemented primarily as state-dependent reinforcement learning models with bias parameters to qu
182 iction errors underlie learning of values in reinforcement learning models, are represented by phasic
185 and food consumption through its effects on reinforcement learning, motivation, and hedonic experien
186 ional mechanisms, model-based and model-free reinforcement learning, neuronally implemented in fronto
188 ecting 1) Pavlovian biases in the context of reinforcement learning or 2) hyperprecise prior beliefs
189 ions may relate to abnormal decision making, reinforcement learning or somatic processing in TS.
190 Adolescents and adults carried out a novel reinforcement learning paradigm in which participants le
192 icated that in such an unstable environment, reinforcement learning parameters are downregulated depe
194 mechanism between model-based and model-free reinforcement learning, placing such a mechanism within
195 eralization of two fundamental operations in reinforcement learning: policy improvement and policy ev
197 of tasks previously solved, we can reduce a reinforcement-learning problem to a simpler linear regre
199 thin two hundred trials and errors, as their reinforcement learning processes interact with metacogni
201 d drug use, may be because of disruptions in reinforcement-learning processes that enable behavior to
202 euroscience and psychology; however, quantum reinforcement learning (QRL), which shows superior perfo
205 To test this assumption, we simulated a reinforcement learning (RL) agent equipped with a perfec
210 We have demonstrated the effectiveness of reinforcement learning (RL) in bluff body flow control p
211 emical spaces were used as training sets for reinforcement learning (RL) in combination with differen
216 ping paradigm, where subjects solve multiple reinforcement learning (RL) problems differing in struct
218 ed fMRI analysis revealed a fractionation of reinforcement learning (RL) signals in the ventral stria
219 g a novel oculomotor paradigm, combined with reinforcement learning (RL) simulations, we show that mo
220 eract during learning.SIGNIFICANCE STATEMENT Reinforcement learning (RL) theory has been remarkably p
224 nymous with the 'reward prediction error' of reinforcement learning (RL), and are thought to update n
225 ble systems, such as working memory (WM) and reinforcement learning (RL), contribute simultaneously t
226 e capacities and, thanks to progress in deep reinforcement learning (RL), it is now possible to apply
227 We review the psychology and neuroscience of reinforcement learning (RL), which has experienced signi
232 ve reliability of model-based and model-free reinforcement-learning (RL) systems plays a role in the
233 or signals consistent with formal models of "reinforcement learning" (RL) have repeatedly been found
234 ventral tegmental area (VTA) contributes to reinforcement learning, rodent evidence suggests that sl
236 librium, level-k cognition, fictitious play, reinforcement learning, selective payoff-biased imitatio
237 reased ventral striatal RPE signaling during reinforcement learning (session 2), though there was no
239 sing a combination of evolutionary analysis, reinforcement learning simulations, and behavioral exper
240 p to predict expert actions, and (ii) a deep reinforcement learning step to estimate the long-term va
241 ng in dynamic environments requires multiple reinforcement-learning steps that may be implemented by
243 older adults rely more heavily on suboptimal reinforcement-learning strategies supported by the ventr
244 showed worse performance and relied more on reinforcement-learning strategies than younger adults, w
247 ipiprazole on more cognitive facets of human reinforcement learning, such as learning from the forgon
249 gests an imbalance in the influence of these reinforcement learning systems on behavior in individual
252 his, we used a modified version of a classic reinforcement learning task in which feedback indicated
254 e and female) completed multiple blocks of a reinforcement learning task that contained a global hier
255 n counterfactual learning, we administered a reinforcement learning task that involves both direct le
256 supports the transition to exploitation on a reinforcement learning task with a spatially structured
257 retrospectively predicted performance on the reinforcement learning task, demonstrating that the bias
259 event-related potentials in humans during a reinforcement learning task, we show strong evidence in
264 al variability in behavioural responses to a reinforcement-learning task encompassing a novelty manip
265 umbens activation in a simple unidimensional reinforcement-learning task was not significantly affect
268 performance of two groups of participants on reinforcement learning tasks using a computational model
270 knowledge of chemistry and state-of-the-art reinforcement learning techniques (double Q-learning and
271 ices of older adults are better predicted by reinforcement learning than Bayesian inference, and that
275 Does learning in human observers comply with reinforcement learning theories, which describe how subj
279 re abnormal with respect to predictions from reinforcement learning theory were associated with lower
280 d prediction errors (RPEs), a key concept of reinforcement learning theory, are crucial to the format
285 is no doubt that social signals affect human reinforcement learning, there is still no consensus abou
286 rationalization and theory of mind, inverse reinforcement learning, thought experiments, and reflect
287 ncode reward prediction errors and can drive reinforcement learning through their projections to stri
288 ate their expectations after playing a DG by reinforcement learning to construct a model that explain
289 the DG and also to the wide applicability of reinforcement learning to explain many strategic interac
290 stinction between model-based and model-free reinforcement learning to investigate the unique and sha
294 of accumulation-to-bound decision models and reinforcement learning, we modeled the performance of hu
295 on, we extend our model with multi-objective reinforcement learning, which maximizes drug-likeness wh
296 f the environment by combining principles of reinforcement learning with accumulation-to-bound models
298 physiological (pupil dilation) signatures of reinforcement learning with eligibility trace across mul
299 n a single task, adapting a standard task of reinforcement learning with incidental episodic encoding
300 re we introduce an algorithm based solely on reinforcement learning, without human data, guidance or