• No results found

neural reinforcement learning system

Neural Logic Reinforcement Learning

Neural Logic Reinforcement Learning

... of reinforcement learning or generally all machine learning algorithms for system verification and ...AI system is safe and whether it complies with existing rules, ethnically or ...

10

Reinforcement learning in a large-scale photonic recurrent neural network

Reinforcement learning in a large-scale photonic recurrent neural network

... the system such that it performs the desired computation, typically achieved by modifying connection weights according to some learning rou- ...constrain learning- induced weight adjustment to the ...

5

Grammatical Error Correction with Neural Reinforcement Learning

Grammatical Error Correction with Neural Reinforcement Learning

... To capture fluency as well as grammaticality in evaluation on such references, we use GLEU as the reward. We have shown GLEU to be more strongly preferred than other GEC metrics by na- tive speakers (Sakaguchi et al., ...

7

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

The UMD Neural Machine Translation Systems at WMT17 Bandit Learning Task

... later reinforcement learning model, for which beam search cannot be ...adaptation system that we submit to the training server is the uniformly random combination of 6 systems, and their individual ...

7

Detection of online phishing email using dynamic evolving neural network based on reinforcement learning

Detection of online phishing email using dynamic evolving neural network based on reinforcement learning

... Aburrous and Khelifi (2013) proposed a fuzzy logic and data mining algorithm to de- tect phishing websites (Aburrous and Khelifi, 2013). The proposed model depends on 27-features that stamp the fraudulent websites and it ...

178

An Adaptive Controller using Radial Basis Function Neural Network with Reinforcement Learning

An Adaptive Controller using Radial Basis Function Neural Network with Reinforcement Learning

... using reinforcement learning is given to deal with conventional tracking control ...Actor-Critic learning is used to tune PID parameters in an adaptive way to take advantage of the ...

7

Reinforcement Learning based Curriculum Optimization for Neural Machine Translation

Reinforcement Learning based Curriculum Optimization for Neural Machine Translation

... We consider the problem of making efficient use of heterogeneous training data in neu- ral machine translation (NMT). Specifically, given a training dataset with a sentence-level feature such as noise, we seek an optimal ...

8

A Study of Reinforcement Learning for Neural Machine Translation

A Study of Reinforcement Learning for Neural Machine Translation

... t learning (RL) is an effective approach for improving the performance of neural machine translation (NMT) ...NMT system- s trained with source/target monolingual da- ...

10

Artificial intelligence as a means to facilitate mechanism design based negotiations

Artificial intelligence as a means to facilitate mechanism design based negotiations

... the system to the new ...machine learning, data mining and processing, deep neural networks, and reinforcement learning are still embracing weak AI because of its task-oriented ...

66

Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method

Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method

... Learning System Setup. Two actions are available to the learning controller, -10N and +10N. For training, initial starting positions for the cart are drawn randomly from [ − 2.3, 2.3], initial pole ...

12

Hybrid Code Networks: practical and efficient end to end dialog control with supervised and reinforcement learning

Hybrid Code Networks: practical and efficient end to end dialog control with supervised and reinforcement learning

... chine learning to dialog control. The first de- composes a dialog system into a pipeline, typ- ically including language understanding, dialog state tracking, action selection policy, and lan- guage ...

13

Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback

Reinforcement Learning for Bandit Neural Machine Translation with Simulated Human Feedback

... of learning to solve complex joint prediction problems (like parsing or machine translation) under a very lim- ited feedback model: a system must produce a sin- gle structured output ...“reference” ...

11

A Study of a Robotic Assembly System as a Collaborative Multi-Agent Organization

A Study of a Robotic Assembly System as a Collaborative Multi-Agent Organization

... assembly system as a multi-agent system opens a new way of designing intelligent and autonomous robotic ...collaborative system that can achieve social ...AI system is built from simple ...

11

Evolutionary Function Approximation for Reinforcement Learning

Evolutionary Function Approximation for Reinforcement Learning

... probabilistic reinforcement learn- ing task from the field of autonomic computing (Kephart and Chess, ...a reinforcement learning task that requires effective function ...continual learning on ...

41

Reinforcement Learning in Neural Networks: A Survey

Reinforcement Learning in Neural Networks: A Survey

... a system that has continuous dynamics in contrast to discrete-time MDP (Vamvoudakis, 2010; Hanselmann et ...2007). Learning prediction and learning control are the two main tasks in ...

19

Residual Reinforcement Learning using Neural Networks

Residual Reinforcement Learning using Neural Networks

... An example of TD becoming unstable can be found in the star problem illustrated in Figure 4.1. It shows six states with the value of each state given by a linear combination of two weights and each transition yields a ...

93

Road tracking using deep reinforcement learning for self driving car applications

Road tracking using deep reinforcement learning for self driving car applications

... applying reinforcement learning for tracking ob- jects ...tracking system controller by using a recurrent neural network combined with the rein- forcement ...utilised reinforcement ...

11

Adolescent-specific patterns of behavior and neural activity during social reinforcement learning

Adolescent-specific patterns of behavior and neural activity during social reinforcement learning

... of reinforcement and social cognition have shown increased sensitivity in affective-motivational circuitry in early versus late adolescence (Engelmann, Moore, Monica Capra, & Berns, 2012; Pfeifer & ...

47

Learning to Trade via Direct Reinforcement

Learning to Trade via Direct Reinforcement

... trading system remains long during the 1991 stock market correction associated with the Persian Gulf war, a political event, though the Q-Trader system is fortunately short during the ...Q-Trader ...

15

Algebraic Neural Architecture Representation, Evolutionary Neural Architecture Search, and Novelty Search in Deep Reinforcement Learning

Algebraic Neural Architecture Representation, Evolutionary Neural Architecture Search, and Novelty Search in Deep Reinforcement Learning

... which neural network architectures and their parameterizations are simultaneously ...[18]. Learning directly from raw pixels, on the other hand, had remained very challenging for RL researchers until the ...

93

Show all 10000 documents...

Related subjects