• No results found

stochastic gradient

Online Bootstrap Confidence Intervals for the Stochastic Gradient Descent Estimator

Online Bootstrap Confidence Intervals for the Stochastic Gradient Descent Estimator

... efficiency, stochastic gradient descent (Robbins and Monro 1951)[SGD] is a scalable algorithm for parameter estimation and has recently drawn a great deal of ...the gradient of the objective function ...

21

Communication-Efficient Stochastic Gradient MCMC for Neural Networks

Communication-Efficient Stochastic Gradient MCMC for Neural Networks

... as Stochastic Gradient Markov Chain Monte Carlo (SG-MCMC) offer an elegant framework to rea- son about model uncertainty in neural ...for gradient computations, while the mas- ter collects the final ...

8

Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations

Stochastic Modified Equations and Dynamics of Stochastic Gradient Algorithms I: Mathematical Foundations

... using stochastic differential equations to study the precise dynamical properties of stochastic gradient descent are the independent works of Li et ...the stochastic algorithms as well as ...

47

Stochastic gradient optimization of importance sampling for the efficient simulation of digital communications systems

Stochastic gradient optimization of importance sampling for the efficient simulation of digital communications systems

... Stochastic Gradient Optimization of Importance Sampling for the Efficient Simulation of Digital Communication Systems 1.. Wael A.[r] ...

26

Stochastic Gradient MCMC for Nonlinear State Space Models

Stochastic Gradient MCMC for Nonlinear State Space Models

... series. Stochastic gradient MCMC methods have been developed to scale inference for hidden Markov models (HMMs) and linear SSMs using buffered stochastic gradient estimates to account for ...

27

Stochastic Gradient Descent using Linear Regression with Python

Stochastic Gradient Descent using Linear Regression with Python

... This paper is organized in as drafted below in section 1 features of python programming. Section 2 stochastic Gradient Descent is discussed. In section 3 Linear Regression concepts. Section 4 describes the ...

6

Making Asynchronous Stochastic Gradient Descent Work for Transformers

Making Asynchronous Stochastic Gradient Descent Work for Transformers

... Asynchronous stochastic gradient descent (SGD) converges poorly for Transformer mod- els, so synchronous SGD has become the norm for Transformer training. This is unfortu- nate because asynchronous SGD is ...

10

Calibrated Stochastic Gradient Descent for Convolutional Neural Networks

Calibrated Stochastic Gradient Descent for Convolutional Neural Networks

... In stochastic gradient descent (SGD) and its variants, the op- timized gradient estimators may be as expensive to compute as the true gradient in many ...calibrated stochastic ...

8

Stochastic Gradient Descent as Approximate Bayesian Inference

Stochastic Gradient Descent as Approximate Bayesian Inference

... Stochastic Gradient Descent with a constant learning rate (constant SGD) simulates a Markov chain with a stationary ...analyze stochastic-gradient MCMC algorithms. For ...

35

Katyusha: The First Direct Acceleration of Stochastic Gradient Methods

Katyusha: The First Direct Acceleration of Stochastic Gradient Methods

... primal-only stochastic gradient method to fix this ...finite-sum stochastic optimization, Katyusha has an optimal accelerated convergence rate, and enjoys an optimal parallel linear speedup in the ...

51

Optimal Rates for Multi-pass Stochastic Gradient Methods

Optimal Rates for Multi-pass Stochastic Gradient Methods

... incremental gradient are analyzed in (Rosasco and Villa, 2015), for a cyclic, rather than a stochastic, choice of the gradients and with no ...batch gradient descent sequence, see Subsection ...and ...

47

Utilization of Asynchronous Stochastic Gradient Descent with Additively Homomorphic Encryption

Utilization of Asynchronous Stochastic Gradient Descent with Additively Homomorphic Encryption

... Parallel Stochastic Gradient Descent (DPSGD) (Lian et ...process stochastic angles locally and in the meantime normal its nearby model with its ...figured stochastic slopes are refreshed into ...

7

A Novel Distributed Variant of Stochastic Gradient Descent and Its Optimization

A Novel Distributed Variant of Stochastic Gradient Descent and Its Optimization

... set. Gradient Decent(GD) is a common method to solve this optimization ...Thus, Stochastic Gradient Decent is proposed, which only uses one random sample to compute an estimated ...requires ...

7

Asynchronous Proximal Stochastic Gradient Algorithm for Composition Optimization Problems

Asynchronous Proximal Stochastic Gradient Algorithm for Composition Optimization Problems

... traditional stochastic gradient descent (SGD) algo- rithm and its variants either have low convergence rate or are computationally ...several stochastic composition gradient algorithms have ...

8

SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent

SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent

... plain Stochastic Gradient Descent yields particularly effective algorithms when the input patterns are very sparse, taking less than O (d) space and time per iteration to optimize a system with d ...

18

New Convergence Aspects of Stochastic Gradient Algorithms

New Convergence Aspects of Stochastic Gradient Algorithms

... the stochastic gradient is uniformly ...that stochastic gradients are bounded with respect to the true gradient ...for stochastic problems arising in machine learning such bound always ...

49

Consistency and fluctuations for stochastic gradient Langevin dynamics 

Consistency and fluctuations for stochastic gradient Langevin dynamics 

... the stochastic estimates to the gradient above described, this leads to the stochastic gradient Hamiltonian Monte Carlo algorithm of (Chen et ...

35

Consistency and Fluctuations For Stochastic Gradient Langevin Dynamics

Consistency and Fluctuations For Stochastic Gradient Langevin Dynamics

... This paper is concerned with stochastic gradient Langevin dynamics (SGLD), an alter- native approach proposed by Welling and Teh (2011). This follows the opposite route and chooses to completely avoid the ...

33

Stochastic gradient Markov chain Monte Carlo

Stochastic gradient Markov chain Monte Carlo

... the stochastic gradient MCMC ...mixed-member stochastic block model, which uses both the block structure of the model, and stratified subsampling techniques, to give unbiased gradient ...

31

Certain Systems Arising In Stochastic Gradient Descent

Certain Systems Arising In Stochastic Gradient Descent

... Stochastic approximations arise naturally in many different contexts. Some early results were published by [Rup88] and [PJ92]. There, they dealt with averaged stochastic gradient descent (ASGD) ...

105

Show all 7829 documents...

Related subjects