• No results found

Gradient Descent (GD)

COMPARISON OF SIMPLIFIED GRADIENT DESCENT ALGORITHMS FOR DECODING LDPC CODES

COMPARISON OF SIMPLIFIED GRADIENT DESCENT ALGORITHMS FOR DECODING LDPC CODES

... gradient descent formulation. The behaviour of the proposed algorithms can be explained from the viewpoint of the optimization of a non-linear objective function. Multi-bits flipping GD-BF algorithm gives ...

6

Certain Systems Arising In Stochastic Gradient Descent

Certain Systems Arising In Stochastic Gradient Descent

... In machine learning, processes satisfying (1.1.1) appear in stochastic gradient de- scent (SGD). First, to provide context, let us briefly introduce the gradient descent method (GD) and then see why ...

105

SAR Images Co registration Based on Gradient Descent Optimization

SAR Images Co registration Based on Gradient Descent Optimization

... traditional descent approaches depending on approximations of gradient's finite difference are quite ...step-size gradient descent, quasi-Newton, Powell-Brent, adaptive stochastic gradient ...

7

Gradient Descent Learns Linear Dynamical Systems

Gradient Descent Learns Linear Dynamical Systems

... assumption illustrated earlier, we show in Section 3 that the idealized risk is weakly quasi- convex (Lemma 3.3). Quasi-convexity implies that gradients cannot vanish except at the optimum of the objective function; we ...

44

The Implicit Bias of Gradient Descent on Separable Data

The Implicit Bias of Gradient Descent on Separable Data

... SVM for homogeneous linear predictors. This happens even though neither the norm kwk, nor the margin constraint, are part of the objective or explicitly introduced into optimization. More generally, we show the same ...

57

Stochastic Gradient Descent as Approximate Bayesian Inference

Stochastic Gradient Descent as Approximate Bayesian Inference

... Stochastic Gradient Descent with a constant learning rate (constant SGD) simulates a Markov chain with a stationary distribution. With this perspective, we derive several new results. (1) We show that ...

35

Calibrated Stochastic Gradient Descent for Convolutional Neural Networks

Calibrated Stochastic Gradient Descent for Convolutional Neural Networks

... stochastic gradient descent (SGD) and its variants, the op- timized gradient estimators may be as expensive to compute as the true gradient in many ...stochastic gradient descent ...

8

Utilization of Asynchronous Stochastic Gradient Descent with Additively Homomorphic Encryption

Utilization of Asynchronous Stochastic Gradient Descent with Additively Homomorphic Encryption

... Stochastic Gradient Descent (DPSGD) (Lian et ...Stochastic Gradient Descent ...sub gradient varieties of the decentralized stochastic enhancement calculation for arched ...

7

Predicting the Learning Rate of Gradient Descent for Accelerating Matrix Factorization

Predicting the Learning Rate of Gradient Descent for Accelerating Matrix Factorization

... The model parameters of MF are usually learned by means of numerical methods such as gradient descent, given that the loss function is non-convex. The learning rate of gradient descent is ...

10

A Novel Distributed Variant of Stochastic Gradient Descent and Its Optimization

A Novel Distributed Variant of Stochastic Gradient Descent and Its Optimization

... Abstract. In the age of big data, large-scale learning problems become increasingly significant. Distributed machine learning algorithms thus draw a lot of interest, particularly those based on Stochastic Gradient ...

7

Stochastic Gradient Descent using Linear Regression with Python

Stochastic Gradient Descent using Linear Regression with Python

... In fixed learning rate đťś‚ is implemented in stochastic gradient descent by an adaptive learning rule which decreases over period. This is especially applicable when large data is processed which has to adapt ...

6

Making Asynchronous Stochastic Gradient Descent Work for Transformers

Making Asynchronous Stochastic Gradient Descent Work for Transformers

... Asynchronous stochastic gradient descent (SGD) converges poorly for Transformer mod- els, so synchronous SGD has become the norm for Transformer training. This is unfortu- nate because asynchronous SGD is ...

10

Non-Linear Gradient Descent Algorithm for Smart Antennas

Non-Linear Gradient Descent Algorithm for Smart Antennas

... Nonlinear Gradient Descent algorithm (NGD) is an iterative method that is given an initial point and follows the negative of the gradient in order to move the point towards a critical point, which is ...

7

Optimal Control of Microgrid Networks Using Gradient Descent and Differential Evolution Methods

Optimal Control of Microgrid Networks Using Gradient Descent and Differential Evolution Methods

... Steepest Descent method, Newton method and Differential Evolution ...of gradient descent methods where as the differential evolution is an Evolutionary ...The gradient descent methods ...

7

Online Bootstrap Confidence Intervals for the Stochastic Gradient Descent Estimator

Online Bootstrap Confidence Intervals for the Stochastic Gradient Descent Estimator

... stochastic gradient descent (SGD) is a scalable algorithm to compute parameter estimates and has gained increasing popularity due to its numerical convenience and memory ...

21

Accelerated Gradient Descent Using Instance Eliminating Back Propagation

Accelerated Gradient Descent Using Instance Eliminating Back Propagation

... Abstract: Artificial Intelligence is dominated by Artificial Neural Networks (ANNs). Currently, the Batch Gradient Descent (BGD) is the only solution to train ANN weights when dealing with large datasets. ...

8

The Effect of Adaptive Gain and Adaptive Momentum in Improving Training Time of Gradient Descent Back Propagation Algorithm on Classification Problems

The Effect of Adaptive Gain and Adaptive Momentum in Improving Training Time of Gradient Descent Back Propagation Algorithm on Classification Problems

... In recent years, a number of research studies have attempted to overcome these problems. These involved the development of heuristic techniques, based on studies of properties of the conventional back propagation ...

7

A New Rule-weight Learning Method based on Gradient Descent

A New Rule-weight Learning Method based on Gradient Descent

... Abstract— In this paper, we propose a simple and efficient method to construct an accurate fuzzy classification system. In order to optimize the generalization accuracy, we use rule- weight as a simple mechanism to tune ...

6

mm Wave channel estimation with accelerated gradient descent algorithms

mm Wave channel estimation with accelerated gradient descent algorithms

... Accelerated gradient descent with adaptive restart; AWGN: Additive white Gaussian noise; BPDN: Basis pursuit denoise; BS: Base station; CBP: Continuous basis pursuit; CS: Compressed sensing; DFT: Discrete ...

17

SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent

SGD-QN: Careful Quasi-Newton Stochastic Gradient Descent

... Stochastic Gradient Descent yields particularly effective algorithms when the input patterns are very sparse, taking less than O (d) space and time per iteration to optimize a system with d ...Natural ...

18

Show all 4128 documents...

Related subjects