Deeper insights into neural nets with random weights

IRIS

In this work, the “effective dimension” of the output of the hidden layer of a one-hidden-layer neural network with random inner weights of its computational units is investigated. To do this, a polynomial approximation of the sigmoidal activation function of each computational unit is used, whose degree is chosen based both on a desired upper bound on the approximation error and on an estimate of the range of the input to that computational unit. This estimate of the range is parameterized by the number of inputs to the network and by an upper bound both on the size of the random inner weights of the network and on the size of its inputs. The results show that the Root Mean Square Error (RMSE) on the training set is influenced by the effective dimension and by the quality of the features associated with the output of the hidden layer.

Deeper insights into neural nets with random weights

Ming Li;Giorgio Gnecco;Marcello Sanguineti

2022

Abstract

In this work, the “effective dimension” of the output of the hidden layer of a one-hidden-layer neural network with random inner weights of its computational units is investigated. To do this, a polynomial approximation of the sigmoidal activation function of each computational unit is used, whose degree is chosen based both on a desired upper bound on the approximation error and on an estimate of the range of the input to that computational unit. This estimate of the range is parameterized by the number of inputs to the network and by an upper bound both on the size of the random inner weights of the network and on the size of its inputs. The results show that the Root Mean Square Error (RMSE) on the training set is influenced by the effective dimension and by the quality of the features associated with the output of the hidden layer.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2022
			
	Codice ISBN
	
				978-3-030-97546-3
			
	Parole chiave
	
				Neural networks with random weights, Hyperbolic tangent, Polynomial approximation, Effective dimension, Approximate rank
			
	Appare nelle tipologie:
	
				2.1 Contributo in volume (Capitolo o Saggio)

File in questo prodotto:

File	Dimensione	Formato
AJCAI2021Paper21CameraReadyVersion.pdf Open Access dal 19/03/2023 Tipologia: Documento in Post-print Licenza: Creative commons Dimensione 1.62 MB Formato Adobe PDF Visualizza/Apri	1.62 MB	Adobe PDF	Visualizza/Apri
Li2022_Chapter_DeeperInsightsIntoNeuralNetsWi.pdf non disponibili Tipologia: Versione Editoriale (PDF) Licenza: Nessuna licenza Dimensione 687.14 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	687.14 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11771/21084

Citazioni

ND

2

social impact