000 | a | ||
---|---|---|---|
999 | _c30973 _d30973 | ||
008 | 220610b xxu||||| |||| 00| 0 eng d | ||
020 | _a9789811660450 | ||
082 | _a006.31 _bYEJ | ||
100 | _aYe, Jong Chul | ||
245 | _aGeometry of deep learning : _ba signal processing perspective | ||
260 | _aSingapore : _bSpringer, _c2022 | ||
300 | _axvi, 330 p. : _bill. ; _c25 cm | ||
365 | _b74.99 _cEUR _d86.00 | ||
490 | _aMathematics in industry, _x1612-3956 ; _vv.37 | ||
504 | _aIncludes bibliographical references and index. | ||
520 | _aThis book focuses on providing students with insights into geometry that can help them understand deep learning from a unified perspective. Rather than describing deep learning as an implementation technique, as many existing deep learning books do, it presents deep learning as the ultimate form of signal processing that can be imagined. To support this claim, an overview of classical kernel machine learning approaches is presented, and their advantages and limitations are explained. Following a detailed explanation of the basic building blocks of deep neural networks from a biological and algorithmic point of view, the latest tools such as attention, normalization, Transformer, BERT, and GPT-3 are described. Here, too, the emphasis is that behind the intuition of these heuristic approaches lies an important, beautiful geometric structure that enables a systematic understanding. A unified geometric analysis of the working mechanism of deep learning, based on high-dimensional geometry, is offered. Different forms of generative models, such as GAN, VAE, normalizing flows, and optimal transport, are then described from a unified geometric perspective, showing that they all arise from statistical distance-minimization problems. Because the book contains up-to-date information from both practical and theoretical points of view, it can be used as an advanced deep learning textbook in universities or as a reference for researchers interested in the latest deep learning algorithms and their underlying principles. In addition, the book has been prepared for a codeshare course for both engineering and mathematics students, so much of the content is interdisciplinary and will appeal to students from both disciplines. | ||
650 | _aFunctional Analysis | ||
650 | _aDifferential Geometry | ||
650 | _aArtificial Intelligence | ||
650 | _aMathematical Models | ||
650 | _aCognitive Processes | ||
650 | _aNeural Networks | ||
650 | _aMathematical and Computational Biology | ||
650 | _aActivation function | ||
650 | _aAlgorithmic robustness | ||
650 | _aBias-variance trade-off | ||
650 | _aConvex optimization | ||
650 | _aDeep convolutional framelets | ||
650 | _aEncoder-decoder CNN | ||
650 | _aFeature space | ||
650 | _aGradient descent method | ||
650 | _aKernel SVM | ||
650 | _aLoss surfaces | ||
650 | _aNeural tangent kernel (NTK) | ||
650 | _aPositive definite | ||
650 | _aRepresenter theorem | ||
650 | _aSigmoid function | ||
650 | _aTraining data | ||
650 | _aUniversal approximation theorem | ||
650 | _aVanishing gradient problem | ||
650 | _aWeight clipping | ||
942 | _2ddc _cBK | ||