Result: Linear and Deep Models Basics with Pytorch, Numpy, and Scikit-Learn

Title:

Linear and Deep Models Basics with Pytorch, Numpy, and Scikit-Learn

Authors:

Priam, Rodolphe

Contributors:

University of Southampton

Publisher Information:

HAL CCSD, 2022.

Publication Year:

2022

Subject Terms:

pytorch, scikit-learn, numpy, algorithm, linear models, deep learning, [INFO.INFO-AI]Computer Science [cs], Artificial Intelligence [cs.AI], [INFO.INFO-LG]Computer Science [cs], Machine Learning [cs.LG], [STAT.CO]Statistics [stat], Computation [stat.CO]

Original Identifier:

HAL: hal-03954166

Document Type:

Book

Language:

English

ISBN:

979-83-7144-157-7

Availability:

https://inria.hal.science/hal-03954166

Rights:

URL: http://hal.archives-ouvertes.fr/licences/copyright/

Accession Number:

edshal.hal.03954166v1

Database:

HAL

Further Information

This book is an introduction to computational statistics for generalized linear models (GLMs) and to machine learning with the Python language. The GLM is extended with nonlinearities through the hidden layer(s) of a neural network, for linear and nonlinear regression or classification. This allows classical statistics and current deep learning to be presented side by side. The log-likelihoods and the corresponding loss functions are explained. The gradient and the Hessian matrix are discussed and implemented for these linear and nonlinear models. Several methods are implemented from scratch with NumPy for prediction (linear, logistic, and Poisson regression) and for dimension reduction (principal component analysis, random projection). The gradient descent, Newton-Raphson, natural gradient, and L-BFGS algorithms are implemented. The datasets considered have 10 to 10^7 rows and are tabular, with images or texts vectorized. The data are stored in a compressed format (memmap or HDF5) and loaded in chunks for several case studies with PyTorch or scikit-learn. PyTorch is presented for training with minibatches via a generic implementation for studying with computer programs. Scikit-learn is presented for processing large datasets via partial fit, after the small examples. Sixty exercises are proposed at the ends of the chapters, with selected solutions, to go beyond the contents. Code available at https://github.com/rpriam/book1
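As an illustration of the kind of from-scratch implementation the abstract describes, the following is a minimal sketch of logistic regression fitted with Newton-Raphson and with gradient descent in NumPy. It is not taken from the book or its repository: the function names, the synthetic data, the small ridge term added to the Hessian, and the step size are all assumptions chosen for this example.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def neg_loglik(w, X, y):
    """Negative Bernoulli log-likelihood (logistic loss), summed over rows."""
    z = X @ w
    return np.sum(np.logaddexp(0.0, z) - y * z)

def gradient(w, X, y):
    """Gradient of the negative log-likelihood with respect to w."""
    return X.T @ (sigmoid(X @ w) - y)

def hessian(w, X):
    """Hessian of the negative log-likelihood: X^T diag(p(1-p)) X."""
    p = sigmoid(X @ w)
    return (X * (p * (1.0 - p))[:, None]).T @ X

def newton_raphson(X, y, n_iter=25, ridge=1e-8):
    """Newton-Raphson updates; `ridge` is an assumed numerical safeguard."""
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        H = hessian(w, X) + ridge * np.eye(X.shape[1])
        w = w - np.linalg.solve(H, gradient(w, X, y))
    return w

def gradient_descent(X, y, lr=0.1, n_iter=2000):
    """Plain gradient descent on the mean loss with a fixed step size."""
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        w = w - lr * gradient(w, X, y) / len(y)
    return w

# Synthetic data (hypothetical): an intercept column plus two features.
rng = np.random.default_rng(0)
X = np.column_stack([np.ones(500), rng.normal(size=(500, 2))])
w_true = np.array([-0.5, 1.0, -2.0])
y = (rng.random(500) < sigmoid(X @ w_true)).astype(float)

w_newton = newton_raphson(X, y)
w_gd = gradient_descent(X, y)
print("Newton-Raphson:  ", w_newton)
print("Gradient descent:", w_gd)
```

Both routines minimize the same loss, so on a small, well-conditioned problem like this one they should reach approximately the same coefficients; Newton-Raphson typically needs far fewer iterations because it uses the curvature information in the Hessian.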
