Popis předmětu

Hlavní stránka | Seznam oborů/specializací | Seznam všech skupin předmětů | Seznam všech předmětů | Seznam rolí Návod

Předmět je základní jednotka výuky, jejímž prostřednictvím si student osvojí ucelenou část souboru znalostí a dovedností, potřebnou pro zvládnutí studijního oboru/specializace. Za věcný obsah předmětu zodpovídá garant předmětu. Časovou náročnost předmětu zhruba vyjadřuje atribut předmětu rozsah kontaktní výuky. Například rozsah = 2+2 značí, že předmět bude mít týdně dvě hodiny přednášek a dvě hodiny cvičení týdně. Na závěr semestru musí vyučující provést vyhodnocení, nakolik si ten který student osvojil poznatky a dovednosti, kterých měl během výuky nabýt. Jakým způsobem toto hodnocení vyučující provedou určuje atribut způsob zakončení. U předmětu lze definovat, že předmět je zakončen pouze zápočtem(Z), klasifikovaným zápočtem(KZ), pouze zkouškou(ZK), nebo zápočtem a zkouškou(Z,ZK). Náročnost úspěšného absolvování předmětu je vyjádřena ECTS kreditními body. Výuka předmětu probíhá během semestru. Opakovaně se předmět vyučuje vždy v zimním(Z), nebo v letním(L) semestru každého akademického roku. Výjimečně může předmět být nabízen studentům v obou semestrech(Z,L). Za organizační zajištění výuky zodpovídá přiřazená katedra, která zejména vytvoří časový rozvrh předmětu a zajistí pro předmět vyučující. Někteří přednáší a zkouší, jiní vedou cvičení a udělují zápočty.
Obsahová náplň a další organizační informace, týkající se předmětu je popsána pomocí různých popisných textů(anotace, týdenní osnova, literatura, apod.)
$DODATEK_POPIS

MIE-MZI	Mathematics for data science			Rozsah kontaktní výuky:	2P+1C
Vyučující:				Způsob zakončení:	Z,ZK
Zodpovědná katedra:	18105	ECTS Kredity:	4	Semestr:	L

Anotace:
In this course, the students are introduced to the domains of mathematics necessary for understanding the standard methods and algorithms used in data science. The studied topics include mainly: linear algebra (matrix factorisations, eigenvalues, diagonalization), continuous optimisation (optimisation with constraints, duality principle, gradient methods) and selected notions from probability theory and statistics.

Osnovy přednášek:

1) Mathematical formulation of regression and classification problem.

2) Geometrical view of linear regression model and least squares method (LS).

3) Computing the LS estimate (QR decomposition of a matrix).

4) Hypothesis tests for linear model, model validation.

5) Variable subset selection: ridge regression, best-subset selection, etc.

7) Singular value decomposition and its connection with ridge regression.

8) [2] Principal component analysis and dimensionality reduction.

10) Linear regression and classification.

11) Logistic regression.

12) Local regression and smoothing methods (splines, kernels).

13) [2] Support vector machines.

Osnovy cvičení:

1) Least squares method.

2) Matrix factorisation and matrix eigenvalues.

3) Usage of linear regression and related methods.

4) Principal component analysis.

5) Logistic regression.

6) Support vector machines.

Literatura:

1. Christopher Bishop, Pattern Recognition and Machine Learning, Springer-Verlag New York (2006), ISBN 978-0-387-31073-2

2. Trevor Hastie, Robert Tibshirani, Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer (2011), ISBN 978-0387848570.

Požadavky:
Knowledge of basic notions of linear algebra and matrix theory, basics of probability theory, course MIE-MPI: Mathematics for informatics.

Information about the course and courseware are available at https://courses.fit.cvut.cz/MI-MZI/

Předmět je zahrnut do těchto studijních plánů:

Plán Obor Role Dop. semestr

NIE-SI.21 Software Engineering 2021 V 2

NIE-TI.21 Computer Science 2021 V 2

NIE-DBE.2023 Digital Business Engineering V 2

NIE-NPVS.21 Design and Programming of Embedded Systems 2021 V 2

NIE-PSS.21 Computer Systems and Networks 2021 V 2

NIE-PB.21 Computer Security 2021 V 2

Stránka vytvořena 25. 4. 2024, semestry: L/2019-20, Z/2022-3, Z/2021-2, Z/2019-20, Z/2020-1, L/2022-3, Z/2024-5, L/2020-1, Z,L/2023-4, L/2021-2, připomínky k informační náplni zasílejte správci studijních plánů

Návrh a realizace: J. Novák, I. Halaška

1)		Mathematical formulation of regression and classification problem.
2)		Geometrical view of linear regression model and least squares method (LS).
3)		Computing the LS estimate (QR decomposition of a matrix).
4)		Hypothesis tests for linear model, model validation.
5)		Variable subset selection: ridge regression, best-subset selection, etc.
7)		Singular value decomposition and its connection with ridge regression.
8)		[2] Principal component analysis and dimensionality reduction.
10)		Linear regression and classification.
11)		Logistic regression.
12)		Local regression and smoothing methods (splines, kernels).
13)		[2] Support vector machines.

1)		Least squares method.
2)		Matrix factorisation and matrix eigenvalues.
3)		Usage of linear regression and related methods.
4)		Principal component analysis.
5)		Logistic regression.
6)		Support vector machines.

1.		Christopher Bishop, Pattern Recognition and Machine Learning, Springer-Verlag New York (2006), ISBN 978-0-387-31073-2
2.		Trevor Hastie, Robert Tibshirani, Jerome Friedman, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, Springer (2011), ISBN 978-0387848570.